Forensic Language Analysis
Forensic Language Analysis
Forensic Language Analysis
The Parliamentary Office of Science and Technology, Westminster, London SW1A 0AA T 020 7219 2840 E post@parliament.uk www.parliament.uk/post
POSTnote 509 September 2015 Forensic Language Analysis Page 2
Linguistics, Phonetics and Criminal Justice Box 3. Forensic Science Regulation and The Expert Witness
Use in the Criminal Justice System Regulation
In criminal investigations, expert linguists or phoneticians The Forensic Science Regulator (FSR) seeks to ensure that all
may be engaged by the police, Crown Prosecution Service forensic science used in the criminal justice system meets quality
or defence. Since its creation in 2008, experts from the standards,39 but it does not have statutory powers. The Science and
Technology Select Committee supported statutory regulation in
Centre for Forensic Linguistics have provided more than 2013,40 which the Home Office is reviewing.41 Guidance on legal
500 evidential and investigative reports. 9Error! Bookmark obligations and codes of practice and conduct for the profession are
not defined. J.P. French Associates, the largest private UK produced by the FSR. Forensic phonetics is classed by the FSR as a
forensic phonetics laboratory, works on approximately 200 branch of digital forensics, and organisations performing analyses and
cases per year. laboratory procedures (rather than the experts themselves) must be
accredited by October 2017.42 Forensic linguistics is not recognised
by the FSR as forensic science as it argues that analyses are
Specific Procedures
subjective. However, forensic linguists must conform to Ministry of
Expert sociolinguistic profiling of a specimen of writing by an Justice expectations laid out in the Criminal Procedure Rules.43 The
unknown author can help to reduce the pool of suspects.33 International Association of Forensic Linguists44 and the International
Though used in investigations, it is not used evidentially in Association for Forensic Phonetics and Acoustics45 have codes of
the UK. Where a text has been sent from the mobile phone practice, but these are guidelines and are not enforced.
of a person who is missing and presumed dead, an expert The Expert Witness
analyses the texts of interest and reports on their similarity The Ministry of Justice’s Criminal Procedure Rules46 and Criminal
to known texts from the missing person, suspect or Practice Directions47 specify who can act as an expert witness and
someone else (Box 1, bullet 2). The police engage experts what can be admitted as expert evidence. The CPS also provides
guidance for experts and on expert evidence. 48,49 Its guiding
to carry out speaker profiling when a recording of an principles50 are in line with those set down by the Forensic Science
offender’s voice exists but there is as yet no suspect. This Regulator. Admissibility is determined by common law:51 R v Turner
procedure is used in UK investigations, but is not admissible [1975] set the precedent that expert evidence must be ‘outside of the
as evidence, unlike in other parts of Europe. If there is an experience of a judge or jury’.52 In 2011, the Law Commission
earwitness to a crime, the police can use a voice parade (a reviewed the use of expert evidence and recommended
line-up of voice recordings including the suspect’s). A 2003 standardisation but this was not taken up by Government. 53,54
Home Office circular advises that police should collaborate
with phoneticians who apply techniques of speaker science raise expectations.3 This presents two problems:
comparison when selecting voices.34 When there is doubt jurors expect certain procedures to be possible which
over who was speaking in a criminal interaction and there experts assert are not, such as personality analysis,
are several suspects, the CPS or defence engage an expert determining truth and falsity, and assessing threat in
analyst, who presents their findings as evidence. speech intonation (although this is a research interest).55
jurors expect conclusions to be presented with certainty.
One current research interest in linguistics is in police
interviews in rape and sexual assault cases. The CPS often Expressing Conclusions
decides not to prosecute following the initial police Some phoneticians and linguists are frustrated by the
investigation.35 In cases where consent is an issue, suspect pressure to present numerical conclusions and express
and victim interviews are particularly important sources of certainty, since these are not generally possible.2
evidence. Research indicates that current interviewing Expressing conclusions so that judges and juries can
practices may not enable the best quality evidence to be understand and evaluate them has been highlighted as an
elicited.36 Training developed in collaboration with linguists issue across forensic science.56,57,58 There are concerns as
could improve this. to how understandable probability statistics are and how
and whether it is possible to convert them accurately to
Concerns about use in Criminal Investigations qualitative conclusions.
Academic researchers and practitioners have four main
areas of concern. Technical Limitations
Some academics and practitioners highlight limitations in
Regulation of Expertise various procedures used in criminal investigations. They
The main concern is that substandard expertise may lead to assert that whilst sociolinguistic profiling is used in
miscarriages of justice. Although it is not known whether this investigations, it cannot and should not be submitted as
has happened in the UK, a person was wrongly jailed in Italy evidence.59 Voice parades are deemed effective by
because of errors in the forensic phonetic report.37,38 Since phoneticians but are lengthy and costly.60 Academics are
there is no statutory regulation or accreditation of experts in researching how to streamline the design and use of voice
the UK (Box 3), individuals with inadequate expertise can parades.61 Some psychologists are sceptical about voice
present themselves as experts. parades because of the complex nature of memory of
voice.62 They consider voice recognition less reliable than
Understanding Limitations face recognition, the reliability of which is itself uncertain.63
Researchers and practitioners highlight the danger of the
‘CSI effect’ whereby TV and film representations of forensic
POSTnote 509 September 2015 Forensic Language Analysis Page 4
Asylum and Language Analysis Box 4. Language Analysis in Refugee Status Determination
If there is doubt surrounding an asylum applicant’s claimed When someone seeks asylum, the Home Office takes the applicant’s
origin, the Home Office carries out several procedures details and checks if he or she has already sought asylum.84 The
including Language Analysis (LA) to test the claim (Box 4). applicant is then interviewed. Since August 2014, the Swedish
Claimed nationalities associated with a high number of company Verified is the principal LA provider, with the Swedish
company Sprakab providing secondary support. An in-house analyst,
fraudulent asylum applications (currently Palestine, Syria
who is a native speaker of the applicant’s language, interviews the
and Kuwait)64 are tested systematically. As well as detecting applicant by telephone and then, with a linguist, assesses the
fraud in asylum claims, the Home Office’s rationale for using linguistic behaviour against predetermined criteria. They produce a
LA includes speeding up application processing, cutting report concluding how consistent the applicant’s speech is with
costs and deterring fraudulent claims.65 However, a Home speech in the claimed place of origin or former places of residence.
Office report (2011)65 concluded that it is not possible to banks). A Government Digital Service-sponsored project is
know whether LA is a deterrent. Between 2008-2010, 2,198 looking into the potential use of ASV for telephone access to
LAs were carried out at a cost of £174,000.66 Home Office government services and to confirm online identity.
data on more recent costs and cost savings is not available.
Automatic Speaker Recognition (ASR)
Issues of using LA in Determining Refugee Status This is used to identify whether one specimen of speech
Although phoneticians and linguists support the theoretical consistently closely resembles another (for example from a
concept of language analysis67,68 some,69,70 along with database) to help determine whether it is likely to be the
social scientists71,72 and legal practitioners,73,74 criticise same person speaking. ASR technology was used to
procedures that attempt to determine nationality or origin identify a perpetrator of the 2006 Madrid airport bombing85
such as LA. They all assert that the relationship between and has been used by UK security agencies since 2007.
language and nationality or origin is complex; language ASR output is used evidentially in 35 countries86 but not in
does not always map neatly onto geographical the UK. J.P. French Associates is carrying out research with
boundaries.69 Displacement (often repeated) is a common ASR with a view to integrating it into future practice.
experience for refugees, which often affects language.75
Issues with ASV and ASR
Standards, Methods and Practice
ASV and ASR give numerical conclusions but the inherent
LA practitioners do not have to conform to the standards for
variability in voice means 100% certainty is impossible. ASV
experts in the criminal justice system. There are calls to
and ASR are usually used with other personal data to
raise the standard of expertise.76 Unlike some other
increase reliability. In simulations, error rates are low (in the
European countries (such as Norway) the UK, does not
range of 1.5-2.6%)87,88 but real data may be of poorer
have statutory minimum requirements. Internationally
quality and environmental and cross-channel distortions (if
authored non-statutory guidelines (2004)67 exist, but
one sample comes from a telephone and another from
academics argue that they need updating.77 Practitioners
Skype) make the technology less successful. Fraudulent
wrote a set of ‘minimal requirements’ in 2008 but they were
access using stolen voice recordings is a technical concern
not pursued.77 There are varying practices across Europe:
that industry is addressing: the company Agnitio reports
some agencies use a ‘specialised linguist method’; others
detecting up to 99% of spoofs.89 Improving reliability is a
use a trained native speaker and a linguist.77 There has
research interest. Standardisation of voice biometrics is not
been no independent analysis of methodologies, though
as advanced as other biometrics; however, an International
studies indicate that procedures can be accurate.78 The lack
Standards Organisation code of practice for implementing
of research means LA’s reliability is unknown.79,77 The
biometric systems is expected in 2016.90
ESRC funded research to address this, but a lack of access
to some providers’ data prevented any meaningful
Intelligence and Counter-Terrorism
analysis.80,81 Academics criticise the lack of methodological
The security services use technologies and engage experts
transparency in the field.77 The practices of Sprakab (the
to carry out procedures outlined earlier, as well as:91
primary commercial provider until August 2014) were
Infiltration and disruptive policing: forensic linguists
criticised in a 2014 UK Supreme Court judgment in which
are training West Midlands Police in assuming online
guidance for the role of the analyst was laid out.82 The
identities to infiltrate paedophile networks. The ESRC is
Home Office has addressed some of the issues identified.
funding research (£400k) into methodologies for this.92
Analysing digital personas: computer scientists are
Biometrics: Verification and Identification researching technology to develop a language ‘fingerprint’
The use of voice in biometric recognition offers new
of online personas.11 These are of interest to law
opportunities in various domains.
enforcement agencies as a way to identify cybercriminals.
Automatic Speaker Verification (ASV) Speaker Identification Integrated Project: in response
ASV uses voice instead of a password as a means of to a European Commission call93 (€15m funding), a
verifying identity to access a system. In the UK it is used consortium of 17 European partners is developing
commercially, for example by Santander and Barclays83 and transnational technology to identify criminals.94
will be used by Atom Bank (one of the first internet-only UK-
POST is an office of both Houses of Parliament, charged with providing independent and balanced analysis of policy issues that have a basis in science and technology.
POST is grateful to Sarah Foxen for researching this briefing, to the Arts & Humanities Research Council for funding her parliamentary fellowship, and to all contributors
and reviewers. For further information on this subject, please contact the co-author, Dr Sarah Bunn. Parliamentary Copyright 2015. Image copyright iStockphoto
POSTnote 509 September 2015 Forensic Language Analysis Page 5
Endnotes
1 Oxford English Dictionary.
2 Grant, T. & Perkins, R. 2013. Forensic Linguistics. In J. A. Siegel & P.J. Saukko 38 ‘Fast-Track Injustice’ documentary highlights the case for European Arrest
(eds.). Encyclopedia of Forensic Sciences, 2nd edn. 174-177 Warrant reform. 2014.
3 Watt, D. 2010. The Identification of the Individual through Speech. In C. Llamas 39 Forensic Science Regulator: What we do.
& D. Watt (eds.). Language and Identities, Edinburgh: Edinburgh University 40 Science and Technology Committee, Forensic Science, Second Report of
Press, 84-85 Session 2013-2014. Vol I. (HC610), Paragraph 50.
4 Home Office: Language Analysis, Instruction. 41 Home Office: Consultation on New statutory powers for the Forensic Science
5 Irish Bank Resolution Corporation Ltd & ors v Quinn & ors [2015] IEHC 175. Regulator. November 2013.
6 Litigation Futures. 2015. Predictive coding technology validated in landmark 42 Forensic Science Regulator: Codes of Practice and Conduct: for Forensic
ruling Science Providers and Practitioners in the Criminal Justice System. Version 2,
7 Turnitin: Our company. p. 4.
8 R v Uter, Abouakkour, Roberts, Wood and Williams. 2014. 43 Criminal Procedure Rules: Part 33: Expert Evidence. 2014..
9 Centre for Forensic Linguistics: Consulting 44 International Association of Forensic Linguists: Code of Practice.
10 Coulthard, M. Grant, T & Kredens, K. 2010. Forensic Linguistics. In R. Wodak,
45 International Association for Forensic Phonetics and Acoustics: Code of
B. Johnstone, & P. Kerswill (eds.). Handbook of Applied Linguistics, Thousand Practice.
Oaks and London: SAGE Publications 46 Ministry of Justice, Criminal Procedure Rules 2014.
11 Rashid, A., Baron, A. Rayson, P., May-Chahal, C. Greenwood, P. &
47 Courts and Tribunals Judiciary. Criminal Practice Directions 2014.
Walkerdine, J. 2013. Who Am I? Analyzing Digital Personas in Cybercrime 48 Crown Prosecution Service, Guidance Booklet for Experts 2010.
Investigations. IEEE Computer 46(4), 54-61. 49 Crown Prosecution Service, Guidance on Expert Evidence 2014.
12 Prentice, S., Rayson, P. & Taylor, P. J. 2012. The language of Islamic
50 Crown Prosecution Service: Key Requirements for Forensic Science Providers.
extremism: Towards an Automated Identification of Beliefs, Motivations and 51 Forensic Science Regulator: Legal Obligations. Issue 3.
Justifications, International Journal of Corpus Linguistics, 17(2), 259-286. 52 R v Turner [1975] 1 All ER 70.
13 French, J.P. & Harrison, P. 2006. Investigative and Evidential Applications of
53 The Law Commission: Expert Evidence in Criminal Proceedings in England
Forensic Speech Science. In A. Heaton-Armstrong, E. Shepherd, G. and Wales. 2011. No. 325.
Gudjonsson & D. Wolchover (eds.) Witness Testimony: Psychological, 54 Ministry of Justice, The Government’s Response to the Law Commission
Investigative and Evidential Perspectives. Oxford: Oxford University Press. Report: ‘Expert Evidence in Criminal Proceedings in England and Wales’, 2013.
14 Foulkes, P. & French, P. 2012. Forensic Speaker Comparison: a Linguistic-
55 Watt, D. Kelly, S. & Llamas, C. 2013. Inference of Threat from Neutrally-
Acoustic Perspective. In L. Solan & P. Tiersma (eds.) The Oxford Handbook of Worded Utterances in Familiar and Unfamiliar Languages. York Papers in
Language and Law. Oxford: Oxford University Press, 418-421. Linguistics, Series 2. Issue 13.
15 Position Statement Concerning Use of Impressionistic Likelihood Terms in
56 Lynch, M. & McNally, R. 2003. ‘Science’, ‘Common Sense’, and DNA Evidence:
Forensic Speaker Comparison Cases. a Legal Controversy about the Public Understanding of Science. Public
16 European Network of Forensic Science Institutes. 2010. ENSFI Guideline for
Understanding of Science, 83-103.
Evaluative Reporting in Forensic Science: Strengthening the Evaluation of 57 Martire, K. A. & Watkins, I. 2015. Perception Problems of the Verbal Scale: a
Forensic Results across Europe, p. 64. Reanalysis and Application of a Membership Function Approach. Science and
17 Garcia-Romero, D. 2012. Robust Speaker Recognition Based on Latent
Justice 55, 264-273.
Variable Models. Unpublished PhD, University of Maryland. 58 Ligertwood, A. & Edmond, G. 2012. Expressing Evaluative Forensic Science
18 Nuance, Public Security Solutions for a Safer World: Voice Biometrics
Opinions in a Court of Law. Law Probability and Risk 11, 289-302.
Solutions for Public Safety. 59 Grant, T. 2008. Approaching Questions in Forensic Authorship Analysis. In J.
19 Agnitio, Speaker Recognition Technology: How did Agntio’s [sic] Software
Gibbons & M. T. Turell (eds.). Dimensions of Forensic Linguistics. Amsterdam:
Evolution to where it is Today? John Benjamins Publishing.
20 Coulthard, M. & Johnson, A. 2007. The Work of the Forensic Phonetician and
60 Nolan, F. 2003. A Recent Voice Parade. International Journal of Speech,
the Document Examiner. In An Introduction to Forensic Linguistics: Language Language and the Law 10 (2), 277-291.
in Evidence. London: Routledge. 61 A grant proposal has been submitted to the ESRC by the Department of
21 Questioned Content Analysis.
Theoretical and Applied Linguistics at the University of Cambridge in
22 Authentication of Audio.
collaboration with Oxford Wave Research.
23 Nemesysco’s Technology.
62 Stevenage, S. V., Neil, G. J., Barlow, J., Dyson, A., Eaton-Brown, C. &
24 Kirchhübel, C. 2013. The Acoustic and Temporal Characteristics of Deceptive
Parsons, B. 2012. The Effect of Distraction on Face and Voice Recognition,
Speech. Unpublished PhD, University of York Psychological Research, 77(2), 167-175.
25 Kirchhübel, C. & Howard, D. M. 2013. Detecting Suspicious Behaviour Using
63 Hope, L. 2015. Eyewitness Testimony. In D. A. Crighton & G. J. Towl (eds.).
Speech: Acoustic Correlates of Deceptive Speech – An Exploratory Forensic Psychology. Oxford: Wiley, 2nd edn. 45-64.
Investigation. Applied Ergonomics, 44, 694-702. 64 Language Analysis Testing Authorisation 2013: Palestinian, Syrian & Kuwaiti
26 French, P., Nolan, F. Foulkes, P., Harrison, P. & McDougall, K. 2010. The UK
(No. 2).
position statement on Forensic Speaker Comparison: a Rejoinder to Rose and 65 Language Analysis Testing of Asylum Applicants: Impacts and Economic Costs
Morrison, The International Journal of Speech, Language and the Law 17(1). and Benefits, September 2011. Redacted Version, April 2012.
27 Nolan, F. 1990. The Limitations of Auditory-Phonetic Speaker Identification.
66 Freedom Of Information request no. 16782 submitted to Home Office.
Texte zu Theorie und Praxis forensischer Linguistik. Tübingen: Max Niemeyer 67 Guidelines for the Use of Language Analysis in Relation to Questions of
Verlag. National Origin in Refugee Cases 2004.
28 Nolan, F. 1997. Speaker Recognition and Forensic Phonetics. In W. Hardcastle
68 International Association for Forensic Phonetics and Acoustics: Resolution on
& J. Laver (eds.). The Handbook of Phonetic Sciences. Oxford: Blackwell, 744- Language and Determination of National Identity Cases.
67. 69 Eades, D. 2010. Nationality Claims: Language Analysis and Asylum Cases. In
29 DigiLog UK.
M. Coulthard & A. Johnson (eds.). The Routledge Handbook of Forensic
30 DigiLog Advanced Validation Solutions: Testimonials.
Linguistics.
31 Eriksson, A. & Lacerda, F. 2007. ‘Charlatanry in Forensic Speech Science: A
70 “Patrick, P. forthcoming. Asylum and Language Analysis. In S. K Brown & F. D
Problem to be Taken Seriously’, International Journal of Speech, Language Bean (eds.). Encyclopedia of Migration. Heidelberg: Springer Verlag.
and the Law 14(2), 169-193. 71 Tutton, R. Hauskeller, C. & Sturdy, S. 2014. Suspect Technologies: Forensic
32 Hobson, J. L., Mayhew, W. K. & Venkatachalam, M. 2011. Analyzing Speech to
Testing of Asylum Seekers at the UK Border. Ethnic and Racial Studies 37(5).
Detect Financial Misreporting. Journal of Accounting Research 50(2), 349-392. 72 Campbell, J. 2012. Language Analysis in the United Kingdom’s Refugee Status
33 Leonard, R. A. 2005. Forensic Linguistics: Applying the Scientific Principles of
Determination System: Seeing through the Policy Claims about ‘Expert
Language Analysis to Issues of the Law. International Journal of the Knowledge’. Ethnic and Racial Studies.
Humanities 3, 1-9. 73 The Immigration Law Practitioners Association Evidence. In Home Affairs
34 Home Office circular 057/2003: Advice on the use of Voice Identification
Committee Asylum: Seventh Report of Session 2013-2014. Vol 1. (HC 71 – I).
Parades. 74 Craig, S. 2012. The Use of Language Analysis in Asylum Decision-making in
35 Home Office Research Study 293. 2005. A gap or a chasm? Attrition in
the UK – A Discussion. Journal of Immigration Asylum and Nationality Law
Reported Rape Cases. 26(3) 255-268.
36 Haworth, K. 2015. The Discursive Construction of Evidence in Police
75 Fraser, H. 2911. Language Analysis for the Determination of Origin (LADO). In
Interviews: Case Study of a Rape Suspect, Applied Linguistics. C. A. Chapelle (ed.). Encyclopedia of Applied Linguistics. Wiley-Blackwell.
37 El Periodico: La Peligrosa Euroorden. June 2014.
POSTnote 509 September 2015 Forensic Language Analysis Page 6