Larner, Samuel ORCID: https://orcid.org/0000-0002-8386-3789 (2017) Using a core word to identify different forms of semantically related formulaic sequences and their potential as a marker of authorship. Corpora, 11 (3). pp. 343-369. ISSN 1749-5032
Accepted Version
Available under License In Copyright.
Abstract
Formulaic sequences should make an excellent marker of style because if authors treat them as one lexical choice, they are unlikely to be aware of the individual words contained within. However, there is no clear-cut way to robustly identify all, and only, formulaic sequences in text. If one particular word can be isolated which occurs frequently in formulaic sequences—a core word—then a reasonable sub-set of word sequences will be identified, the majority of which can be expected to be formulaic. Using the core word 'way' which occurs in many formulaic sequences (e.g., in a way, by the way, by way of), the aim of this research is to establish whether individual authors use different way-phrases from each other and, for comparative purposes, whether authors use alternative non-formulaic realisations of the same semantic content. If inter-authorial differences can be found, way-phrases may hold potential as a marker of authorship. The results indicate that for one author, the phrase 'in a way' appeared to be used distinctively. Therefore, there is potential for formulaic sequences to be used as a marker of authorship, albeit for only one author out of twenty, which limits the usefulness of such a marker in a forensic context.
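The core-word idea in the abstract can be illustrated with a minimal sketch. It is not the procedure used in the article, which this page does not describe; the function name `core_word_phrases`, the two-word window, and the simple regular-expression tokeniser are all assumptions made for illustration. The sketch collects every short word sequence ending in the core word and counts it, so that counts from different authors' texts could then be compared.

```python
import re
from collections import Counter


def core_word_phrases(text, core="way", window=2):
    """Count word sequences that end in the core word.

    Hypothetical helper: `window` is the number of preceding words kept,
    an illustrative choice rather than a value taken from the article.
    """
    # Crude tokeniser: lower-case runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    phrases = Counter()
    for i, token in enumerate(tokens):
        if token == core:
            start = max(0, i - window)
            phrases[" ".join(tokens[start:i + 1])] += 1
    return phrases


# Toy sample, invented for illustration only.
sample = "In a way, it worked. By the way, I left. In a way it did."
counts = core_word_phrases(sample)
```

Here `counts` maps each candidate way-phrase to its frequency in the sample. Deciding which candidates are genuinely formulaic, and whether a difference between authors is distinctive, would still require the kind of analysis the article reports.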