Formula to detect an author's literary 'fingerprint'
Using literature written by Thomas Hardy, DH Lawrence and Herman Melville, physicists in Sweden have developed a formula to detect different authors' literary 'fingerprints'. New research published today, Thursday 10 December, in New Journal of Physics (co-owned by the Institute of Physics and German Physical Society), describes a new concept from a group of Swedish physicists from the Department of Physics at Umeå University called the meta book which uses the frequency with which authors use new words in their literature to find distinct patterns in authors' written styles.
For more than 75 years George Kingsley Zipf's maxim, based on a carefully selected compilation of American English called Brown Corpus, suggested a universal pattern for the frequency of new words used by authors. Zipf's law suggests that the frequency ranking of a word is inversely proportional to its occurrence.
New research suggests however that the truth behind word frequency is less universal than Zipf asserted and has more to do with the author's linguistic ability than any over-arching linguistic rule.
The researchers first found that the occurrence of new words in the texts by Hardy, Lawrence and Melville did begin to drop off in their texts as their book gets longer, despite new settings and plot-twists.
Their evidence also shows however that the rate of unique word drop-off varies for different authors and, most significantly, is consistent across the entire works of any one of the three authors they analysed.
The statistical analysis was applied to entire novels, sections from novels, complete works and amalgamations from different works by the same authors – they all had a unique word-frequency 'fingerprint'.
By using the statistical patterns evident from their study, the researchers have pondered the idea of a meta-book – a code for each author which could represent their entire work, completed or in the mental pipeline.
As the researchers write, "These findings lead us towards the meta book concept – the writing of a text can be described by a process where the author pulls a piece of text out of a large mother book (the meta book) and puts it down on paper. This meta book is an imaginary infinite book which gives a representation of the word frequency characteristics of everything that a certain author could ever think of writing."
Source: Institute of Physics
- Formula to detect an author’s literary ‘fingerprint’from Science DailyThu, 10 Dec 2009, 20:07:38 EST
- Formula to detect an author's literary 'fingerprint'from PhysorgThu, 10 Dec 2009, 7:28:09 EST
- Formula to detect an author's literary 'fingerprint'from Science BlogThu, 10 Dec 2009, 2:14:25 EST
Latest Science NewsletterGet the latest and most popular science news articles of the week in your Inbox! It's free!
Check out our next project, Biology.Net
From other science news sites
Popular science news articles
- Bats' flight technique could lead to better drones
- Six new fossil species form 'snapshot' of primates stressed by ancient climate change
- 'Slow' NZ seabed quake sheds light on tsunami-earthquake mechanism
- World's shallowest slow-motion earthquakes detected offshore of New Zealand
- UCI astronomers determine precise mass of a giant black hole
- Seeking to rewind mammalian extinction
- Perceived diversity in neighborhoods is related to more prejudice, study finds
- Imodium for a legal high is as dumb and dangerous as it sounds
- An experiment seeks to make quantum physics visible to the naked eye
- Made better through science: Calcite tuned to be mollusk-tough
- Critically Endangered and ancient Himalayan wolf needs global conservation attention
- Nearby massive star explosion 30 million years ago equaled detonation of 100 million suns
- Newly discovered titanosaurian dinosaur from Argentina, Sarmientosaurus
- One oil field a key culprit in global ethane gas increase
- First multi-year study of honey bee parasites and disease reveals troubling trends