1. Who Was John F. Burrows?
John F. Burrows is an Australian literary scholar and pioneer of computational stylistics and digital humanities.
He spent most of his career studying 18th- and 19th-century English literature, particularly authors such as Jane Austen.
Academic background
- Professor of English literature at the University of Newcastle (Australia).
- Early adopter of computers in literary analysis (1970s–80s).
- Known for combining statistics, linguistics, and literary criticism.
One of his most influential earlier works is the book:
- Computation into Criticism (1987)
In that book he demonstrated how computers could analyze style in the novels of Jane Austen.
Burrows is often considered one of the founders of modern stylometry, the statistical study of literary style.
2. The Intellectual Problem He Wanted to Solve
Burrows was interested in a classic literary question:
Who wrote a particular text?
Many works in literary history have uncertain authorship.
Examples include:
- anonymous poems
- disputed plays
- collaborative works
- pseudonymous publications
Traditional literary scholarship usually answered this question through:
- historical documents
- letters
- biography
- thematic interpretation
- stylistic intuition
But these approaches often relied on subjective judgement.
Burrows wanted a more objective method.
His basic insight:
An author’s style leaves statistical traces in language.
Even when writers try to imitate others, certain patterns remain.
3. The Field: Stylometry
Before discussing Delta, we must understand stylometry.
Stylometry is the quantitative study of writing style.
Instead of interpreting meaning, it measures things like:
- word frequency
- sentence length
- vocabulary distribution
- punctuation patterns
These measurable patterns can act like a linguistic fingerprint.
Stylometry assumes:
Every writer has unconscious habits in language.
These habits are difficult to control consciously.
For example:
Writers unconsciously prefer certain small words like
- and
- but
- of
- to
- in
These are called function words.
4. The Key Insight of Burrows
Most literary scholars look at important words (themes, metaphors).
Burrows realized something surprising:
The most revealing stylistic markers are often the most boring words.
Words like:
- the
- of
- and
- to
- in
- with
Why?
Because they are content-neutral.
They are not influenced by topic or theme.
They reflect habitual grammar and rhythm of the writer.
5. The Delta Method
Burrows introduced his famous method in the article:
“Delta: A Measure of Stylistic Difference and a Guide to Likely Authorship” (2002).
Delta: a Measure of Stylistic Difference and a Guide to Likely Authorship
The method compares texts using the frequency of the most common words.
6. How Delta Works (Simple Explanation)
Imagine you have texts by three authors:
Author A
Author B
Author C
And you find an anonymous text.
You want to know which author wrote it.
Step 1 — Select frequent words
Choose the most frequent words in the corpus.
For example:
the
and
of
to
in
with
Burrows used around 150 common words.
Step 2 — Measure frequency
For each author, calculate how often these words appear.
Example:
| word | Author A | Author B | Author C |
|---|---|---|---|
| the | 7% | 5% | 8% |
| of | 4% | 6% | 3% |
Step 3 — Standardize the values
Different texts have different lengths.
So Burrows converts the numbers into z-scores (standardized values).
This removes distortions caused by text size.
Step 4 — Calculate distance
Then he measures the difference between two texts.
Delta simply measures the average difference in word frequency.
Smaller difference = more similar style.
Lower Delta value = more likely same author.
7. What the Delta Score Means
Delta measures stylistic distance.
Think of it like distance between fingerprints.
Example:
Anonymous text vs Author A → Delta = 0.6
Anonymous text vs Author B → Delta = 1.3
Anonymous text vs Author C → Delta = 1.7
The smallest number indicates the most similar style.
So the text was probably written by Author A.
8. Experiments Burrows Conducted
Burrows tested the method on English poetry of the 17th century.
His dataset included:
- 25 poets
- 200 poems
The results showed:
- texts by the same author cluster together
- different authors are clearly distinguishable
The method works best for texts longer than about 1500 words, but it can still help reduce candidate authors even with shorter texts.
9. Philosophical Implications
Burrows’ work challenges traditional literary assumptions.
Traditional view
Literary style is:
- unique
- artistic
- difficult to measure
Burrows’ view
Style is also statistical.
A writer’s voice emerges from thousands of unconscious choices.
Thus style can be:
- quantified
- modeled
- compared mathematically.
10. Relation to Traditional Literary Criticism
Burrows does not reject literary interpretation.
Instead he proposes two complementary approaches.
Close Reading
Traditional criticism focuses on:
- themes
- symbolism
- rhetoric
- narrative
Example: analyzing metaphors in Hamlet.
Distant Reading
Computational criticism examines:
- thousands of words
- statistical patterns
- linguistic distributions
Burrows helped establish this perspective.
Later critics like Franco Moretti expanded this idea with the concept of distant reading.
11. Influence on Digital Humanities
Burrows’ Delta became one of the most influential algorithms in literary computation.
It is now widely used in:
- authorship attribution
- plagiarism detection
- translation studies
- historical linguistics
- AI text detection
Later researchers built tools implementing Delta.
One major example is the stylo package in R, which allows scholars to perform stylometric analysis easily.
12. Later Improvements and Criticism
Other scholars tested and refined Burrows’ method.
For example:
David L. Hoover studied how Delta performs on different genres.
His results showed:
- Delta works well for prose and poetry
- using more frequent words can improve accuracy.
But there are limitations.
Delta may fail when:
- the text genre differs greatly
- an author’s style changes over time
- texts are very short.
13. Burrows’ Broader Philosophy of Literature
Burrows belongs to a tradition sometimes called empirical literary studies.
Its principles:
- Literature can be studied scientifically.
- Style emerges from patterns in language.
- Computers allow large-scale literary analysis.
- Quantitative methods complement interpretation.
In other words:
Literary studies can combine humanistic interpretation with data analysis.
14. Why Delta Became Famous
Burrows’ method became famous for several reasons.
Simplicity
The algorithm is surprisingly simple.
Accuracy
It performs very well in authorship attribution.
Interpretability
Unlike complex machine learning models, Delta is easy to understand.
Generality
It works across languages and literary periods.
15. Historical Importance
Burrows’ Delta marks a turning point.
Before Delta:
Computational literary analysis was experimental.
After Delta:
Stylometry became a central method of digital humanities.
Today, many modern techniques in natural language processing follow similar principles.
16. The Core Thesis of Burrows’ Article
The central argument of the paper is:
Statistical analysis of frequent function words provides a reliable method for identifying stylistic similarity between texts and guiding authorship attribution.
Or in simpler words:
Even the smallest words reveal the identity of the writer.
17. A Simple Analogy
Imagine hearing someone speak on the phone.
You might recognize them not by what they say, but by how they speak:
- rhythm
- accent
- pacing
Burrows’ Delta does the same thing with text.
It recognizes the accent of writing.
Conclusion
John F. Burrows transformed literary studies by demonstrating that style has measurable structure.
His Delta method showed that:
- language contains hidden statistical signatures
- computers can reveal these patterns
- literary interpretation can benefit from quantitative analysis
His work helped create the modern field of computational literary studies.