Argument
In a modern world where a countless number of new publications are being released everyday, and countless more lost from ages past, are yet to be discovered, it seems almost impossible to account for all the opinions, stories, cultures, histories, that these works carry within them. Even the most speedy of readers could not hope to conquer all the literature the world offers. This is where a term labeled distant reading can help the aspiring bookwork to grasp an otherwise unimaginable quantity of work.
The coin-er of the term distant reading, Franco Moretti quoted, in his duly titled book “Distant Reading”, Johann Wolfgang von Goethe’s statement “Literature is the fragment of fragments”. I believe that this really sums up in a way how distance reading attempts to tackle such large volumes of books. Distance reading attempts to capture the essence of these fragments, finding the underlying heart of the work, and exploiting it to show new empirical evidence regarding otherwise unseen themes. This is the first of the reasons that distance reading is so important for modern literary analysis. Not only does it decipher extreme quantity in a short period of time, but as Underwood puts it, ” we’re (distant readers) trying to reveal large-scale patterns that wouldn’t be evident in ordinary reading”. This deeper analysis of patterns unseen to the close readers eye, is what makes this practice so worthwhile. To continue on that point, the second reason for the importance of distance reading is similarly well summed up by Underwood, “this representation (digital representation) of text is radically different from readers”. By using digital representations of works rendered by distant reading techniques, the picture drawn to the observer will usually be starkly different than the picture drawn to the close reader. This brings us to the next step in the analysis, putting the two types of reading side by side in order to compare and learn from their difference. This process is called differential reading.
Clement writes that “differential reading … positions close and distant reading practices as both subjective and objective methodologies”, continuing later with, “Both close and distant reading practices can facilitate interpretation through subjective and objective means”. Differential reading is the contrastation of these two practices, offering the intimate, first hand, human biased knowledge discovered in close reading, and the empirical, underlying, objectively computed data discovered by distance reading. It offers a view into a work from different angles, providing the ability to back up claims made by data, and debunk the opinions presumed by the reader, as well as visa versa. Layering these two methods on top of each other, through differential reading, is best described by Clement’s simile; “This many-eyed perspective might be like ‘eye vision,’ which involves shooting a dynamic event, such as a soccer game, from multiple cameras placed at different angles”. This ability to offer an exclusive view into the text, is what makes differential reading an essential tool in the literary analyzers toolbox. Using this method unlocks a plethora of opportunities as Hoover writes in his article.To paraphrase a few: Testing hypotheses, supporting the claims of critical work, investigating how authors differentiate between the voices of characters, discovering radical shifts in style, etc. All of these seemingly unknown answers can be found, using differential reading.
And while this is a powerful tool there are certain drawbacks and difficulties. Distance Reading is not yet a perfect method.
- Uncertainty of the Role of Digital Analysis in the Humanities- There is an uncertainty to the validity of the empirical data provided by digital analysis. This though is a falsely held doubt, as Hoover writes; “Many kinds of evidence produced by statistical methods are simply not accessible without a computer”
- Dealing with Human Emotion in Texts- Computers are unable to interpret human emotion, and often analysis regarding mood, if not discoverable by words frequency or other parameters, must be realized through close reading
- Copyright Issues- Many texts are not out of copyright yet, such as most published after 1927 (in America). Furthermore, laws differentiate between countries, making things even more complicated.
- Many Texts are not even Digitized- Even if everything from above checks out, the work you are looking for might not even be available in digital form.
Experimentation
For the purpose of showing the power of the tool, I decided to do an analysis of the reading we did by Clement. I began my simply cleaning her article and throwing it into a word cloud to count for words frequency.
There was no surprise, some of the terms that arose from the wordle. One thing the cloud shows which may not be so apparent at first, is how closely the digital part of distance reading is tied to the actual literary part. Reading and literary are almost as involved as digital and text, showing that these digital methods are more connected to the works than the idea of “empiricizing” a novel would suggest.
Next, I decided to throw the most common terms into Ngram, in order to see how their frequency in the article compares with their frequency in all digitized english texts.
This shows shows some interesting trends. Most interesting, seems to be the rise in the word text, shown in blue. Text had a huge jump, but was sitting in an alright place to begin with, unlike data shown in orange. This is probably because of the shift in the meaning of the word. As the definition has changed from a body of writing, to an instant message, the words has become more popular.
Furthermore I decided to look into the New York Times records of articles, and see the trends using the same words.
Looking at this and comparing it to the Ngram is a great way to summarize the power of distance reading. By looking at these two graphs I now see that text finds itself at the bottom of the NYT, while in Ngram it is second from the top. This could suggest many things, for example, that writing about modern texting is not new worthy and is therefor not considered for articles. This would explain why it has grew the least, compared to the other four words.
Just from looking at a wordle of one of my assignments, I have now inference all of this data. I have analyzed and drew conclusion that I set out not knowing I could discover. This is why distance reading can be a powerful tool.