SlideShare a Scribd company logo
Introduction to Text Analysis

MLA Annual Convention
Getting Started in the Digital Humanities
January 9, 2014
Lauren F. Klein
Georgia Institute of Technology
lauren.klein@lmc.gatech.edu
@laurenfklein
Introduction to Text Analysis
• What is text analysis?
Introduction to Text Analysis
• What is text analysis?
• Why should you use it?
Introduction to Text Analysis
• What is text analysis?
• Why should you use it?
• How do you use it?
– Examples
– Tools
What is Text Analysis?
What is Text Analysis?
According to Geoffrey Rockwell:

•

“Text analysis systems can search large texts quickly. They do this by preparing
electronic indexes to the text so that the computer does not have to read through
the entire text. When finding words can be done so quickly that it is "interactive",
it changes how you can work with the text - you can serendipitously explore
without being frustrated by the slowness of the search process.

•

“Text analysis systems can conduct complex searches. Text analysis systems will
often allow you to search for lists of words or for complex patterns of words. For
example you can search for the co-occurrence of two words.

•

“Text analysis systems can present the results in ways that suit the study of
texts. Text analysis systems can display the results in a number of ways; for
example, a Keyword In Context display shows you all the occurrences of the found
word with one line of context.”

http://tada.mcmaster.ca/Main/WhatTA
Introduction to Text Analysis
http://www.wordle.net
Introduction to Text Analysis
http://www.wordle.net
Introduction to Text Analysis
Why Use Text Analysis?
Why Use Text Analysis?
Geoff Rockwell, again:

•
•
•

“Text analysis tools aide the interpreter asking questions of electronic texts.”
“Text analysis practices encourage reflection on the questions asked and
formalization of queries.”
“Text analysis is a way of targeting rereading that tests intuitions.”
Why Use Text Analysis?
Geoff Rockwell, again:

•
•
•

“Text analysis tools aide the interpreter asking questions of electronic texts.”
“Text analysis practices encourage reflection on the questions asked and
formalization of queries.”
“Text analysis is a way of targeting rereading that tests intuitions.”
Why Use Text Analysis?
Geoff Rockwell, again:

•
•
•

“Text analysis tools aide the interpreter asking questions of electronic texts.”
“Text analysis practices encourage reflection on the questions asked and
formalization of queries.”
“Text analysis is a way of targeting rereading that tests intuitions.”

Ted Underwood:
• “Proving a literary thesis with statistical analysis is often like cracking a nut with a
jackhammer. You can do it: but the results are not necessarily better than you
would get by hand.”
Why Use Text Analysis?
Geoff Rockwell, again:

•
•
•

“Text analysis tools aide the interpreter asking questions of electronic texts.”
“Text analysis practices encourage reflection on the questions asked and
formalization of queries.”
“Text analysis is a way of targeting rereading that tests intuitions.”

Ted Underwood:
• “Proving a literary thesis with statistical analysis is often like cracking a nut with a
jackhammer. You can do it: but the results are not necessarily better than you
would get by hand.”

What I think (in the spirit of Movable Type):
• Text analysis as “a way to tell a new story.”
How to Use Text Analysis?
Ben Blatt,
http://www.slate.com/articles/arts/culturebox/2013/11/hunger_games_catching_fire_a_text
Sarah Lohman,
http://www.fourpoundsflour.com/the-gallery-data-visualization-of-a-timeline-of-taste/
Daniel, http://lkleincourses.lmc.gatech.edu/dh12/2012/02/22/the-role-of-senses-in-a-studyin-scarlet/
Ted Underwood and Jordan Sellers, http://journalofdigitalhumanities.org/1-2/theemergence-of-literary-diction-by-ted-underwood-and-jordan-sellers/
Rob Nelson, http://dsl.richmond.edu/dispatch/
Matt Jockers, http://www.nbcnews.com/technology/data-mining-classics-makes-beautifulscience-954577
Matt Jockers, from Macroanalysis (Univ. of Illinois Press, 2013)
Lauren Klein, from “The Image of Absence” (American Literature 85.4)
Tools for Text Analysis
•
•
•
•
•
•
•
•

Wordle
Google Ngram Viewer
IBM Many Eyes
Voyant
MONK (requires institutional access)
MALLET
Stanford’s Natural Language Processing Toolkit
R
Google Ngram Viewer

Google Ngram Viewer
https://books.google.com/ngrams
IBM Many Eyes

Many Eyes
http://www-958.ibm.com/software/analytics/manyeyes/
Voyant Tools

Voyant Tools
http://voyant-tools.org/
MALLET

MALLET
http://mallet.cs.umass.edu/
Stanford NLP Toolkit

Stanford NLP Toolkit
http://nlp.stanford.edu/downloads/
R Programming Language

R (programming language)
http://www.r-project.org/
TAPoR

TAPoR (Text Analysis PoRtal)
http://tapor.ca/
More Lists of Tools
• http://toolingup.stanford.edu/?page_id=367
• http://guides.library.upenn.edu/dhtextanalysi
s
• http://dirt.projectbamboo.org/categories/text
-mining
Many Eyes Demo

http://lkle.in/1bTr2eT
Voyant Tools Demo

http://lkle.in/1e186zN

More Related Content

PPTX
Text analysis
PPT
Text analysis presentation ppt
PPTX
what is stylistics and its levels 1.Phonological level 2.Graphological leve...
PPTX
Computational linguistics
PPTX
Introduction to Phonetics and Phonology
PPTX
Ambiguity
PPTX
Dictionaries
PPTX
Stylistic devices
Text analysis
Text analysis presentation ppt
what is stylistics and its levels 1.Phonological level 2.Graphological leve...
Computational linguistics
Introduction to Phonetics and Phonology
Ambiguity
Dictionaries
Stylistic devices

What's hot (20)

DOC
Limitations Of Traditional Grammar
PPSX
Catford Translation Theory
DOCX
Levels of Stylistic Analysis.docx
PPTX
LEXICOGRAPHY
PPTX
Writting skills
PPTX
Linguistics levels of foregrounding in stylistics
PPTX
Traditional grammar ppt
PPT
Types of translation
PPTX
Phrase Structure Grammar
PPTX
I C ANALYSIS
PPT
THE PROCESS APPROACH TO WRITING
PPTX
Style and Stylistics
PPTX
Textual Analysis
PPTX
Theory of translation
PPT
Stylistics 3 The Purpose of Stylistics.ppt
PPTX
Phrase structure
DOCX
Computational linguistics
PPTX
Morphological Analysis
DOCX
Stylistics
PPTX
Stylistics
Limitations Of Traditional Grammar
Catford Translation Theory
Levels of Stylistic Analysis.docx
LEXICOGRAPHY
Writting skills
Linguistics levels of foregrounding in stylistics
Traditional grammar ppt
Types of translation
Phrase Structure Grammar
I C ANALYSIS
THE PROCESS APPROACH TO WRITING
Style and Stylistics
Textual Analysis
Theory of translation
Stylistics 3 The Purpose of Stylistics.ppt
Phrase structure
Computational linguistics
Morphological Analysis
Stylistics
Stylistics
Ad

Viewers also liked (10)

PPT
Functional styles of the english language
PPT
Features of translation 2 (1)
DOCX
Text analysis essay
PPT
Colloquial & Literary types of communiation
PPT
Textual Analysis
PPTX
Functional Styles
PPTX
Translation methods
PPT
The analysis of the text
PPT
Translation Types
PPT
Methods Of Translation
Functional styles of the english language
Features of translation 2 (1)
Text analysis essay
Colloquial & Literary types of communiation
Textual Analysis
Functional Styles
Translation methods
The analysis of the text
Translation Types
Methods Of Translation
Ad

Similar to Introduction to Text Analysis (20)

PDF
lexical-semantics-221118101910-ccd46ac3.pdf
PPTX
Lexical Semantics, Semantic Similarity and Relevance for SEO
PPTX
Literature search
PPTX
Presentation on the use of AI tools.pptx
PPT
LitSearch for Postgraduate Thesis selection (NXPowerLite Copy).ppt
PPTX
Word vectorization(embedding) with nnlm
PPT
1 Introduction.ppt
PPTX
Chapter 3 class version
PDF
15. political discourseinthenewskb
PPTX
Writing Seminar Surface
PPTX
Contextualized Online Search and Research Skills.pptx
PPTX
Writing Seminar Babbitt Spring 2012
PPTX
Natural Language Processing (NLP)
PDF
Literature Review- Dr. Mangeni.pdf ffhhg
PPTX
naturallanguageprocessingnlp-231215172843-839c05ab.pptx
PPTX
Research Writing - Universitas Indonesia
PPTX
Dr. Ross PSYCH440 - Seminar
PPTX
Engl 1421 smith
PDF
A Gentle Introduction to Text Analysis :)
PPTX
ENGL 1221 Writing Seminar
lexical-semantics-221118101910-ccd46ac3.pdf
Lexical Semantics, Semantic Similarity and Relevance for SEO
Literature search
Presentation on the use of AI tools.pptx
LitSearch for Postgraduate Thesis selection (NXPowerLite Copy).ppt
Word vectorization(embedding) with nnlm
1 Introduction.ppt
Chapter 3 class version
15. political discourseinthenewskb
Writing Seminar Surface
Contextualized Online Search and Research Skills.pptx
Writing Seminar Babbitt Spring 2012
Natural Language Processing (NLP)
Literature Review- Dr. Mangeni.pdf ffhhg
naturallanguageprocessingnlp-231215172843-839c05ab.pptx
Research Writing - Universitas Indonesia
Dr. Ross PSYCH440 - Seminar
Engl 1421 smith
A Gentle Introduction to Text Analysis :)
ENGL 1221 Writing Seminar

More from Lauren Klein (7)

PPTX
Feminist Data Visualization
PPTX
Exploratory Thematic Analysis for Historical Newspaper Archives
PPTX
Yale Digital Humanities Working Group
PPTX
The Long Arc of Visual Display
PPTX
Viz workshop
PPTX
Archival Silence, Digital Humanities, and James Hemings
PPTX
Towards an Ethics of Online Research: Accounting for Absence in the Jefferson...
Feminist Data Visualization
Exploratory Thematic Analysis for Historical Newspaper Archives
Yale Digital Humanities Working Group
The Long Arc of Visual Display
Viz workshop
Archival Silence, Digital Humanities, and James Hemings
Towards an Ethics of Online Research: Accounting for Absence in the Jefferson...

Recently uploaded (20)

PDF
Sports Quiz easy sports quiz sports quiz
PDF
01-Introduction-to-Information-Management.pdf
PDF
Basic Mud Logging Guide for educational purpose
PDF
Insiders guide to clinical Medicine.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Computing-Curriculum for Schools in Ghana
PDF
Complications of Minimal Access Surgery at WLH
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
Cell Types and Its function , kingdom of life
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
Sports Quiz easy sports quiz sports quiz
01-Introduction-to-Information-Management.pdf
Basic Mud Logging Guide for educational purpose
Insiders guide to clinical Medicine.pdf
human mycosis Human fungal infections are called human mycosis..pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Computing-Curriculum for Schools in Ghana
Complications of Minimal Access Surgery at WLH
Supply Chain Operations Speaking Notes -ICLT Program
PPH.pptx obstetrics and gynecology in nursing
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Cell Types and Its function , kingdom of life
Abdominal Access Techniques with Prof. Dr. R K Mishra
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Renaissance Architecture: A Journey from Faith to Humanism
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
2.FourierTransform-ShortQuestionswithAnswers.pdf
Anesthesia in Laparoscopic Surgery in India

Introduction to Text Analysis