New Tools in Digital Humanities
UDHIG, June 13, 2006
Zoe Borovsky
New tools
Text: Juxta; TAPoR, HyperPo; WordHoard
Images: Image Markup Tool
Why digitize text?
Text analysis: discovering new knowledge by linking information together in interesting ways, not just showing overall trends.
“I think discovering new knowledge vs. showing trends is like the difference between a detective following clues to find the criminal vs. analysts looking at crime statistics to assess overall trends in car theft.” (Marti Hearst, 2003)
Example: three volumes of sagas, with hundreds of giants and giantesses.
The verb “look” occurs more often near the words and names for giantesses than near those for giants.
Types of tools
Concordance, comparison, corpus, critical editions (Juxta)
Search (TAPoR, HyperPo, WordHoard)
Key words in context (KWIC)
Collocates (associations)
Markup: lemma, parts of speech, speaker
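To make "key words in context" concrete, here is a minimal Python sketch of a KWIC display. It is not how TAPoR, HyperPo, or WordHoard implement the feature; the function name `kwic`, the `width` parameter, and the sample sentence are all assumptions made up for illustration.

```python
import re

def kwic(text, keyword, width=3):
    """Return (left context, keyword, right context) for each hit.

    Hypothetical helper: tokenizes on word characters, lowercases,
    and keeps `width` tokens on either side of every match.
    """
    tokens = re.findall(r"\w+", text.lower())
    hits = []
    for i, tok in enumerate(tokens):
        if tok == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits

# Invented sample sentence, not taken from any saga.
sample = "The giantess looked at him. He did not look back at the giantess."

for left, kw, right in kwic(sample, "giantess", width=2):
    # Right-align the left context so the keywords line up in a column.
    print(f"{left:>20} [{kw}] {right}")
```

Aligning the keyword in a column is what lets a reader scan the contexts quickly, which is the point of a concordance view.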
Juxta
Produces critical editions, comparing and collating multiple witnesses of a single work
http://www.patacriticism.org/juxta/
Juxta
Desktop application: Mac, Windows, and Unix/Linux (open source)
Input: plain text (UTF-8) or XML
Output: HTML critical apparatus
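The idea of collating one witness against a base text can be sketched in a few lines of Python. This uses the standard library's `difflib`, which is an assumption for illustration only; the slides do not say what algorithm Juxta uses. The function name `collate` and the two sample "witnesses" are invented.

```python
import difflib

def collate(base, witness):
    """List word-level variants between a base text and one witness.

    Hypothetical helper built on difflib.SequenceMatcher; each variant
    is (operation, base reading, witness reading).
    """
    a, b = base.split(), witness.split()
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    variants = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            variants.append((tag, " ".join(a[i1:i2]), " ".join(b[j1:j2])))
    return variants

# Invented sample witnesses, plain text as in Juxta's input.
base = "the quick brown fox jumps over the lazy dog"
witness = "the quick red fox leaps over the dog"

for op, base_reading, witness_reading in collate(base, witness):
    print(f"{op:8} base: {base_reading!r:10} witness: {witness_reading!r}")
```

A critical apparatus is essentially this list of variants, attached to positions in the base text and repeated for every witness.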
The darker the color, the more witnesses differ at that point
Toggle between texts
Generate HTML
 
TAPoR
Web-based text analysis portal
Search and display using online tools
http://test-tapor.mcmaster.ca/portal/portal
Input: XML, HTML, TEI, plain text
TAPoR
Mostly English, some western European languages
Word lists
KWIC (key word in context)
Collocates/co-occurrences: words that occur in proximity to the search word
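Word lists and collocates both come down to counting. The Python sketch below shows one simple way to compute them; it is not TAPoR's implementation, and the function names, the `window` parameter, and the sample sentence are assumptions for illustration.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())

def word_list(text):
    """Frequency of every word form in the text."""
    return Counter(tokenize(text))

def collocates(text, node, window=2):
    """Count words occurring within `window` tokens of `node`."""
    tokens = tokenize(text)
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            start = max(0, i - window)
            counts.update(tokens[start:i] + tokens[i + 1:i + 1 + window])
    return counts

# Invented sample sentence.
sample = "The white sail and the white sea met the white sky."

print(word_list(sample).most_common(3))
print(collocates(sample, "white", window=1).most_common())
```

Real tools usually filter out very common words such as "the" and compare the counts against a second corpus, which this sketch leaves out.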
Word list, HyperPo
Key word in context, HyperPo
Co-occurrences of “white”; add a secondary corpus
WordHoard
Desktop application and server version
Texts are annotated or tagged by morphological, lexical, semantic, prosodic, and narratological criteria.
http://wordhoard.northwestern.edu/userman/index.html
The downloadable version comes with texts.
The open source version can be installed on your own server with your own texts.
Sample WordHoard query: Shakespeare’s use of the word “love” over time
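A query like this amounts to computing a word's relative frequency in each dated text and ordering the results by date. The Python sketch below shows that calculation only. It does not use WordHoard or its data: the function names are hypothetical, the years and texts are placeholders rather than real works, and it counts exact word forms where WordHoard can count by lemma.

```python
import re

def relative_frequency(text, word):
    """Occurrences of `word` per 1,000 tokens of `text`."""
    tokens = re.findall(r"\w+", text.lower())
    if not tokens:
        return 0.0
    return 1000 * tokens.count(word.lower()) / len(tokens)

def frequency_over_time(dated_texts, word):
    """Return (year, relative frequency) pairs in chronological order."""
    ordered = sorted(dated_texts, key=lambda pair: pair[0])
    return [(year, relative_frequency(text, word)) for year, text in ordered]

# Placeholder corpus: invented years and invented strings, for shape only.
corpus = [
    (1600, "love is love and more"),
    (1595, "no such word here"),
]

for year, freq in frequency_over_time(corpus, "love"):
    print(year, f"{freq:.1f} per 1,000 words")
```

Normalizing per 1,000 words matters because the texts differ in length; raw counts would mostly track text size.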
Results…
Image Markup Tool
http://www.tapor.uvic.ca/~mholmes/image_markup/
Windows only
Image Markup Tool
Input: an image that you want to make available on a web page, with annotations directly on the image
Example: Robert Watson’s Back to Nature
 
Image Markup Tool
Output (sample):
A copy of your XML data file with an added XSL stylesheet declaration.
A copy of the image file you're marking up (usually reduced to a size suitable for a Web page; you can control this size in the Options / Web view preferences window).
An XSLT file (copied from the web_view folder in the program folder, with some variables modified to suit your data).
A JavaScript file (copied from the web_view folder in the program folder).
A CSS stylesheet file (copied from the web_view folder in the program folder).
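An "XSL stylesheet declaration" is an `xml-stylesheet` processing instruction placed before the root element, which tells a browser which XSLT file to apply. The Python sketch below adds one to an XML string with the standard library. It is not the Image Markup Tool's own code; the function name, the element names, and the file name `view.xsl` are assumptions for illustration.

```python
from xml.dom import minidom

def add_stylesheet(xml_text, xsl_href):
    """Return the XML with an xml-stylesheet instruction before the root.

    Hypothetical helper; `xsl_href` is the path to an XSLT file.
    """
    doc = minidom.parseString(xml_text)
    pi = doc.createProcessingInstruction(
        "xml-stylesheet", f'type="text/xsl" href="{xsl_href}"'
    )
    doc.insertBefore(pi, doc.documentElement)
    return doc.toxml()

# Invented data file content and stylesheet name.
annotated = add_stylesheet(
    "<annotations><note>placeholder</note></annotations>", "view.xsl"
)
print(annotated)
```

With the declaration in place, opening the XML file in a browser renders it through the XSLT, which is why the tool ships the stylesheet, JavaScript, and CSS files alongside the data.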
