SlideShare a Scribd company logo
Classifying the (digital)
 Arts and Humanities

       Wishful thinking in fifteen slides
            By Dr Torsten Reimer
Centre for e-Research, King's College London
IEEE Conference on e-Science - 11/12/2009
Torsten Reimer
Torsten Reimer
Torsten Reimer
Once upon a time
                      ICT Guides
                      •   Projects
                      •   Methods
                      •   Tools

arts-humanities.net
•    Events and
     reports
•    Community
•    Bibliography
     etc.
Torsten Reimer
arts-humanities.net
 an online hub for research & teaching
  in the digital arts and humanities
 support for creating and using digital
  resources
 enables members to locate information,
  promote their research and discuss
  ideas
 mix of centrally provided and user
  contributed content
 use of web 2.0 functionality such as
  tagging, feeds, wiki, blogging, user
  profiles etc.
 community resource
Methods Taxonomy

•   Originally developed for the projects
    and methods database
•   Focus on resource creation
•   Used to categorize projects,
    tools, resources
•   Now part of arts-humanities.net
•   Seven main categories
Data analysis
•   Collating: Collation is the process of comparing different versions of a text to discover the location and type of
    textual variants. Collation is fundamental to a variety of scholarly pursuits, for example in the Arts and Humanities
    field it can be used for the accurate reconstruction of texts of classical works. In the past collation was performed by
    hand; today, it is performed with the assistance of a computer. Read more...

•   Collocating: Refers to the techniques used to detect patterns of words that appear together in a text more often
    than would be expected by chance. A collocation is a group or pair of words that are always used together, and can
    illustrate restrictions on which verbs or adjectives can be used with particular nouns, or the order in which words
    appear. Read more...

•   Content analysis: Content analysis is a research technique focused on the content and internal features of media.
    It is used to determine the presence of certain words, concepts, themes, phrases, characters, or sentences within
    texts or sets of texts and to quantify this presence in an objective manner. Read more...

•   Content-based image retrieval: Content-based image retrieval (CBIR) refers to techniques used to search for
    digital images by features of their content, which is particularly helpful when studying large databases. It is often
    preferable to perform searches relying on metadata, which can be expensive and time-consuming to produce, as it
    requires humans to describe each individual item in the database. Read more...

•   Content-based sound retrieval: Refers to techniques used to search for sound files by features of their content,
    using specialist software, which is particularly helpful when studying large databases. It is often preferable to
    perform searches relying on metadata, which can be expensive and time-consuming to produce, as it requires
    humans to describe each individual item in the database. Read more...

•   Data mining: Data mining is the process of using computing power to extract hidden patterns from data, analysing
    the results from different perspectives and summarising it into a useful format, such as a graph or table. This
    process is often facilitated by the use of metadata. It is important that any patterns found are verified and validated
    by comparison with other data samples. In this way, data mining can identify trends that go beyond simple data
    analysis. Read more...

•   Image feature measurement: Image feature measurement is a term to describe techniques used to acquire,
    measure, and analyse the parameters of digital images, such as size, shape, relative locations, textures, grey tones
    and colours. These parameters are also known as ‘perception attributes’. Read more...
Three partners – one system?
The 'mine, all mine' problem
CHAIN
ADHO, centerNet, CLARIN,
  DARIAH, Project Bamboo,
  NoC
Key theme: advocacy for an
  improved digital research
  infrastructure for the
  Humanities and Arts
Knowledge base: all partners
  want one; we have one
International desire to overcome
   'mine, all mine problem'


      Coalition of Humanities and Arts Infrastructures and Networks
Problems with current set-up

•   Shared editing necessary
•   Versioning system
•   Distributed across several websites
•   Only parent-child relationships
•   Different terminology for same
    method in different fields
•   Only monolingual
Solution: semantic web?




Linked Data:
• 1. Use URIs to identify things.
• 2. Use HTTP URIs so that these things can be referred to and looked up
("dereference") by people and user agents.
• 3. Provide useful information (i.e., a structured description — metadata)
about the thing when its URI is dereferenced.
• 4. Include links to other, related URIs in the exposed data to improve
discovery of other related information on the Web.
Taxonomy as service
              Semantic web
                 (linked data)
              Shared taxonomy
              •   CeRch
              •   DHO
              •   OeRC
              •   (CHAIN)
              •   and you?
Glorious future

•   Build a resource owned
    by and useful for the
    wider Digital
    Humanities / Arts
    community
•   Bring field(s) together
•   Make what we do more
    easily accessible to
    funding bodies and the
    public

More Related Content

PPT
Digital library presentation
PDF
Digital Library Initiatives in India : An Overview
PPT
Introduction to Metadata for IDAH Fellows
PPTX
Digital Library
PPTX
National Digital Library
Digital library presentation
Digital Library Initiatives in India : An Overview
Introduction to Metadata for IDAH Fellows
Digital Library
National Digital Library

What's hot (20)

DOCX
Digital library Assignment
PPTX
Digital library
PPT
Assignment 1 Digital Library Review
PPT
Aksum University digital libraries
PPT
Hartley Presentation on Cataloging & Metadata Trends
PDF
Qatar Digital Library Project Workshop
PPSX
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
PDF
Introduction to Digital libraries
PPT
Digital Libraries
PPT
Aggregation as tactic sm new
PPSX
DOMAINS OF USER STUDIES (User Studies and User Education)
PPT
Digital library
PPTX
PPTX
User Focused Digital Library: A Practical Guide
PPTX
Digital Library
PPTX
Creating a digital library
PPTX
Toward universal information access on the digital object cloud
PPT
Digital libraries power point
Digital library Assignment
Digital library
Assignment 1 Digital Library Review
Aksum University digital libraries
Hartley Presentation on Cataloging & Metadata Trends
Qatar Digital Library Project Workshop
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
Introduction to Digital libraries
Digital Libraries
Aggregation as tactic sm new
DOMAINS OF USER STUDIES (User Studies and User Education)
Digital library
User Focused Digital Library: A Practical Guide
Digital Library
Creating a digital library
Toward universal information access on the digital object cloud
Digital libraries power point
Ad

Viewers also liked (7)

PDF
PDF
Why life is so complicated
PPTX
Transforming scholarly communications support at Imperial College London
PDF
The Good, the Bad and the Ugly. Open Access in the UK
PDF
Unknown Unknowns
PDF
On the research paper, and the knowledge within
PPTX
Imperial College London - journey to open scholarship
Why life is so complicated
Transforming scholarly communications support at Imperial College London
The Good, the Bad and the Ugly. Open Access in the UK
Unknown Unknowns
On the research paper, and the knowledge within
Imperial College London - journey to open scholarship
Ad

Similar to Torsten Reimer (20)

PPT
Brisith Academy/LH presentation
PPTX
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
PPT
From digital to social collections. A short story of collections online.
PPTX
Lorna hughes 12 05-2013 NeDiMAH and ontology for DH
PDF
Measuring the Impact of the Digital for the Humanities
PPTX
AHRC CDP Digital Humanities 101
PPTX
Thatcamp recap
PDF
Knowledge Engineering for TELDAP
PDF
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
PPTX
Digital collections and humanities research
PDF
Sharing - Collecting our DAH Thoughts
PDF
arts-humanities-A4
PPTX
Aquiles imlr seminar
PDF
Making an Impact: How Digitised Resources Change Lives
PDF
Introduction to Digital humanities
PPT
Ontologies and the humanities: some issues affecting the design of digital in...
PDF
2013 Aarhus University-DIGHUMLAB kickoff-Champion
PPTX
Dh presentation helig 2014
PPTX
Digital Repositories, the Data Set of the Humanities
PDF
06 gioca-ontologies
Brisith Academy/LH presentation
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
From digital to social collections. A short story of collections online.
Lorna hughes 12 05-2013 NeDiMAH and ontology for DH
Measuring the Impact of the Digital for the Humanities
AHRC CDP Digital Humanities 101
Thatcamp recap
Knowledge Engineering for TELDAP
Digital Tools, Trends and Methodologies in the Humanities and Social Sciences
Digital collections and humanities research
Sharing - Collecting our DAH Thoughts
arts-humanities-A4
Aquiles imlr seminar
Making an Impact: How Digitised Resources Change Lives
Introduction to Digital humanities
Ontologies and the humanities: some issues affecting the design of digital in...
2013 Aarhus University-DIGHUMLAB kickoff-Champion
Dh presentation helig 2014
Digital Repositories, the Data Set of the Humanities
06 gioca-ontologies

More from Anita de Waard (20)

PDF
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
PPTX
Why would a publisher care about open data?
PPTX
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
PDF
NFAIS Talk on Enabling FAIR Data
PPTX
CNI 2018: A Research Object Authoring Tool for the Data Commons
PPTX
Enabling FAIR Data: TAG B Authoring Guidelines
PPTX
Scientific facts are myths, told through fairytales and spread by gossip.
PPTX
Data, Data Everywhere: What's A Publisher to Do?
PPTX
Talk on Research Data Management
PPTX
History of the future
PPTX
Networked Science, And Integrating with Dataverse
PPTX
Big Data and the Future of Publishing
PPTX
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
PDF
Data Repositories: Recommendation, Certification and Models for Cost Recovery
PPTX
The Economics of Data Sharing
PPTX
Public Identifiers in Scholarly Publishing
PPTX
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
PPTX
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
PPTX
Charleston Conference 2016
PPTX
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Why would a publisher care about open data?
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
NFAIS Talk on Enabling FAIR Data
CNI 2018: A Research Object Authoring Tool for the Data Commons
Enabling FAIR Data: TAG B Authoring Guidelines
Scientific facts are myths, told through fairytales and spread by gossip.
Data, Data Everywhere: What's A Publisher to Do?
Talk on Research Data Management
History of the future
Networked Science, And Integrating with Dataverse
Big Data and the Future of Publishing
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Data Repositories: Recommendation, Certification and Models for Cost Recovery
The Economics of Data Sharing
Public Identifiers in Scholarly Publishing
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Charleston Conference 2016
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...

Recently uploaded (20)

PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Complications of Minimal Access Surgery at WLH
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Classroom Observation Tools for Teachers
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Institutional Correction lecture only . . .
PDF
RMMM.pdf make it easy to upload and study
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
01-Introduction-to-Information-Management.pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Insiders guide to clinical Medicine.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
102 student loan defaulters named and shamed – Is someone you know on the list?
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
VCE English Exam - Section C Student Revision Booklet
Complications of Minimal Access Surgery at WLH
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Renaissance Architecture: A Journey from Faith to Humanism
Classroom Observation Tools for Teachers
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Institutional Correction lecture only . . .
RMMM.pdf make it easy to upload and study
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
O7-L3 Supply Chain Operations - ICLT Program
01-Introduction-to-Information-Management.pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Microbial diseases, their pathogenesis and prophylaxis
O5-L3 Freight Transport Ops (International) V1.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Insiders guide to clinical Medicine.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx

Torsten Reimer

  • 1. Classifying the (digital) Arts and Humanities Wishful thinking in fifteen slides By Dr Torsten Reimer Centre for e-Research, King's College London IEEE Conference on e-Science - 11/12/2009
  • 5. Once upon a time ICT Guides • Projects • Methods • Tools arts-humanities.net • Events and reports • Community • Bibliography etc.
  • 7. arts-humanities.net  an online hub for research & teaching in the digital arts and humanities  support for creating and using digital resources  enables members to locate information, promote their research and discuss ideas  mix of centrally provided and user contributed content  use of web 2.0 functionality such as tagging, feeds, wiki, blogging, user profiles etc.  community resource
  • 8. Methods Taxonomy • Originally developed for the projects and methods database • Focus on resource creation • Used to categorize projects, tools, resources • Now part of arts-humanities.net • Seven main categories
  • 9. Data analysis • Collating: Collation is the process of comparing different versions of a text to discover the location and type of textual variants. Collation is fundamental to a variety of scholarly pursuits, for example in the Arts and Humanities field it can be used for the accurate reconstruction of texts of classical works. In the past collation was performed by hand; today, it is performed with the assistance of a computer. Read more... • Collocating: Refers to the techniques used to detect patterns of words that appear together in a text more often than would be expected by chance. A collocation is a group or pair of words that are always used together, and can illustrate restrictions on which verbs or adjectives can be used with particular nouns, or the order in which words appear. Read more... • Content analysis: Content analysis is a research technique focused on the content and internal features of media. It is used to determine the presence of certain words, concepts, themes, phrases, characters, or sentences within texts or sets of texts and to quantify this presence in an objective manner. Read more... • Content-based image retrieval: Content-based image retrieval (CBIR) refers to techniques used to search for digital images by features of their content, which is particularly helpful when studying large databases. It is often preferable to perform searches relying on metadata, which can be expensive and time-consuming to produce, as it requires humans to describe each individual item in the database. Read more... • Content-based sound retrieval: Refers to techniques used to search for sound files by features of their content, using specialist software, which is particularly helpful when studying large databases. It is often preferable to perform searches relying on metadata, which can be expensive and time-consuming to produce, as it requires humans to describe each individual item in the database. Read more... • Data mining: Data mining is the process of using computing power to extract hidden patterns from data, analysing the results from different perspectives and summarising it into a useful format, such as a graph or table. This process is often facilitated by the use of metadata. It is important that any patterns found are verified and validated by comparison with other data samples. In this way, data mining can identify trends that go beyond simple data analysis. Read more... • Image feature measurement: Image feature measurement is a term to describe techniques used to acquire, measure, and analyse the parameters of digital images, such as size, shape, relative locations, textures, grey tones and colours. These parameters are also known as ‘perception attributes’. Read more...
  • 10. Three partners – one system?
  • 11. The 'mine, all mine' problem
  • 12. CHAIN ADHO, centerNet, CLARIN, DARIAH, Project Bamboo, NoC Key theme: advocacy for an improved digital research infrastructure for the Humanities and Arts Knowledge base: all partners want one; we have one International desire to overcome 'mine, all mine problem' Coalition of Humanities and Arts Infrastructures and Networks
  • 13. Problems with current set-up • Shared editing necessary • Versioning system • Distributed across several websites • Only parent-child relationships • Different terminology for same method in different fields • Only monolingual
  • 14. Solution: semantic web? Linked Data: • 1. Use URIs to identify things. • 2. Use HTTP URIs so that these things can be referred to and looked up ("dereference") by people and user agents. • 3. Provide useful information (i.e., a structured description — metadata) about the thing when its URI is dereferenced. • 4. Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
  • 15. Taxonomy as service Semantic web (linked data) Shared taxonomy • CeRch • DHO • OeRC • (CHAIN) • and you?
  • 16. Glorious future • Build a resource owned by and useful for the wider Digital Humanities / Arts community • Bring field(s) together • Make what we do more easily accessible to funding bodies and the public