Annotation
Jodi Schneider
Linguistic and corpus perspectives on argumentative
discourse, SwissUniversities Doctoral Programme
Language & Cognition
University of Fribourg, Fribourg, Switzerland
2019-09-02
A typical annotation process
• Find text of interest
• Find phenomena of interest
• Draft an annotation manual
• Iteratively test annotation & revise manual
– Find questionable annotations, check disagreements.
– Revise the manual.
– Iterate.
• Annotate
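A minimal sketch of the "check disagreements" step, assuming two annotators have labelled the same items with one categorical label each. Cohen's kappa is one common agreement measure; the slides do not name a specific one, and the function names and the example labels below are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: share of items where both labels match.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: from each annotator's own label distribution.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1 - expected)


def disagreements(items, labels_a, labels_b):
    """Return (item, label_a, label_b) for every item the annotators split on."""
    return [(item, a, b)
            for item, a, b in zip(items, labels_a, labels_b)
            if a != b]


if __name__ == "__main__":
    # Hypothetical test round: four sentences, three possible labels.
    items = ["s1", "s2", "s3", "s4"]
    ann_a = ["claim", "claim", "premise", "none"]
    ann_b = ["claim", "premise", "premise", "none"]
    print("kappa:", round(cohen_kappa(ann_a, ann_b), 3))
    for item, a, b in disagreements(items, ann_a, ann_b):
        print(f"discuss {item}: annotator A={a}, annotator B={b}")
```

The list of disagreements is what feeds the manual revision: each one is either an annotator slip or a case the manual does not yet cover.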
Examples of annotation software
• GATE: https://gate.ac.uk Free & Open source
– NLP pipeline integration, robust development community, ingests lots of
formats
• UAM CorpusTool: http://www.corpustool.com Free
– Comparative statistics, corpus search, annotation schemes easy to set up
• Excel
– Great for simple annotation
• BRAT: http://brat.nlplab.org Free & Open source
– Run your own instance, browser-based for collaboration
• EPPI-Reviewer:
https://eppi.ioe.ac.uk/cms/Default.aspx?alias=eppi.ioe.ac.uk/cms/er4
– Data extraction for systematic review
• Custom tools
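Annotations made in these tools usually end up in a file that later analysis has to read. As one illustration, the sketch below reads text-bound annotations from BRAT's tab-separated standoff (.ann) format, where a span line looks like `T1<TAB>Label start end<TAB>covered text`. The labels and sample content are hypothetical, and the parser is a simplified sketch rather than a full reader: it skips relations, attributes, notes, and discontinuous spans.

```python
from typing import NamedTuple


class TextBound(NamedTuple):
    ann_id: str
    label: str
    start: int
    end: int
    text: str


def parse_text_bound(ann_content):
    """Collect text-bound (T...) annotations from .ann file content."""
    spans = []
    for line in ann_content.splitlines():
        if not line.startswith("T"):
            continue  # relations (R), attributes (A), notes (#), etc.
        parts = line.split("\t", 2)
        if len(parts) < 3:
            continue  # malformed line; ignored in this sketch
        ann_id, type_and_offsets, text = parts
        label, offsets = type_and_offsets.split(" ", 1)
        if ";" in offsets:
            continue  # discontinuous span; not handled here
        start, end = (int(x) for x in offsets.split())
        spans.append(TextBound(ann_id, label, start, end, text))
    return spans


if __name__ == "__main__":
    # Hypothetical .ann content with made-up labels.
    sample = (
        "T1\tPositive 0 8\tconfirms\n"
        "T2\tNegative 20 27\trefutes\n"
        "R1\tContrast Arg1:T1 Arg2:T2\n"
    )
    for span in parse_text_bound(sample):
        print(span)
```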
GATE
Jodi Schneider, Alexandre Passant, and Stefan Decker “Deletion
Discussions in Wikipedia: Decision Factors and Outcomes.”
In WikiSym2012. Linz, Austria, August 27-29, 2012.
UAM CorpusTool (V 2.8.16)
Jodi Schneider, Krystian Samp, Alexandre Passant, Stefan Decker. “Arguments
about Deletion: How Experience Improves the Acceptability of Arguments in Ad-
hoc Online Task Groups”. In CSCW 2013, San Antonio, TX, February 23-27, 2013.
Excel
Dong, Xiaoru; Xie, Jingyi; Hoang, Linh (2019): Inclusion_Criteria_Annotation. University of
Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5958960_V2 for Text
Mining Pipeline to Accelerate Systematic Reviews in Evidence-Based Medicine
Excel
Hoang, Linh; Schneider, Jodi (2018): Citation context analysis of RobotReviewer core papers circa
2018-06. University of Illinois at Urbana-Champaign.
https://doi.org/10.13012/B2IDB-1075526_V1 for Text Mining Pipeline to Accelerate
Systematic Reviews in Evidence-Based Medicine
BRAT
Halil Kilicoglu, Zeshan Peng, Shabnam Tafreshi, Tung Tran, Graciela Rosemblat, Jodi Schneider.
“Confirm or Refute?: A Comparative Study on Citation Sentiment Classification in Clinical Research
Publications.” Journal of Biomedical Informatics, Vol 91, 103123. doi: 10.1016/j.jbi.2019.103123
EPPI-Reviewer
Work in progress, Systematic Review of Empirical Research about Retracted
Publications project team
Custom Tools
Halil Kilicoglu, Graciela Rosemblat, Zeshan Peng, Mario Malicki, Tony Tse, Jodi Schneider, Gerben
ter Riet. Annotating Clinical Trial Publications to Assess CONSORT Adherence: A Feasibility Study.
6th World Conference on Research Integrity, Hong Kong, 2019.
Annotation examples--Fribourg--2019-09-03
Some tools have additional features
GATE - sentiment (gazetteer)
Jodi Schneider & Adam Wyner. “Identifying Consumers' Arguments in Text”
In SWAIE 2012: Semantic Web and Information Extraction at EKAW 2012.
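A gazetteer is, at its core, a list of terms that are looked up in the text and annotated with a category. The sketch below shows that idea in plain Python; it is not GATE's implementation or API, and the word lists, labels, and function name are hypothetical.

```python
import re

# Hypothetical sentiment gazetteer: label -> set of lower-cased terms.
SENTIMENT_GAZETTEER = {
    "positive": {"great", "excellent", "love"},
    "negative": {"poor", "broken", "terrible"},
}


def tag_sentiment(text, gazetteer=None):
    """Return (start, end, matched text, label) for each gazetteer hit."""
    gazetteer = SENTIMENT_GAZETTEER if gazetteer is None else gazetteer
    # Invert the gazetteer into a term -> label lookup table.
    lookup = {term: label
              for label, terms in gazetteer.items()
              for term in terms}
    hits = []
    for match in re.finditer(r"[A-Za-z']+", text):
        label = lookup.get(match.group().lower())
        if label is not None:
            hits.append((match.start(), match.end(), match.group(), label))
    return hits


if __name__ == "__main__":
    for hit in tag_sentiment("Great lens, but the strap is poor."):
        print(hit)
```

Single-word lookup like this misses multi-word terms and negation ("not great"), which is one reason to pair a gazetteer with further rules in a pipeline.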
GATE - (gazetteers)
Jodi Schneider & Adam Wyner. “Identifying Consumers' Arguments in Text”
In SWAIE 2012: Semantic Web and Information Extraction at EKAW 2012.
GATE – semantic search
Jodi Schneider & Adam Wyner. “Identifying Consumers' Arguments in Text”
In SWAIE 2012: Semantic Web and Information Extraction at EKAW 2012.
UAM CorpusTool (V 2.8.16)
Jodi Schneider, Krystian Samp, Alexandre Passant, Stefan Decker. “Arguments
about Deletion: How Experience Improves the Acceptability of Arguments in Ad-
hoc Online Task Groups”. In CSCW 2013, San Antonio, TX, February 23-27, 2013.
UAM CorpusTool (V 2.8.16)
Jodi Schneider, Krystian Samp, Alexandre Passant, Stefan Decker. “Arguments
about Deletion: How Experience Improves the Acceptability of Arguments in Ad-
hoc Online Task Groups”. In CSCW 2013, San Antonio, TX, February 23-27, 2013.
EPPI-Reviewer
Work in progress, Systematic Review of Empirical Research about Retracted
Publications project team