SlideShare a Scribd company logo
1/23
Introduction
NL Inference
Women in Computer Science
Natural Language Inference for Humans
Valeria de Paiva
Women+@DCS Sheffield
July 2020
Valeria de Paiva Women+@DCS
2/23
Introduction
NL Inference
Women in Computer Science
Thanks, Aline!
Valeria de Paiva Women+@DCS
3/23
Introduction
NL Inference
Women in Computer Science
Personal stories
I’m a logician, a proof-theorist, a computational semanticist and a
category theorist.
I work in industry in Silicon Valley, have done so for the last 20
years, applying the purest of pure mathematics, in surprising ways.
Valeria de Paiva Women+@DCS
4/23
Introduction
NL Inference
Women in Computer Science
Personal stories
Valeria de Paiva Women+@DCS
5/23
Introduction
NL Inference
Women in Computer Science
PARC, XLE and Bridge
Valeria de Paiva Women+@DCS
6/23
Introduction
NL Inference
Women in Computer Science
Powerset, Cuil and Nuance
Valeria de Paiva Women+@DCS
7/23
Introduction
NL Inference
Women in Computer Science
Natural Language Inference (NLI)
A shock when the work of almost a decade at PARC was out
of reach when I left in 2008
I gave a talk at SRI proposing to redo it all, open source (de
Paiva 2010 Bridges)
Pleased to report that almost all of it is now available
open-source
Most work with/by Katerina Kalouli, PhD student at
Konstanz
Valeria de Paiva Women+@DCS
8/23
Introduction
NL Inference
Women in Computer Science
Natural Language Inference: why?
In May 2016 Google announced Parsey McParseface, the
world’s most accurate parser1: 94% accuracy
In 2014 Marelli et al launched the SICK corpus at SemEval
2014: an easy (no named entities, no temporal phenomena,
limited vocabulary, etc..), linguist curated corpus to test
compositional knowledge
Can we use SyntaxNet to process SICK with off-the-shelf
tools such as WordNet and SUMO?
It’s complicated! Five papers and counting!
1
ai.googleblog.com/2016/0/announcing-syntaxnet-worlds-most.
html
Valeria de Paiva Women+@DCS
9/23
Introduction
NL Inference
Women in Computer Science
Natural Language Inference: what?
Examples from SNLI dataset at Stanford
Valeria de Paiva Women+@DCS
10/23
Introduction
NL Inference
Women in Computer Science
NLI for Humans
Easier to detect inference than to decide on “good”semantic
representations
Data-driven NLU need large, diverse, high-quality corpora
annotated to learn inference relations: entails, contradicts,
neutral
Can we trust the corpora we have?
Are they really learning logical inferences?
Are the findings on the big corpora available SNLI, MNLI,
SciTail, etc transferable and generalizable? (Plenty of recent
work showing no, systems learn biases of the corpora, cannot
be redeployed)
Valeria de Paiva Women+@DCS
11/23
Introduction
NL Inference
Women in Computer Science
NLI for SICK
Explaining Simple
Natural Language
Inference ACL2019
Textual Inference:
getting logic from
humans IWCS2017
Correcting
Contradictions,
CONLI 2017
Graph Knowledge
Representations for
SICK, NLCS2018
WordNet for “Easy”
Textual Inferences
LREC2018
Valeria de Paiva Women+@DCS
12/23
Introduction
NL Inference
Women in Computer Science
NLI for SICK
Are the annotations in SICK logical? Can we trust them?
Several problems: lack of guidelines on co-reference, how to
annotate contradictions, ungrammatical and non-sensical
sentences, noisy data, etc..
This meant contradictions in SICK are not symmetric and
they need to be
Contradictions require alignment between entities and events,
which need to be ”close enough”
how to decide when things are close enough?
Can we do simpler case where sentences are
”one-word-apart”using WordNet?
More guidelines necessary for SICK annotation?
Valeria de Paiva Women+@DCS
13/23
Introduction
NL Inference
Women in Computer Science
NLI for SICK
https://logic-forall.blogspot.com/2020/03/
sick-dataset-in-these-trying-times.html
Valeria de Paiva Women+@DCS
14/23
Introduction
NL Inference
Women in Computer Science
Are we there yet?
Manning: Computational Linguistics and Deep Learning, 2015
”NLP is kind of like a rabbit in the headlights of the Deep
Learning machine, waiting to be flattened.”
Hinton 2015: ”I will be disappointed if in five years’ time we do
not have something that can watch a YouTube video and tell a
story about what happened.”
[not totally flattened, yet]
Valeria de Paiva Women+@DCS
15/23
Introduction
NL Inference
Women in Computer Science
Conclusions so far
Working for division of semantic labor between
symbolic/structural and distributional approaches
Have fledgling proposal GKR with strict separation of
conceptual and contextual structures
Also concrete proposal for injecting distributionality in GKR:
promising results (COLING submission)
Further Work: Still working to produce a ‘correct’ SICK
Working on annotations and theorem provers
test GKR with further datasets, further distributional
architectures
Valeria de Paiva Women+@DCS
16/23
Introduction
NL Inference
Women in Computer Science
4th Workshop Women in Logic 2020
Valeria de Paiva Women+@DCS
17/23
Introduction
NL Inference
Women in Computer Science
Women in Computer Science
I grew up believing most of the gender wars had been fought
by our grandmothers, suffragettes or not.
that the law allowed me to get into colleges and work places.
that I could always apply for scholarships and grants. I had
plenty of women teachers.
I thought my job was to work hard and show people I could
do the job as well as any man
I knew the numbers were bad both in Computing and in
Maths, but I thought they’re bad as usual, not particularly
bad.
That time would be on our side, that things were going to get
more equal as time went by
Valeria de Paiva Women+@DCS
18/23
Introduction
NL Inference
Women in Computer Science
Women in Computer Science
Valeria de Paiva Women+@DCS
19/23
Introduction
NL Inference
Women in Computer Science
Women in Computer Science
Valeria de Paiva Women+@DCS
20/23
Introduction
NL Inference
Women in Computer Science
Women in Computer Science
When Nat Shankar asked me if I wanted to say a few words about
Logic in Computer Science, in its 30th birthday, I warned him that
he might not like the few words.
Then we launched the Workshop Women in Logic, the facebook
group Women in Logic and the blog.
Valeria de Paiva Women+@DCS
21/23
Introduction
NL Inference
Women in Computer Science
Women in Computer Science
Workshops in Iceland, UK, Canada and this year Paris, France.
Funding for scholarships from SIGLOG, VCLA (Vienna Center for
Logic and Algorithms), and ILLC (institute for Language, Logic,
and Computation), Amsterdam, Netherlands.
Valeria de Paiva Women+@DCS
22/23
Introduction
NL Inference
Women in Computer Science
Women in Computer Science Data
We have a spreadsheet of women logicians, editable by
everyone, since 2012.
A collection of spreadsheets checking numbers of female
Invited Speakers in many of the theoretical Computer Science
main conferences.
Careful work on number of women invited speakers for the
ASL meetings (thanks Johanna Franklin!)
Have a mailing list and many plans. Join us!
Valeria de Paiva Women+@DCS
23/23
Introduction
NL Inference
Women in Computer Science
More information
GKR Demo:
http://lap0973.sprachwiss.uni-konstanz.de:
8080/sem.mapper/
GKR source code:
https://github.com/kkalouli/GKR_semantic_parser
Ask KAterina questions!
Play with it and tells us all the other things we haven’t done, yet!
Thanks!
Valeria de Paiva Women+@DCS

More Related Content

PDF
Benchmarking Linear Logic Proofs
PDF
The importance of Being Erneast: Open datasets in Portuguese
PDF
Weapons of Math Construction
PDF
Structural and Distributional Meaning Representations
PDF
Benchmarking Linear Logic Proofs
PDF
Dialectica Categories and Petri Nets
PDF
Constructive Modal and Linear Logics
PDF
Semantics and Reasoning for NLP, AI and ACT
Benchmarking Linear Logic Proofs
The importance of Being Erneast: Open datasets in Portuguese
Weapons of Math Construction
Structural and Distributional Meaning Representations
Benchmarking Linear Logic Proofs
Dialectica Categories and Petri Nets
Constructive Modal and Linear Logics
Semantics and Reasoning for NLP, AI and ACT

What's hot (14)

PDF
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
PDF
Dialectica Categories and Petri Nets
PDF
Dialectica Categories: the Relevant version, Valeria de Paiva
PDF
Constructive Modalities
PDF
Fun with Constructive Modalities
PDF
Categorical Semantics for Explicit Substitutions
PDF
Going Without: a modality and its role
PDF
Semantics and Reasoning: for NLP, AI and ACT
PDF
Categorical Explicit Substitutions
PDF
OWN-PT: Taking Stock
PDF
Categorical Semantics for Explicit Substitutions
PDF
Edwardian Proofs as Futuristic Programs
PPT
Ontologies and Semantics for Portuguese
PDF
Little engines of inference: contexts for quantification
Intuitive Semantics for Full Intuitionistic Linear Logic (2014)
Dialectica Categories and Petri Nets
Dialectica Categories: the Relevant version, Valeria de Paiva
Constructive Modalities
Fun with Constructive Modalities
Categorical Semantics for Explicit Substitutions
Going Without: a modality and its role
Semantics and Reasoning: for NLP, AI and ACT
Categorical Explicit Substitutions
OWN-PT: Taking Stock
Categorical Semantics for Explicit Substitutions
Edwardian Proofs as Futuristic Programs
Ontologies and Semantics for Portuguese
Little engines of inference: contexts for quantification
Ad

Similar to Natural Language Inference for Humans (13)

PDF
Ask Us Anything by the Paris WiMLDS team
PPT
Training Leadership Summit
PPT
Training Leadership Summit
PPTX
Getting Started with STEAM
PPT
Isis duke 041610
PDF
Data + Women June 2023.pdf
PDF
Digital Fabrication in Learning Environments
PDF
Digitalsts A Field Guide For Science Technology Studies
PPTX
Families learning summit (final)
PPT
Chief Learning Officer Forum
PDF
Information sharing in transportation systems - more than digital pieces of p...
PDF
Online Marketing PART2 - new communication paradigm
PPSX
Overview Of Technology Goddesses
Ask Us Anything by the Paris WiMLDS team
Training Leadership Summit
Training Leadership Summit
Getting Started with STEAM
Isis duke 041610
Data + Women June 2023.pdf
Digital Fabrication in Learning Environments
Digitalsts A Field Guide For Science Technology Studies
Families learning summit (final)
Chief Learning Officer Forum
Information sharing in transportation systems - more than digital pieces of p...
Online Marketing PART2 - new communication paradigm
Overview Of Technology Goddesses
Ad

More from Valeria de Paiva (20)

PDF
Dialectica Comonoids
PDF
Dialectica Categorical Constructions
PDF
Logic & Representation 2021
PDF
Constructive Modal and Linear Logics
PDF
Dialectica Categories Revisited
PDF
PLN para Tod@s
PDF
Networked Mathematics: NLP tools for Better Science
PDF
Going Without: a modality and its role
PDF
Problemas de Kolmogorov-Veloso
PDF
Natural Language Inference: for Humans and Machines
PDF
Dialectica Petri Nets
PDF
Negation in the Ecumenical System
PDF
NLCS 2013 opening slides
PDF
Dialectica Comonads
PDF
Logic and Probabilistic Methods for Dialog
PDF
Dialectica and Kolmogorov Problems
PDF
Gender Gap in Computing 2014
PDF
Categorical Proof Theory for Everyone
PDF
Dialectica and Kolmogorov Problems
PDF
Constructive Modalities
Dialectica Comonoids
Dialectica Categorical Constructions
Logic & Representation 2021
Constructive Modal and Linear Logics
Dialectica Categories Revisited
PLN para Tod@s
Networked Mathematics: NLP tools for Better Science
Going Without: a modality and its role
Problemas de Kolmogorov-Veloso
Natural Language Inference: for Humans and Machines
Dialectica Petri Nets
Negation in the Ecumenical System
NLCS 2013 opening slides
Dialectica Comonads
Logic and Probabilistic Methods for Dialog
Dialectica and Kolmogorov Problems
Gender Gap in Computing 2014
Categorical Proof Theory for Everyone
Dialectica and Kolmogorov Problems
Constructive Modalities

Recently uploaded (20)

PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPT
DATA COLLECTION METHODS-ppt for nursing research
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PPTX
Leprosy and NLEP programme community medicine
PDF
annual-report-2024-2025 original latest.
PDF
Lecture1 pattern recognition............
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Transcultural that can help you someday.
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
modul_python (1).pptx for professional and student
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Qualitative Qantitative and Mixed Methods.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
DATA COLLECTION METHODS-ppt for nursing research
ISS -ESG Data flows What is ESG and HowHow
STERILIZATION AND DISINFECTION-1.ppthhhbx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
importance of Data-Visualization-in-Data-Science. for mba studnts
Leprosy and NLEP programme community medicine
annual-report-2024-2025 original latest.
Lecture1 pattern recognition............
Optimise Shopper Experiences with a Strong Data Estate.pdf
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Transcultural that can help you someday.
Introduction-to-Cloud-ComputingFinal.pptx
modul_python (1).pptx for professional and student

Natural Language Inference for Humans

  • 1. 1/23 Introduction NL Inference Women in Computer Science Natural Language Inference for Humans Valeria de Paiva Women+@DCS Sheffield July 2020 Valeria de Paiva Women+@DCS
  • 2. 2/23 Introduction NL Inference Women in Computer Science Thanks, Aline! Valeria de Paiva Women+@DCS
  • 3. 3/23 Introduction NL Inference Women in Computer Science Personal stories I’m a logician, a proof-theorist, a computational semanticist and a category theorist. I work in industry in Silicon Valley, have done so for the last 20 years, applying the purest of pure mathematics, in surprising ways. Valeria de Paiva Women+@DCS
  • 4. 4/23 Introduction NL Inference Women in Computer Science Personal stories Valeria de Paiva Women+@DCS
  • 5. 5/23 Introduction NL Inference Women in Computer Science PARC, XLE and Bridge Valeria de Paiva Women+@DCS
  • 6. 6/23 Introduction NL Inference Women in Computer Science Powerset, Cuil and Nuance Valeria de Paiva Women+@DCS
  • 7. 7/23 Introduction NL Inference Women in Computer Science Natural Language Inference (NLI) A shock when the work of almost a decade at PARC was out of reach when I left in 2008 I gave a talk at SRI proposing to redo it all, open source (de Paiva 2010 Bridges) Pleased to report that almost all of it is now available open-source Most work with/by Katerina Kalouli, PhD student at Konstanz Valeria de Paiva Women+@DCS
  • 8. 8/23 Introduction NL Inference Women in Computer Science Natural Language Inference: why? In May 2016 Google announced Parsey McParseface, the world’s most accurate parser1: 94% accuracy In 2014 Marelli et al launched the SICK corpus at SemEval 2014: an easy (no named entities, no temporal phenomena, limited vocabulary, etc..), linguist curated corpus to test compositional knowledge Can we use SyntaxNet to process SICK with off-the-shelf tools such as WordNet and SUMO? It’s complicated! Five papers and counting! 1 ai.googleblog.com/2016/0/announcing-syntaxnet-worlds-most. html Valeria de Paiva Women+@DCS
  • 9. 9/23 Introduction NL Inference Women in Computer Science Natural Language Inference: what? Examples from SNLI dataset at Stanford Valeria de Paiva Women+@DCS
  • 10. 10/23 Introduction NL Inference Women in Computer Science NLI for Humans Easier to detect inference than to decide on “good”semantic representations Data-driven NLU need large, diverse, high-quality corpora annotated to learn inference relations: entails, contradicts, neutral Can we trust the corpora we have? Are they really learning logical inferences? Are the findings on the big corpora available SNLI, MNLI, SciTail, etc transferable and generalizable? (Plenty of recent work showing no, systems learn biases of the corpora, cannot be redeployed) Valeria de Paiva Women+@DCS
  • 11. 11/23 Introduction NL Inference Women in Computer Science NLI for SICK Explaining Simple Natural Language Inference ACL2019 Textual Inference: getting logic from humans IWCS2017 Correcting Contradictions, CONLI 2017 Graph Knowledge Representations for SICK, NLCS2018 WordNet for “Easy” Textual Inferences LREC2018 Valeria de Paiva Women+@DCS
  • 12. 12/23 Introduction NL Inference Women in Computer Science NLI for SICK Are the annotations in SICK logical? Can we trust them? Several problems: lack of guidelines on co-reference, how to annotate contradictions, ungrammatical and non-sensical sentences, noisy data, etc.. This meant contradictions in SICK are not symmetric and they need to be Contradictions require alignment between entities and events, which need to be ”close enough” how to decide when things are close enough? Can we do simpler case where sentences are ”one-word-apart”using WordNet? More guidelines necessary for SICK annotation? Valeria de Paiva Women+@DCS
  • 13. 13/23 Introduction NL Inference Women in Computer Science NLI for SICK https://logic-forall.blogspot.com/2020/03/ sick-dataset-in-these-trying-times.html Valeria de Paiva Women+@DCS
  • 14. 14/23 Introduction NL Inference Women in Computer Science Are we there yet? Manning: Computational Linguistics and Deep Learning, 2015 ”NLP is kind of like a rabbit in the headlights of the Deep Learning machine, waiting to be flattened.” Hinton 2015: ”I will be disappointed if in five years’ time we do not have something that can watch a YouTube video and tell a story about what happened.” [not totally flattened, yet] Valeria de Paiva Women+@DCS
  • 15. 15/23 Introduction NL Inference Women in Computer Science Conclusions so far Working for division of semantic labor between symbolic/structural and distributional approaches Have fledgling proposal GKR with strict separation of conceptual and contextual structures Also concrete proposal for injecting distributionality in GKR: promising results (COLING submission) Further Work: Still working to produce a ‘correct’ SICK Working on annotations and theorem provers test GKR with further datasets, further distributional architectures Valeria de Paiva Women+@DCS
  • 16. 16/23 Introduction NL Inference Women in Computer Science 4th Workshop Women in Logic 2020 Valeria de Paiva Women+@DCS
  • 17. 17/23 Introduction NL Inference Women in Computer Science Women in Computer Science I grew up believing most of the gender wars had been fought by our grandmothers, suffragettes or not. that the law allowed me to get into colleges and work places. that I could always apply for scholarships and grants. I had plenty of women teachers. I thought my job was to work hard and show people I could do the job as well as any man I knew the numbers were bad both in Computing and in Maths, but I thought they’re bad as usual, not particularly bad. That time would be on our side, that things were going to get more equal as time went by Valeria de Paiva Women+@DCS
  • 18. 18/23 Introduction NL Inference Women in Computer Science Women in Computer Science Valeria de Paiva Women+@DCS
  • 19. 19/23 Introduction NL Inference Women in Computer Science Women in Computer Science Valeria de Paiva Women+@DCS
  • 20. 20/23 Introduction NL Inference Women in Computer Science Women in Computer Science When Nat Shankar asked me if I wanted to say a few words about Logic in Computer Science, in its 30th birthday, I warned him that he might not like the few words. Then we launched the Workshop Women in Logic, the facebook group Women in Logic and the blog. Valeria de Paiva Women+@DCS
  • 21. 21/23 Introduction NL Inference Women in Computer Science Women in Computer Science Workshops in Iceland, UK, Canada and this year Paris, France. Funding for scholarships from SIGLOG, VCLA (Vienna Center for Logic and Algorithms), and ILLC (institute for Language, Logic, and Computation), Amsterdam, Netherlands. Valeria de Paiva Women+@DCS
  • 22. 22/23 Introduction NL Inference Women in Computer Science Women in Computer Science Data We have a spreadsheet of women logicians, editable by everyone, since 2012. A collection of spreadsheets checking numbers of female Invited Speakers in many of the theoretical Computer Science main conferences. Careful work on number of women invited speakers for the ASL meetings (thanks Johanna Franklin!) Have a mailing list and many plans. Join us! Valeria de Paiva Women+@DCS
  • 23. 23/23 Introduction NL Inference Women in Computer Science More information GKR Demo: http://lap0973.sprachwiss.uni-konstanz.de: 8080/sem.mapper/ GKR source code: https://github.com/kkalouli/GKR_semantic_parser Ask KAterina questions! Play with it and tells us all the other things we haven’t done, yet! Thanks! Valeria de Paiva Women+@DCS