SlideShare a Scribd company logo
Survey on Discourse
Annotation for Arabic
A. Algarni, H. Alharbi and N. Almutairy
Supervisor: Dr. A. Alsaif
April 23, 2013
Kingdom of Saudi Arabia
Ministry of Higher Education
Imam Mohammed Ibn Saud Islamic University
College of computer and Information Sciences
CS465 - Natural Language Processing –
1
Outline
 Introduction
 The Leeds Arabic Discourse Treebank
 Discourse Connective Recognition
 Discourse Relation Recognition
 Semantic-Based Segmentation
 Discourse Segmentation Based on Rhetorical
Methods
 A Comprehensive Taxonomy of Arabic Discourse
Coherence Relations
2
Introduction
 Linguistic annotation covers any descriptive
or analytic notations applied to raw language
data.
 Annotated Discourse Corpora can be very
useful to facilitate theoretical studies along
with contributing in the development of NLP
applications.
3
Applications
 Information extraction
 Question-answering
 Summarization
 Machine translation, generation.
4
Discourse Relations and
Discourse Connectives
 Discourse Relation is the way that two
arguments (text segments) logically connected.
 Temporal, Comparison, Causal, Expansion..etc
 Discourse Connective (DC) :A lexical marker
used to link two abstract objects in a text.
 Abstract Object (AO) : Abstract objects in
discourse are things like proposition
, events, facts and opinions.
 Argument (Arg) : A text expressing an abstract
object and linked by a DC.
5
The Leeds Arabic Discourse
Treebank
6
• First effort towards producing an Arabic
Discourse Treebank was introduced in 2011
by A. Alsaif and K. Markert.
• Collected a large set of Arabic discourse
connectives using text analysis and corpus
based techniques.
•Final list contains 107 discourse
connectives.
Types of Discourse connectives
7
Types of Relations
8
Types of Relations Cont..
 COMPARISON.Similarity:
9
Arabic Discourse Annotation Tool
(ADA) and Annotation Process
10
Annotation Methodology
1. Measuring whether annotators agree on
the binary decision on whether an item
constitutes a discourse connective in
context.
2. Measuring whether annotators agree on
which discourse relation an identified
connective expresses. As annotators can
use sets of relations for a connective.
11
Results
 Agreement in task 1 is highly reliable
(N=23331) percentage agreement of
0.95, kappa of 0.88.
 Agreement in task 2 (relation assignment)
is relatively low (N=5586), percentage
agreement of 0.66, kappa 0.57, and alpha
of 0.58.
12
Discourse Connective Recognition
 To distinguish between discourse and non-
discourse usage of a connective.
 Example: once, while.
 A. Alsaif and K.Markert (2011) introduced
a Connective identifier for Arabic based on
syntactic features.
13
Discourse Connective Recognition
by A. Alsaif and K.Markert (2011)
Features:
 Surface Features (SConn)
 Lexical features of surrounding words
(Lex)
 Example
Arg1DC
Arg2.
[Children might be tired]Arg1 [and]DC [feel sleepy]Arg2 during school time if they did
not sleep well
14
Features:
 Part of Speech features (POS)
 Syntactic category of related phrases
(Syn) (E.g.: / the school is
very large and beautiful)
 Al-Masdar feature.
Discourse Connective Recognition
by A. Alsaif and K.Markert (2011) Cont…
15
 Results
Discourse Connective Recognition
by A. Alsaif and K.Markert (2011) Cont…
Features Acurr K
Baseline (not Conn) 68.9 0
M1 Conn only 75.7 0.48
Tokenization by white space + auto tagger
M2
M3
M4
Conn+ SConn+Lex
Conn+ SConn+Lex+POS
Conn+SConn+Lex+POS+Masdar
85.6 0.62
87.6 0.69
88.5 0.70
ATB-based features
M5
M6
M7
Conn+SConn+Lex
Conn+SConn+Lex+Syn/POS
Conn+SConn+Lex+Syn/POS+Masdar
86.2 0.65
91.2 0.79
92.4 0.82
M8
M9
Conn+SConn+Syn
SConn+Lex+Syn+Masdar
91.2 0.79
91.2 0.79
16
Discourse Relation Recognition
 To identify the type of the relation
 A. Alsaif and K.Markert (2011) introduced
the first algorithms to automatically
identify relations for Arabic
17
Features:
 Connective features
 Words and POS of arguments
 Masdar
 Tense and Negation
 Length, Distance and Order Features
 Argument Parent
 Production Rules
Discourse Relation Recognition
by A. Alsaif and K.Markert (2011)
18
Results
Acurr kFeatures
All connectives (6039)
52.5 0Baseline (CONJUNCTION)
77.2 0.60
78.7 0.66
78.3 0.65
Conn only (1)
Conn+Conn f+ Arg f (37)
Conn+Conn f+ Arg f+ Production rules (1237)
M1
M2
M3
Excluding wa at BOP (3813)
35 0Baseline (CONJUNCTION)
74.3 0.65
77.0 0.69
76.7 0.69
Conn only (1)
Conn+Conn f+ Arg f (37)
Conn+Conn f+ Arg f+ Production rules (1237)
M1
M2
M3
19
Results
Acurr kFeatures
All connectives (6039)
62.4 0Baseline (EXPANSION )
88.7 0.78
88.7 0.78
Conn only (1)
Conn+Conn f+ Arg f (37)
M1
M2
Excluding wa at BOP (3813)
41.8 0Baseline (EXPANSION)
82.7 0.74
83.5 0.75
Conn only (1)
Conn+Conn f+ Arg f (37)
M1
M2
20
Semantic-Based Segmentation of
Arabic Texts
 Corpus Analysis
 Definition: Let L be a list of candidate
segments connectors, each element c in L is
classified based on its effects on the text
segmentation as either active or passive
 Examples:
.1[
][
[
.2]][
]
[
21
Segmentation Process
 Identifying the connectors that indicate
complete segments.
 Locating the active connectors.
 Resolving the case where adjacent active
connectors exist.
 Setting the segments boundaries.
 Creating the final list of segments.
22
Discussion
 evaluate the segmentation process, they
collected ten essays.
 Each essay ranges between 500 and 700
words.
 After implementing the segmentation
process.
 Gave the output to judges to evaluate
them in terms of two factors: correct
hit and incorrect hit.
23
Discussion Cont..
Incorrect hitCorrect hitEssay
0331
1152
0253
1234
0205
1296
1267
2338
0269
02210
24
Arabic Discourse Segmentation
Based on Rhetorical Methods
 This Method is depends on the meaning of
the connector " " in Arabic language.
 There are six types of " " classified into
two classes, "Fasl" and "Wasl " :
 "Fasl " : segmenting place.
 "Wasl " : unsegmenting but connecting
the text.
25
Types of Connector " "
ClassExampleType
Fasl
Fasl
Fasl
Wasl
Wasl
Wasl
26
The Arabic sentence
Segmentation System
27
Feature Extraction
•The following are the features of " ":
X3 = noun and X7 = accusative mark.
28
Experiment and Results
 They used 1200 instances for training.
 They used 293 instances for testing after
testing there are 290 correct and 3
incorrect instances.
 The result with:
94.68%Recall
96.82%Precision
98.98 %Accuracy
29
A Comprehensive Taxonomy of Arabic
Discourse Coherence Relations
 Coherence relations are classified into two
types: explicit relations and implicit
relations.
exampleCoherence relations
I am very happy because I got
excellent marks in exams.
Explicit relations
I am very happy. I got excellent
marks in exams.
Implicit relations.
30
The procedure of creating an Arabic
Taxonomy of Coherence Relations
31
Examples of Implicit Arabic
relations
 "Impossible condition / " :
 "Cascaded questioning/ :
(
32
Results
 They got a set of 47 Arabic coherence
relations.
coherence relations.Result
From English coherence
relations.
31
additional Arabic explicit
coherence relations.
12
Arabic implicit relations.4
33
Conclusion
Discourse Annotation is a very fertile field
and it has many NLP applications, for
Arabic there are some challenges due to
the lack of annotated corpora and studies.
34
Thank You
35

More Related Content

PDF
English to punjabi machine translation system using hybrid approach of word s
PPTX
Intent Classifier with Facebook fastText
PDF
An approach to word sense disambiguation combining modified lesk and bag of w...
PDF
AN APPROACH TO WORD SENSE DISAMBIGUATION COMBINING MODIFIED LESK AND BAG-OF-W...
PDF
4213ijaia04
PDF
Improvement wsd dictionary using annotated corpus and testing it with simplif...
PDF
Isolated word recognition using lpc & vector quantization
PDF
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
English to punjabi machine translation system using hybrid approach of word s
Intent Classifier with Facebook fastText
An approach to word sense disambiguation combining modified lesk and bag of w...
AN APPROACH TO WORD SENSE DISAMBIGUATION COMBINING MODIFIED LESK AND BAG-OF-W...
4213ijaia04
Improvement wsd dictionary using annotated corpus and testing it with simplif...
Isolated word recognition using lpc & vector quantization
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION

What's hot (17)

PDF
Arabic named entity recognition using deep learning approach
PPTX
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
PDF
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
PDF
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
PDF
Metrics for Evaluating Quality of Embeddings for Ontological Concepts
PDF
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
PDF
A Survey of Various Methods for Text Summarization
PDF
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
PDF
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
PDF
GDG Tbilisi 2017. Word Embedding Libraries Overview: Word2Vec and fastText
PDF
Rule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
PDF
text summarization using amr
PDF
An exploratory research on grammar checking of Bangla sentences using statist...
PDF
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
PPTX
Effect of morphological segmentation & de-segmentation on machine translation...
PDF
PDF
DOMAIN BASED CHUNKING
Arabic named entity recognition using deep learning approach
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
Parameters Optimization for Improving ASR Performance in Adverse Real World N...
Metrics for Evaluating Quality of Embeddings for Ontological Concepts
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
A Survey of Various Methods for Text Summarization
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
GDG Tbilisi 2017. Word Embedding Libraries Overview: Word2Vec and fastText
Rule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
text summarization using amr
An exploratory research on grammar checking of Bangla sentences using statist...
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Effect of morphological segmentation & de-segmentation on machine translation...
DOMAIN BASED CHUNKING
Ad

Viewers also liked (17)

PPTX
Syntactic parsing for arabic
PDF
Arabic speech recognition
PPTX
Coreference recognition in arabic
PPTX
Speech recognition for arabic
PPTX
Arabic question answering ‫‬
PPTX
Arabic spell checkers
PPTX
Automatic summaraitztion for_arabic
PPTX
Discourse annotation for arabic
PPTX
Discourse annotation for arabic 3
PPTX
Discourse annotation
PPTX
Building corpus from www for arabic
PPTX
The named entity recognition (ner)2
PPTX
Arabic to-english machine translation
PPTX
Part of speech tagging for Arabic
PPTX
Arabic spell checking approaches
PPTX
Arabic tokenization and stemming
PPTX
Sentiment analysis of arabic,a survey
Syntactic parsing for arabic
Arabic speech recognition
Coreference recognition in arabic
Speech recognition for arabic
Arabic question answering ‫‬
Arabic spell checkers
Automatic summaraitztion for_arabic
Discourse annotation for arabic
Discourse annotation for arabic 3
Discourse annotation
Building corpus from www for arabic
The named entity recognition (ner)2
Arabic to-english machine translation
Part of speech tagging for Arabic
Arabic spell checking approaches
Arabic tokenization and stemming
Sentiment analysis of arabic,a survey
Ad

Similar to Discourse annotation for arabic 2 (20)

PDF
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
PDF
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
PDF
Dialect classification using acoustic and linguistic features in Arabic speech
PDF
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
PDF
CHUNKER BASED SENTIMENT ANALYSIS AND TENSE CLASSIFICATION FOR NEPALI TEXT
PDF
Chunker Based Sentiment Analysis and Tense Classification for Nepali Text
PDF
Chunker Based Sentiment Analysis and Tense Classification for Nepali Text
PDF
AN EFFICIENT APPROACH TO IMPROVE ARABIC DOCUMENTS CLUSTERING BASED ON A NEW K...
PDF
AN EFFICIENT APPROACH TO IMPROVE ARABIC DOCUMENTS CLUSTERING BASED ON A NEW K...
PDF
129966864160453838[1]
PDF
Classification of Arabic Texts using Four Classifiers
PDF
The effect of training set size in authorship attribution: application on sho...
PDF
Text-Based Detection of On-Hold Scripts in Contact Center Calls
PDF
Text-Based Detection of On-Hold Scripts in Contact Center Calls
PDF
Text-Based Detection of On-Hold Scripts in Contact Center Calls
PDF
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
PPTX
1 l5eng
PDF
Athifah procedia technology_2013
PDF
EasyChair-Preprint-7375.pdf
PDF
dialogue act modeling for automatic tagging and recognition
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
Dialect classification using acoustic and linguistic features in Arabic speech
An Efficient Semantic Relation Extraction Method For Arabic Texts Based On Si...
CHUNKER BASED SENTIMENT ANALYSIS AND TENSE CLASSIFICATION FOR NEPALI TEXT
Chunker Based Sentiment Analysis and Tense Classification for Nepali Text
Chunker Based Sentiment Analysis and Tense Classification for Nepali Text
AN EFFICIENT APPROACH TO IMPROVE ARABIC DOCUMENTS CLUSTERING BASED ON A NEW K...
AN EFFICIENT APPROACH TO IMPROVE ARABIC DOCUMENTS CLUSTERING BASED ON A NEW K...
129966864160453838[1]
Classification of Arabic Texts using Four Classifiers
The effect of training set size in authorship attribution: application on sho...
Text-Based Detection of On-Hold Scripts in Contact Center Calls
Text-Based Detection of On-Hold Scripts in Contact Center Calls
Text-Based Detection of On-Hold Scripts in Contact Center Calls
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
1 l5eng
Athifah procedia technology_2013
EasyChair-Preprint-7375.pdf
dialogue act modeling for automatic tagging and recognition

Recently uploaded (20)

PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Getting Started with Data Integration: FME Form 101
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
August Patch Tuesday
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
The various Industrial Revolutions .pptx
DOCX
search engine optimization ppt fir known well about this
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
Modernising the Digital Integration Hub
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
Five Habits of High-Impact Board Members
PDF
STKI Israel Market Study 2025 version august
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Hybrid model detection and classification of lung cancer
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
Enhancing emotion recognition model for a student engagement use case through...
Getting Started with Data Integration: FME Form 101
Benefits of Physical activity for teenagers.pptx
Getting started with AI Agents and Multi-Agent Systems
August Patch Tuesday
Hindi spoken digit analysis for native and non-native speakers
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
NewMind AI Weekly Chronicles – August ’25 Week III
The various Industrial Revolutions .pptx
search engine optimization ppt fir known well about this
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Modernising the Digital Integration Hub
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
Final SEM Unit 1 for mit wpu at pune .pptx
Five Habits of High-Impact Board Members
STKI Israel Market Study 2025 version august
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Hybrid model detection and classification of lung cancer
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor

Discourse annotation for arabic 2

  • 1. Survey on Discourse Annotation for Arabic A. Algarni, H. Alharbi and N. Almutairy Supervisor: Dr. A. Alsaif April 23, 2013 Kingdom of Saudi Arabia Ministry of Higher Education Imam Mohammed Ibn Saud Islamic University College of computer and Information Sciences CS465 - Natural Language Processing – 1
  • 2. Outline  Introduction  The Leeds Arabic Discourse Treebank  Discourse Connective Recognition  Discourse Relation Recognition  Semantic-Based Segmentation  Discourse Segmentation Based on Rhetorical Methods  A Comprehensive Taxonomy of Arabic Discourse Coherence Relations 2
  • 3. Introduction  Linguistic annotation covers any descriptive or analytic notations applied to raw language data.  Annotated Discourse Corpora can be very useful to facilitate theoretical studies along with contributing in the development of NLP applications. 3
  • 4. Applications  Information extraction  Question-answering  Summarization  Machine translation, generation. 4
  • 5. Discourse Relations and Discourse Connectives  Discourse Relation is the way that two arguments (text segments) logically connected.  Temporal, Comparison, Causal, Expansion..etc  Discourse Connective (DC) :A lexical marker used to link two abstract objects in a text.  Abstract Object (AO) : Abstract objects in discourse are things like proposition , events, facts and opinions.  Argument (Arg) : A text expressing an abstract object and linked by a DC. 5
  • 6. The Leeds Arabic Discourse Treebank 6 • First effort towards producing an Arabic Discourse Treebank was introduced in 2011 by A. Alsaif and K. Markert. • Collected a large set of Arabic discourse connectives using text analysis and corpus based techniques. •Final list contains 107 discourse connectives.
  • 7. Types of Discourse connectives 7
  • 9. Types of Relations Cont..  COMPARISON.Similarity: 9
  • 10. Arabic Discourse Annotation Tool (ADA) and Annotation Process 10
  • 11. Annotation Methodology 1. Measuring whether annotators agree on the binary decision on whether an item constitutes a discourse connective in context. 2. Measuring whether annotators agree on which discourse relation an identified connective expresses. As annotators can use sets of relations for a connective. 11
  • 12. Results  Agreement in task 1 is highly reliable (N=23331) percentage agreement of 0.95, kappa of 0.88.  Agreement in task 2 (relation assignment) is relatively low (N=5586), percentage agreement of 0.66, kappa 0.57, and alpha of 0.58. 12
  • 13. Discourse Connective Recognition  To distinguish between discourse and non- discourse usage of a connective.  Example: once, while.  A. Alsaif and K.Markert (2011) introduced a Connective identifier for Arabic based on syntactic features. 13
  • 14. Discourse Connective Recognition by A. Alsaif and K.Markert (2011) Features:  Surface Features (SConn)  Lexical features of surrounding words (Lex)  Example Arg1DC Arg2. [Children might be tired]Arg1 [and]DC [feel sleepy]Arg2 during school time if they did not sleep well 14
  • 15. Features:  Part of Speech features (POS)  Syntactic category of related phrases (Syn) (E.g.: / the school is very large and beautiful)  Al-Masdar feature. Discourse Connective Recognition by A. Alsaif and K.Markert (2011) Cont… 15
  • 16.  Results Discourse Connective Recognition by A. Alsaif and K.Markert (2011) Cont… Features Acurr K Baseline (not Conn) 68.9 0 M1 Conn only 75.7 0.48 Tokenization by white space + auto tagger M2 M3 M4 Conn+ SConn+Lex Conn+ SConn+Lex+POS Conn+SConn+Lex+POS+Masdar 85.6 0.62 87.6 0.69 88.5 0.70 ATB-based features M5 M6 M7 Conn+SConn+Lex Conn+SConn+Lex+Syn/POS Conn+SConn+Lex+Syn/POS+Masdar 86.2 0.65 91.2 0.79 92.4 0.82 M8 M9 Conn+SConn+Syn SConn+Lex+Syn+Masdar 91.2 0.79 91.2 0.79 16
  • 17. Discourse Relation Recognition  To identify the type of the relation  A. Alsaif and K.Markert (2011) introduced the first algorithms to automatically identify relations for Arabic 17
  • 18. Features:  Connective features  Words and POS of arguments  Masdar  Tense and Negation  Length, Distance and Order Features  Argument Parent  Production Rules Discourse Relation Recognition by A. Alsaif and K.Markert (2011) 18
  • 19. Results Acurr kFeatures All connectives (6039) 52.5 0Baseline (CONJUNCTION) 77.2 0.60 78.7 0.66 78.3 0.65 Conn only (1) Conn+Conn f+ Arg f (37) Conn+Conn f+ Arg f+ Production rules (1237) M1 M2 M3 Excluding wa at BOP (3813) 35 0Baseline (CONJUNCTION) 74.3 0.65 77.0 0.69 76.7 0.69 Conn only (1) Conn+Conn f+ Arg f (37) Conn+Conn f+ Arg f+ Production rules (1237) M1 M2 M3 19
  • 20. Results Acurr kFeatures All connectives (6039) 62.4 0Baseline (EXPANSION ) 88.7 0.78 88.7 0.78 Conn only (1) Conn+Conn f+ Arg f (37) M1 M2 Excluding wa at BOP (3813) 41.8 0Baseline (EXPANSION) 82.7 0.74 83.5 0.75 Conn only (1) Conn+Conn f+ Arg f (37) M1 M2 20
  • 21. Semantic-Based Segmentation of Arabic Texts  Corpus Analysis  Definition: Let L be a list of candidate segments connectors, each element c in L is classified based on its effects on the text segmentation as either active or passive  Examples: .1[ ][ [ .2]][ ] [ 21
  • 22. Segmentation Process  Identifying the connectors that indicate complete segments.  Locating the active connectors.  Resolving the case where adjacent active connectors exist.  Setting the segments boundaries.  Creating the final list of segments. 22
  • 23. Discussion  evaluate the segmentation process, they collected ten essays.  Each essay ranges between 500 and 700 words.  After implementing the segmentation process.  Gave the output to judges to evaluate them in terms of two factors: correct hit and incorrect hit. 23
  • 24. Discussion Cont.. Incorrect hitCorrect hitEssay 0331 1152 0253 1234 0205 1296 1267 2338 0269 02210 24
  • 25. Arabic Discourse Segmentation Based on Rhetorical Methods  This Method is depends on the meaning of the connector " " in Arabic language.  There are six types of " " classified into two classes, "Fasl" and "Wasl " :  "Fasl " : segmenting place.  "Wasl " : unsegmenting but connecting the text. 25
  • 26. Types of Connector " " ClassExampleType Fasl Fasl Fasl Wasl Wasl Wasl 26
  • 28. Feature Extraction •The following are the features of " ": X3 = noun and X7 = accusative mark. 28
  • 29. Experiment and Results  They used 1200 instances for training.  They used 293 instances for testing after testing there are 290 correct and 3 incorrect instances.  The result with: 94.68%Recall 96.82%Precision 98.98 %Accuracy 29
  • 30. A Comprehensive Taxonomy of Arabic Discourse Coherence Relations  Coherence relations are classified into two types: explicit relations and implicit relations. exampleCoherence relations I am very happy because I got excellent marks in exams. Explicit relations I am very happy. I got excellent marks in exams. Implicit relations. 30
  • 31. The procedure of creating an Arabic Taxonomy of Coherence Relations 31
  • 32. Examples of Implicit Arabic relations  "Impossible condition / " :  "Cascaded questioning/ : ( 32
  • 33. Results  They got a set of 47 Arabic coherence relations. coherence relations.Result From English coherence relations. 31 additional Arabic explicit coherence relations. 12 Arabic implicit relations.4 33
  • 34. Conclusion Discourse Annotation is a very fertile field and it has many NLP applications, for Arabic there are some challenges due to the lack of annotated corpora and studies. 34