SlideShare a Scribd company logo
International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064
Volume 2 Issue 9, September 2013
www.ijsr.net
A Review on Novel Scoring System for Identify
Accurate Answers for Factoid Questions
1
Harpreet Kaur, 2
Rimpi
1, 2
Swami Vivekanand Institute of Engineering and Technology
Banur, Punjab, India
Abstract: In this research work we have develop a new scoring mathematical model that works on the five types of questions.
The question text failures are first extracted and a score is found based on its structure with respect to its template structure and
then answer score is calculated again the question as well as paragraph. Text to finally reach at the index of the most probable
answer with respect to question.
Keywords: Natural language processing, Question answering System, Information retrieval.
1. Introduction
NLP focuses on communications between computers and
natural languages in terms of theoretical results and practical
applications, and on information sharing now that
information is exchange as it never has been before and
sharing information becoming the leading theme in the
domain of NLP systems. Question Answering (QA) systems
go beyond the usual Information Retrieval (IR) systems
which underlay popular Internet search engines. QA systems
have the aim of responding to natural language questions
whereas IR systems take up keywords from users and deploy
some intelligent search mechanisms on a document
collection to get back to the user with a ranked list of
documents rather than an exact answer.
There is a need for tools that would reduce the amount of
text in order to obtain the desired information [1] [2] [3].
People have questions and they need answers, not documents
and they always prefer to express the questions in their
native language without being restricted to a particular query
language, query formation rules, or even a particular
knowledge domain. Automatic question answering system
will help for the above technology. In this Question
Answering System consists of three distinct phases: Question
classification, information retrieval or document processing
and answer extraction.
The design of a standard QA system assumes that the
language in which the question is asked and the text
collection available to be processed are all in the same
language. However, there might be a need for cross-lingual
QA system which takes in questions in one language and
searches through a document collection in a different
language to get to the answer. English QA system research
attempts to deal with a wide range of question types like
WHEN, WHERE, WHAT, HOW, WHOM, WHY &
WHOSE. Thus the aim of a QA system is to localize the
exact answer to a question from a structured or a non-
structured collection of texts. The QA system states that a
QA task can be decomposed into three main sub problems.
The sub problems are:- Question processing, information
retrieval or document Processing and answer processing and.
The question processing stage is responsible for taking a
question in a natural language and producing some kind of
intelligent representation of the raw question string so that it
becomes more useful for finding answers. The document
processing stage is used to reduce the search space of the
document collection where the answer to the question can be
expected. This stage is basically a complete Information
Retrieval system where the idea is to take in some keywords
and produce a ranked list of documents related to those
keywords. The final stage of a QA system is the answer
processing stage where the system does some intelligent
matching with the output of the previous two stages to
produce an answer to the given question. Any QA system
should have these four basic components and may have a
number of other components to make the system more useful
and robust.
1.1 What is Question and Answer System?
Questions answering (QA) systems look for the answer of a
question in a large collection of documents. The question is
in natural language. QA systems select text passages. Then,
after that the answer is extracted from these passages,
according to criteria issued from the question analysis. To
facilitate the question generation task, we build text from the
input complex text using a syntactic parser. We classify the
text based on their subject, verb, object and preposition for
determining the possible type of questions to be generated.
The ability of QA systems to recognize a great amount of
answer types is related to their powerfulness for extracting
right answers [4] [5] [6].
1.2 Why do we need QSA System?
Question Answering (QA) Systems allow the user to ask
questions in a natural language and obtain an exact answer.
In this, we tried to learn the important issues in the field of
Question Answering (QA) systems. We peeked into the
internals of many established QA systems. we do not only
consider simple questions but text problems consisting of
several sentences. Our approach to translating the natural
language question uses an underlying corpus and the
knowledge base to derive meaningful and relevant patterns
which can then be used to process the questions and capture
their meaning with respect to the underlying knowledge
base.
154Paper ID: 05091302
International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064
Volume 2 Issue 9, September 2013
www.ijsr.net
2. Types of QA systems
Different types of QA systems which are divided into two
major groups based on the methods used by them. First
group of QA system belongs to simple natural language
processing and information retrieval methods, while another
group of QA systems are dependent upon the reasoning with
natural language.
2.1 Web Based Question Answering System
This web bases QA system submits the question to the search
engine like Google, Yahoo etc and grabs its top 100 search
results..Given a user's natural language question, the system
will submit the question to a search engine, then extract all
possible answers from the search results according to the
question type identified by the question classification
module, finally select the most possible answers to return.
The Web Based QA systems mostly handles wh-type of
questions such as “Who was the first American in space?”
Or “Which of the following is correct”. This QA system
provides answers in various forms like text documents, Xml
documents or Wikipedia. The common levels that are used
by different web based Question Answering systems
architectures are as follows [7]:
 Question Classification:- In order to correctly answer a
question, usually one needs to understand what type of
information the question asks for, e.g., the sample
question “Who was the first American in space?” asks for
a person name. The question classification is made to
provide better accuracy in the results.
 Answer Extraction: - In this, extracts the correct
possible answers for different classification of questions.
 Answer Selection:- Among the possible answers
obtained, ranking approaches are used to find out the best
accurate answers based on its weight age factor. Answer
classes generally is of factoid and non - factoid types. The
factoid is getting short fact based answers like names,
dates, and non-factoid is getting descriptions or
definitions [9]. Given a user's natural language question,
the system will submit the question to a search engine,
then extract all possible answers from the search results
according to the question type identified by the question
classification module, finally select the most similar
answers to return. The architecture of web based question
answering system is shown in figure 2[8].
Figure 1: Architecture of Web based question answering
system
2.2 IR / IE Based Question Answering Systems
Question Answering, the process of extracting answers to
natural language questions is profoundly different from
Information Retrieval (IR) or Information Extraction (IE). IR
systems allow us to locate relevant documents that relate to a
query, but do not specify exactly where the answers are. In
IR, the documents of interest are fetched by matching query
keywords to the index of the document collection. IE
systems need several resources like Named Entity Tagging
(NE), Template Element (TE), Template relation (TR),
Correlated Element (CE), and General Element (GE). IE
systems architecture is build into distinct levels:
 Level 1 NE tagger is use to handle named entity elements
in the text (who, when, where, what etc..,).
 Level 2 handles NE tagging + adj like (how far, how long,
how often etc..,),
 Level 3 builds the correlated entities by using the most
important entity in the question and prepares General
Element (GE) which consists of asking point of view. For
Eg: “How did John pass the exam?” The ASKING
POINT is clearly defined i.e. Person (Noun) if we by
passing this question into the separate levels which was
mentioned above.
 The Architecture of IR/IE based question answering
system is given in figure3
Figure 2: Architecture of IR/IE based question answering
2.3 Restricted Domain Question Answering systems
QA systems for restricted domains may be designed to
retrieve answers from so-called unstructured data (free texts),
semi-structured data (such as XML-annotated texts), or
structured data (databases). Question answering applied to
restricted domains is interesting and challenging in two
important respects. Question answering on restricted
domains requiring the processing of complex questions and
offering the opportunity to carry out complex analysis of the
text sources and the questions [3] [10]. The main difference
between open-domain question answering and restricted-
domain question answering is the existence of domain-
dependent information that can be used to improve the
accuracy of the system.
2.4 Rule Based Question Answering Systems
The rule-based system uses lexical and semantic heuristics to
look for evidence that a sentence contains the answer to a
question. Each type of WH question looks for different types
of answers, so Quarc uses a separate set of rules for each
question type (WHO, WHAT, WHEN, WHERE,
WHY,WHOSE,WHOM). Given a question and a story,
155Paper ID: 05091302
International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064
Volume 2 Issue 9, September 2013
www.ijsr.net
Quarc parses the question and all of the sentences in the
story using our partial parser Sundance. Much of the
syntactic analysis is not used, but Quarc does use the
morphological analysis, part-of- speech tagging, semantic
class tagging, and entity recognition. The rules are applied to
each sentence in the story, as well as the title of the story,
with the exception that the title is not considered for WHY
questions. “Who” rules looks for Names that are mostly
Nouns of persons or things. The “What” questions were the
most difficult to handle because they sought an amazing
variety of answers and it consists of DATE expression or
nouns. “When” questions almost always require a TIME
expression, so sentences that do not contain a TIME
expression are only considered in special cases. The
“Where” questions almost always look for specific locations,
so the WHERE rules are much focused. “Why” questions are
handled differently than other questions. The WHY rules are
based on the observation that the answer to a WHY question
often appears immediately before or immediately after the
sentence that most closely matches the question. We believe
that this is due to the causal nature of WHY questions. The
“Whose/Whom” these questions usually ask about an
individual or an organization. The rule based QA systems
first establish parse notations and generate training cases and
test cases through the semantic model. This system consists
of some common modules like IR module and Answer
identifier or Ranker Module.
3. Application of QAS
Question answering has many applications. We can
subdividing these applications based upon the source of the
answers: structured data (databases), semi-structured data
(for example, comment fields in databases) or free text. We
can further distinguish among search over a fixed set of
collections, as used in TREC (particularly useful for
evaluation); search over the Web, search over a collection or
book, e.g. an encyclopaedia or search over a single text, as
done for reading comprehension evaluations[11]. Another
application is in education can also find uses for Question
Answering in fields where there are frequently asked
questions that people wants to search.
4. A Review on Methodology
First, we collect the articles from encyclopaedia. These
articles bank created for extraction of text. When the full
extraction system generated multiple outputs from an input
sentence and text, we randomly sampled one of them.
Figure 3: Flow of Question Answer
The run text parser is that in which we can take text as a
input and break it up into meaning components in some way.
After that using text parsing algorithm identify noun/verbs
combination to develop questions. The last but not least
regular expressions algorithm is used to specify the pattern
that provides concise and flexible means to match. Strings of
text such as, particular character, words or patterns of
character. Our main purpose of a Question Answers System
(QAS) is to find out who did what to whom, where, when,
how why and whose? And after that the answers are
extracted for the questions of types (What), (Where),
(When), (Who), (Why) & (Whose). At the end based on
these questions the performance is evaluated.
5. Conclusion
In this paper we discussed some of the approaches used in
the existing QA system and proposed a new architecture for
QA system retrieves the exact answer. It presents a method
checking that an answer is of the specific type expected by
the question. It can be used to improve question answering
system by checking all returned answers. However, it cannot
be used alone to select the good answer. Answering system
has become an important component of the online education
platform. The goal of a question answering system is to
retrieving answers to questions rather than full documents or
best matching passages, as most information retrieval
systems.
References
[1] Haque, Nafid. and Rosner, Mike. A prototype
framework for a Bangla question answering system
using translation based on transliteration and table look-
up as an interface for the medical domain. University of
Malta Gertjan Van Noord, University of Groningen
[2] Ashish Kumar Saxena, Ganesh Viswanath Sambhu, L.
Venkata Subramaniam*, Saroj Kaushik, ”IITD-IBMIRL
System for Question Answering using Pattern Matching,
Semantic Type and Semantic Category Recognition”
OCT 2007.
[3] Poonam Gupta & Vishal Gupta(2012)” A Survey of
Text Question Answering Techniques” International
Journal of Computer Applications (0975 – 8887)Vol No.
2012
156Paper ID: 05091302
International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064
Volume 2 Issue 9, September 2013
www.ijsr.net
[4] Arnaud Grappy, Brigitte Grau”Answer type validation
in question answering systems”Le centre de hautes
etudes internationals dtnnformatique documentaire
Paris, France, France ©2010
[5] S. M. Harabagiu, M. A. Pa_sca, and S. J. Maiorano.
Experiments with open-domain textual question
answering. In Proceedings of the 18th conference on
Computational linguistics, Morristown, NJ, USA, 2000.
Association for Computational Linguistics
[6] E. Hovy, L. Gerber, U. Hermjakob, C.-Y. Lin, and D.
Ravichandran. Toward semantics-based answer
pinpointing. In HLT '01: Proceedings of the _rst
international conference on Human language technology
research, Morristown, NJ, USA, 2001. Association for
Computational Linguistics
[7] Guda, Vanitha., Sanampudi, Suresh. Kumar. And
Manikyamba, I.Lalkshmi ,”Approaches For Question
Answering Systems” , Vanitha Guda et al. /
International Journal of Engineering Science and
Technology (IJEST) ISSN : 0975-5462 Vol. 3 No. 2011.
990-995
[8] Rodrigo, Alvaro. Perez-Iglesias, joaqum., Penas,
Anselmo., Garrido, Guillermo and Araujo,Lourdes. A
Question Answering System based on Information
Retrieval and Validation
[9] Quarteroni, S. and Manandhar S. “Designing an
Interactive Open-Domain Question Answering System”.
Journal of Natural Language Engineering 1. 1-23.
[10]Anette Frank, Hans-Ulrich Krieger, Feiyu Xu, Hans
Uszkoreit, Berthold Crysmann, Brigitte Jörg, Ulrich
Schäfe” Question answering from structured knowledge
sources” German Research Center for Artificial
Intelligence, DFKI, Stuhlsatzenhausweg 3, 66123
Saarbrücken, Germany, Journal of Applied Logic 5
(2007) 20–48
[11]Gupta, Vishal and Lehal, Gurpreet S. “A Survey of Text
Mining Techniques and Applications”. Journal of
Emerging Technologies in web Intelligence, VOL. 1,
No. 1.
Author Profile
Harpreet Kaur is currently perusing M. Tech in
Computer Science and Engineering from Swami
Vivekanand Institute of Engineering & Technology,
Banur, Punjab. She holds the degree of B. Tech in
Computer Science and Technology from Baba Banda
Singh Bahadur Engineering and Technology, Fathegarh sahib,
Punjab.
Er. Rimpi is currently working as Assistant Professor
in Computer Science and Engineering Department at
Swami Vivekanand Institute of Engineering and
Technology, Banur. She has completed her M. Tech in
Computer Engineering from Guru Nanak Dev University, Amritsar,
Punjab in 2011. She holds the degree of B. Tech in Computer
Science and Technology from Guru Nanak Dev University,
Amritsar, Punjab in 2009.
157Paper ID: 05091302

More Related Content

PDF
Architecture of an ontology based domain-specific natural language question a...
PDF
QUESTION ANSWERING SYSTEM USING ONTOLOGY IN MARATHI LANGUAGE
PDF
Paper id 28201441
PDF
Generation of Question and Answer from Unstructured Document using Gaussian M...
PPTX
PDF
A Survey on Sentiment Categorization of Movie Reviews
PDF
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
PDF
Implementation of Semantic Analysis Using Domain Ontology
Architecture of an ontology based domain-specific natural language question a...
QUESTION ANSWERING SYSTEM USING ONTOLOGY IN MARATHI LANGUAGE
Paper id 28201441
Generation of Question and Answer from Unstructured Document using Gaussian M...
A Survey on Sentiment Categorization of Movie Reviews
TEXT SENTIMENTS FOR FORUMS HOTSPOT DETECTION
Implementation of Semantic Analysis Using Domain Ontology

What's hot (17)

PPTX
Programmer information needs after memory failure
PDF
Modeling Text Independent Speaker Identification with Vector Quantization
PDF
A017640107
PDF
IRJET- An Automated Approach to Conduct Pune University’s In-Sem Examination
PDF
Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm
PDF
IRJET- Missing Value Evaluation in SQL Queries: A Survey
PDF
Ontology Based Approach for Semantic Information Retrieval System
PDF
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
PDF
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
PDF
Feature selection, optimization and clustering strategies of text documents
PDF
Advanced Question Paper Generator using Fuzzy Logic
PDF
IRJET- Text Document Clustering using K-Means Algorithm
PDF
Question Classification using Semantic, Syntactic and Lexical features
PDF
A4 elanjceziyan
PDF
NLP_Project_Paper_up276_vec241
PDF
Intelligent information extraction based on artificial neural network
PDF
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
Programmer information needs after memory failure
Modeling Text Independent Speaker Identification with Vector Quantization
A017640107
IRJET- An Automated Approach to Conduct Pune University’s In-Sem Examination
Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm
IRJET- Missing Value Evaluation in SQL Queries: A Survey
Ontology Based Approach for Semantic Information Retrieval System
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
FAST FUZZY FEATURE CLUSTERING FOR TEXT CLASSIFICATION
Feature selection, optimization and clustering strategies of text documents
Advanced Question Paper Generator using Fuzzy Logic
IRJET- Text Document Clustering using K-Means Algorithm
Question Classification using Semantic, Syntactic and Lexical features
A4 elanjceziyan
NLP_Project_Paper_up276_vec241
Intelligent information extraction based on artificial neural network
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
Ad

Similar to A Review on Novel Scoring System for Identify Accurate Answers for Factoid Questions (20)

PDF
Open domain question answering system using semantic role labeling
PDF
A_Review_of_Question_Answering_Systems.pdf
PDF
A BRIEF SURVEY OF QUESTION ANSWERING SYSTEMS
PDF
A BRIEF SURVEY OF QUESTION ANSWERING SYSTEMS
PDF
A BRIEF SURVEY OF QUESTION ANSWERING SYSTEMS
PDF
QUESTION ANSWERING SYSTEMS: ANALYSIS AND SURVEY
PDF
Application of hidden markov model in question answering systems
PDF
Novel Scoring System for Identify Accurate Answers for Factoid Questions
PDF
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
DOCX
NLP Techniques for Question Answering.docx
PDF
Open domain Question Answering System - Research project in NLP
PDF
Development and evaluation of a web based question answering system for arabi...
PDF
Answer extraction and passage retrieval for
PDF
QUESTION ANALYSIS FOR ARABIC QUESTION ANSWERING SYSTEMS
PDF
Response quality-evaluation-in-heterogeneous-question-answering-system-a-blac...
PPTX
Question Answering System using machine learning approach
PDF
Question Classification using Semantic, Syntactic and Lexical features
PDF
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
PDF
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
PDF
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
Open domain question answering system using semantic role labeling
A_Review_of_Question_Answering_Systems.pdf
A BRIEF SURVEY OF QUESTION ANSWERING SYSTEMS
A BRIEF SURVEY OF QUESTION ANSWERING SYSTEMS
A BRIEF SURVEY OF QUESTION ANSWERING SYSTEMS
QUESTION ANSWERING SYSTEMS: ANALYSIS AND SURVEY
Application of hidden markov model in question answering systems
Novel Scoring System for Identify Accurate Answers for Factoid Questions
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
NLP Techniques for Question Answering.docx
Open domain Question Answering System - Research project in NLP
Development and evaluation of a web based question answering system for arabi...
Answer extraction and passage retrieval for
QUESTION ANALYSIS FOR ARABIC QUESTION ANSWERING SYSTEMS
Response quality-evaluation-in-heterogeneous-question-answering-system-a-blac...
Question Answering System using machine learning approach
Question Classification using Semantic, Syntactic and Lexical features
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
Ad

More from International Journal of Science and Research (IJSR) (20)

PDF
Innovations in the Diagnosis and Treatment of Chronic Heart Failure
PDF
Design and implementation of carrier based sinusoidal pwm (bipolar) inverter
PDF
Polarization effect of antireflection coating for soi material system
PDF
Image resolution enhancement via multi surface fitting
PDF
Ad hoc networks technical issues on radio links security & qo s
PDF
Microstructure analysis of the carbon nano tubes aluminum composite with diff...
PDF
Improving the life of lm13 using stainless spray ii coating for engine applic...
PDF
An overview on development of aluminium metal matrix composites with hybrid r...
PDF
Pesticide mineralization in water using silver nanoparticles incorporated on ...
PDF
Comparative study on computers operated by eyes and brain
PDF
T s eliot and the concept of literary tradition and the importance of allusions
PDF
Effect of select yogasanas and pranayama practices on selected physiological ...
PDF
Grid computing for load balancing strategies
PDF
A new algorithm to improve the sharing of bandwidth
PDF
Main physical causes of climate change and global warming a general overview
PDF
Performance assessment of control loops
PDF
Capital market in bangladesh an overview
PDF
Faster and resourceful multi core web crawling
PDF
Extended fuzzy c means clustering algorithm in segmentation of noisy images
PDF
Parallel generators of pseudo random numbers with control of calculation errors
Innovations in the Diagnosis and Treatment of Chronic Heart Failure
Design and implementation of carrier based sinusoidal pwm (bipolar) inverter
Polarization effect of antireflection coating for soi material system
Image resolution enhancement via multi surface fitting
Ad hoc networks technical issues on radio links security & qo s
Microstructure analysis of the carbon nano tubes aluminum composite with diff...
Improving the life of lm13 using stainless spray ii coating for engine applic...
An overview on development of aluminium metal matrix composites with hybrid r...
Pesticide mineralization in water using silver nanoparticles incorporated on ...
Comparative study on computers operated by eyes and brain
T s eliot and the concept of literary tradition and the importance of allusions
Effect of select yogasanas and pranayama practices on selected physiological ...
Grid computing for load balancing strategies
A new algorithm to improve the sharing of bandwidth
Main physical causes of climate change and global warming a general overview
Performance assessment of control loops
Capital market in bangladesh an overview
Faster and resourceful multi core web crawling
Extended fuzzy c means clustering algorithm in segmentation of noisy images
Parallel generators of pseudo random numbers with control of calculation errors

Recently uploaded (20)

PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
01-Introduction-to-Information-Management.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Pharma ospi slides which help in ospi learning
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
Institutional Correction lecture only . . .
PPTX
Cell Structure & Organelles in detailed.
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Pre independence Education in Inndia.pdf
FourierSeries-QuestionsWithAnswers(Part-A).pdf
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
01-Introduction-to-Information-Management.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
Microbial disease of the cardiovascular and lymphatic systems
Microbial diseases, their pathogenesis and prophylaxis
Pharma ospi slides which help in ospi learning
Abdominal Access Techniques with Prof. Dr. R K Mishra
O5-L3 Freight Transport Ops (International) V1.pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
Renaissance Architecture: A Journey from Faith to Humanism
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Anesthesia in Laparoscopic Surgery in India
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Institutional Correction lecture only . . .
Cell Structure & Organelles in detailed.
Supply Chain Operations Speaking Notes -ICLT Program
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Pre independence Education in Inndia.pdf

A Review on Novel Scoring System for Identify Accurate Answers for Factoid Questions

  • 1. International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064 Volume 2 Issue 9, September 2013 www.ijsr.net A Review on Novel Scoring System for Identify Accurate Answers for Factoid Questions 1 Harpreet Kaur, 2 Rimpi 1, 2 Swami Vivekanand Institute of Engineering and Technology Banur, Punjab, India Abstract: In this research work we have develop a new scoring mathematical model that works on the five types of questions. The question text failures are first extracted and a score is found based on its structure with respect to its template structure and then answer score is calculated again the question as well as paragraph. Text to finally reach at the index of the most probable answer with respect to question. Keywords: Natural language processing, Question answering System, Information retrieval. 1. Introduction NLP focuses on communications between computers and natural languages in terms of theoretical results and practical applications, and on information sharing now that information is exchange as it never has been before and sharing information becoming the leading theme in the domain of NLP systems. Question Answering (QA) systems go beyond the usual Information Retrieval (IR) systems which underlay popular Internet search engines. QA systems have the aim of responding to natural language questions whereas IR systems take up keywords from users and deploy some intelligent search mechanisms on a document collection to get back to the user with a ranked list of documents rather than an exact answer. There is a need for tools that would reduce the amount of text in order to obtain the desired information [1] [2] [3]. People have questions and they need answers, not documents and they always prefer to express the questions in their native language without being restricted to a particular query language, query formation rules, or even a particular knowledge domain. Automatic question answering system will help for the above technology. In this Question Answering System consists of three distinct phases: Question classification, information retrieval or document processing and answer extraction. The design of a standard QA system assumes that the language in which the question is asked and the text collection available to be processed are all in the same language. However, there might be a need for cross-lingual QA system which takes in questions in one language and searches through a document collection in a different language to get to the answer. English QA system research attempts to deal with a wide range of question types like WHEN, WHERE, WHAT, HOW, WHOM, WHY & WHOSE. Thus the aim of a QA system is to localize the exact answer to a question from a structured or a non- structured collection of texts. The QA system states that a QA task can be decomposed into three main sub problems. The sub problems are:- Question processing, information retrieval or document Processing and answer processing and. The question processing stage is responsible for taking a question in a natural language and producing some kind of intelligent representation of the raw question string so that it becomes more useful for finding answers. The document processing stage is used to reduce the search space of the document collection where the answer to the question can be expected. This stage is basically a complete Information Retrieval system where the idea is to take in some keywords and produce a ranked list of documents related to those keywords. The final stage of a QA system is the answer processing stage where the system does some intelligent matching with the output of the previous two stages to produce an answer to the given question. Any QA system should have these four basic components and may have a number of other components to make the system more useful and robust. 1.1 What is Question and Answer System? Questions answering (QA) systems look for the answer of a question in a large collection of documents. The question is in natural language. QA systems select text passages. Then, after that the answer is extracted from these passages, according to criteria issued from the question analysis. To facilitate the question generation task, we build text from the input complex text using a syntactic parser. We classify the text based on their subject, verb, object and preposition for determining the possible type of questions to be generated. The ability of QA systems to recognize a great amount of answer types is related to their powerfulness for extracting right answers [4] [5] [6]. 1.2 Why do we need QSA System? Question Answering (QA) Systems allow the user to ask questions in a natural language and obtain an exact answer. In this, we tried to learn the important issues in the field of Question Answering (QA) systems. We peeked into the internals of many established QA systems. we do not only consider simple questions but text problems consisting of several sentences. Our approach to translating the natural language question uses an underlying corpus and the knowledge base to derive meaningful and relevant patterns which can then be used to process the questions and capture their meaning with respect to the underlying knowledge base. 154Paper ID: 05091302
  • 2. International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064 Volume 2 Issue 9, September 2013 www.ijsr.net 2. Types of QA systems Different types of QA systems which are divided into two major groups based on the methods used by them. First group of QA system belongs to simple natural language processing and information retrieval methods, while another group of QA systems are dependent upon the reasoning with natural language. 2.1 Web Based Question Answering System This web bases QA system submits the question to the search engine like Google, Yahoo etc and grabs its top 100 search results..Given a user's natural language question, the system will submit the question to a search engine, then extract all possible answers from the search results according to the question type identified by the question classification module, finally select the most possible answers to return. The Web Based QA systems mostly handles wh-type of questions such as “Who was the first American in space?” Or “Which of the following is correct”. This QA system provides answers in various forms like text documents, Xml documents or Wikipedia. The common levels that are used by different web based Question Answering systems architectures are as follows [7]:  Question Classification:- In order to correctly answer a question, usually one needs to understand what type of information the question asks for, e.g., the sample question “Who was the first American in space?” asks for a person name. The question classification is made to provide better accuracy in the results.  Answer Extraction: - In this, extracts the correct possible answers for different classification of questions.  Answer Selection:- Among the possible answers obtained, ranking approaches are used to find out the best accurate answers based on its weight age factor. Answer classes generally is of factoid and non - factoid types. The factoid is getting short fact based answers like names, dates, and non-factoid is getting descriptions or definitions [9]. Given a user's natural language question, the system will submit the question to a search engine, then extract all possible answers from the search results according to the question type identified by the question classification module, finally select the most similar answers to return. The architecture of web based question answering system is shown in figure 2[8]. Figure 1: Architecture of Web based question answering system 2.2 IR / IE Based Question Answering Systems Question Answering, the process of extracting answers to natural language questions is profoundly different from Information Retrieval (IR) or Information Extraction (IE). IR systems allow us to locate relevant documents that relate to a query, but do not specify exactly where the answers are. In IR, the documents of interest are fetched by matching query keywords to the index of the document collection. IE systems need several resources like Named Entity Tagging (NE), Template Element (TE), Template relation (TR), Correlated Element (CE), and General Element (GE). IE systems architecture is build into distinct levels:  Level 1 NE tagger is use to handle named entity elements in the text (who, when, where, what etc..,).  Level 2 handles NE tagging + adj like (how far, how long, how often etc..,),  Level 3 builds the correlated entities by using the most important entity in the question and prepares General Element (GE) which consists of asking point of view. For Eg: “How did John pass the exam?” The ASKING POINT is clearly defined i.e. Person (Noun) if we by passing this question into the separate levels which was mentioned above.  The Architecture of IR/IE based question answering system is given in figure3 Figure 2: Architecture of IR/IE based question answering 2.3 Restricted Domain Question Answering systems QA systems for restricted domains may be designed to retrieve answers from so-called unstructured data (free texts), semi-structured data (such as XML-annotated texts), or structured data (databases). Question answering applied to restricted domains is interesting and challenging in two important respects. Question answering on restricted domains requiring the processing of complex questions and offering the opportunity to carry out complex analysis of the text sources and the questions [3] [10]. The main difference between open-domain question answering and restricted- domain question answering is the existence of domain- dependent information that can be used to improve the accuracy of the system. 2.4 Rule Based Question Answering Systems The rule-based system uses lexical and semantic heuristics to look for evidence that a sentence contains the answer to a question. Each type of WH question looks for different types of answers, so Quarc uses a separate set of rules for each question type (WHO, WHAT, WHEN, WHERE, WHY,WHOSE,WHOM). Given a question and a story, 155Paper ID: 05091302
  • 3. International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064 Volume 2 Issue 9, September 2013 www.ijsr.net Quarc parses the question and all of the sentences in the story using our partial parser Sundance. Much of the syntactic analysis is not used, but Quarc does use the morphological analysis, part-of- speech tagging, semantic class tagging, and entity recognition. The rules are applied to each sentence in the story, as well as the title of the story, with the exception that the title is not considered for WHY questions. “Who” rules looks for Names that are mostly Nouns of persons or things. The “What” questions were the most difficult to handle because they sought an amazing variety of answers and it consists of DATE expression or nouns. “When” questions almost always require a TIME expression, so sentences that do not contain a TIME expression are only considered in special cases. The “Where” questions almost always look for specific locations, so the WHERE rules are much focused. “Why” questions are handled differently than other questions. The WHY rules are based on the observation that the answer to a WHY question often appears immediately before or immediately after the sentence that most closely matches the question. We believe that this is due to the causal nature of WHY questions. The “Whose/Whom” these questions usually ask about an individual or an organization. The rule based QA systems first establish parse notations and generate training cases and test cases through the semantic model. This system consists of some common modules like IR module and Answer identifier or Ranker Module. 3. Application of QAS Question answering has many applications. We can subdividing these applications based upon the source of the answers: structured data (databases), semi-structured data (for example, comment fields in databases) or free text. We can further distinguish among search over a fixed set of collections, as used in TREC (particularly useful for evaluation); search over the Web, search over a collection or book, e.g. an encyclopaedia or search over a single text, as done for reading comprehension evaluations[11]. Another application is in education can also find uses for Question Answering in fields where there are frequently asked questions that people wants to search. 4. A Review on Methodology First, we collect the articles from encyclopaedia. These articles bank created for extraction of text. When the full extraction system generated multiple outputs from an input sentence and text, we randomly sampled one of them. Figure 3: Flow of Question Answer The run text parser is that in which we can take text as a input and break it up into meaning components in some way. After that using text parsing algorithm identify noun/verbs combination to develop questions. The last but not least regular expressions algorithm is used to specify the pattern that provides concise and flexible means to match. Strings of text such as, particular character, words or patterns of character. Our main purpose of a Question Answers System (QAS) is to find out who did what to whom, where, when, how why and whose? And after that the answers are extracted for the questions of types (What), (Where), (When), (Who), (Why) & (Whose). At the end based on these questions the performance is evaluated. 5. Conclusion In this paper we discussed some of the approaches used in the existing QA system and proposed a new architecture for QA system retrieves the exact answer. It presents a method checking that an answer is of the specific type expected by the question. It can be used to improve question answering system by checking all returned answers. However, it cannot be used alone to select the good answer. Answering system has become an important component of the online education platform. The goal of a question answering system is to retrieving answers to questions rather than full documents or best matching passages, as most information retrieval systems. References [1] Haque, Nafid. and Rosner, Mike. A prototype framework for a Bangla question answering system using translation based on transliteration and table look- up as an interface for the medical domain. University of Malta Gertjan Van Noord, University of Groningen [2] Ashish Kumar Saxena, Ganesh Viswanath Sambhu, L. Venkata Subramaniam*, Saroj Kaushik, ”IITD-IBMIRL System for Question Answering using Pattern Matching, Semantic Type and Semantic Category Recognition” OCT 2007. [3] Poonam Gupta & Vishal Gupta(2012)” A Survey of Text Question Answering Techniques” International Journal of Computer Applications (0975 – 8887)Vol No. 2012 156Paper ID: 05091302
  • 4. International Journal of Science and Research (IJSR), India Online ISSN: 2319-7064 Volume 2 Issue 9, September 2013 www.ijsr.net [4] Arnaud Grappy, Brigitte Grau”Answer type validation in question answering systems”Le centre de hautes etudes internationals dtnnformatique documentaire Paris, France, France ©2010 [5] S. M. Harabagiu, M. A. Pa_sca, and S. J. Maiorano. Experiments with open-domain textual question answering. In Proceedings of the 18th conference on Computational linguistics, Morristown, NJ, USA, 2000. Association for Computational Linguistics [6] E. Hovy, L. Gerber, U. Hermjakob, C.-Y. Lin, and D. Ravichandran. Toward semantics-based answer pinpointing. In HLT '01: Proceedings of the _rst international conference on Human language technology research, Morristown, NJ, USA, 2001. Association for Computational Linguistics [7] Guda, Vanitha., Sanampudi, Suresh. Kumar. And Manikyamba, I.Lalkshmi ,”Approaches For Question Answering Systems” , Vanitha Guda et al. / International Journal of Engineering Science and Technology (IJEST) ISSN : 0975-5462 Vol. 3 No. 2011. 990-995 [8] Rodrigo, Alvaro. Perez-Iglesias, joaqum., Penas, Anselmo., Garrido, Guillermo and Araujo,Lourdes. A Question Answering System based on Information Retrieval and Validation [9] Quarteroni, S. and Manandhar S. “Designing an Interactive Open-Domain Question Answering System”. Journal of Natural Language Engineering 1. 1-23. [10]Anette Frank, Hans-Ulrich Krieger, Feiyu Xu, Hans Uszkoreit, Berthold Crysmann, Brigitte Jörg, Ulrich Schäfe” Question answering from structured knowledge sources” German Research Center for Artificial Intelligence, DFKI, Stuhlsatzenhausweg 3, 66123 Saarbrücken, Germany, Journal of Applied Logic 5 (2007) 20–48 [11]Gupta, Vishal and Lehal, Gurpreet S. “A Survey of Text Mining Techniques and Applications”. Journal of Emerging Technologies in web Intelligence, VOL. 1, No. 1. Author Profile Harpreet Kaur is currently perusing M. Tech in Computer Science and Engineering from Swami Vivekanand Institute of Engineering & Technology, Banur, Punjab. She holds the degree of B. Tech in Computer Science and Technology from Baba Banda Singh Bahadur Engineering and Technology, Fathegarh sahib, Punjab. Er. Rimpi is currently working as Assistant Professor in Computer Science and Engineering Department at Swami Vivekanand Institute of Engineering and Technology, Banur. She has completed her M. Tech in Computer Engineering from Guru Nanak Dev University, Amritsar, Punjab in 2011. She holds the degree of B. Tech in Computer Science and Technology from Guru Nanak Dev University, Amritsar, Punjab in 2009. 157Paper ID: 05091302