“April 2020: Top Read
Articles in Control Theory
and Computer Modelling”
International Journal of Control Theory and
Computer Modelling (IJCTCM)
ISSN : 2249-1155 [Online]; 2319 - 4138 [print].
http://airccse.org/journal/ijctcm/ijctcm.html
AUTOMATIC CLASSIFICATION OF BENGALI
SENTENCES BASED ON SENSE DEFINITIONS
PRESENT IN BENGALI WORDNET
Alok Ranjan Pal1
, Diganta Saha2
and Niladri Sekhar Dash3
1
Dept. of Computer Science and Eng., College of Engineering and Management,
Kolaghat
2
Dept. of Computer Science and Eng., Jadavpur University, Kolkata
3
Linguistic Research Unit, Indian Statistical Institute, Kolkata
ABSTRACT
Based on the sense definition of words available in the Bengali WordNet, an attempt is made to
classify the Bengali sentences automatically into different groups in accordance with their
underlying senses. The input sentences are collected from 50 different categories of the Bengali text
corpus developed in the TDIL project of the Govt. of India, while information about the different
senses of particular ambiguous lexical item is collected from Bengali WordNet. In an experimental
basis we have used Naive Bayes probabilistic model as a useful classifier of sentences. We have
applied the algorithm over 1747 sentences that contain a particular Bengali lexical item which,
because of its ambiguous nature, is able to trigger different senses that render sentences in different
meanings. In our experiment we have achieved around 84% accurate result on the sense
classification over the total input sentences. We have analyzed those residual sentences that did not
comply with our experiment and did affect the results to note that in many cases, wrong syntactic
structures and less semantic information are the main hurdles in semantic classification of sentences.
The applicational relevance of this study is attested in automatic text classification, machine
learning, information extraction, and word sense disambiguation
KEYWORDS
Natural Language Processing, Bengali Word Sense Disambiguation, Bengali WordNet, Naïve Bayes
Classification.
For More Details: http://airccse.org/journal/ijctcm/papers/5115ijctcm01.pdf
Volume Link: http://airccse.org/journal/ijctcm/vol5.html
REFERENCES
[1] Ide, N., Véronis, J., (1998) “Word Sense Disambiguation: The State of the Art”, Computational
Linguistics, Vol. 24, No. 1, Pp. 1-40.
[2] Cucerzan, R.S., C. Schafer, and D. Yarowsky, (2002) “Combining classifiers for word sense
disambiguation”, Natural Language Engineering, Vol. 8, No. 4, Cambridge University Press,
Pp. 327- 341.
[3] Nameh, M., S., Fakhrahmad, M., Jahromi, M.Z., (2011) “A New Approach to Word Sense
Disambiguation Based on Context Similarity”, Proceedings of the World Congress on
Engineering, Vol. I.
[4] Xiaojie, W., Matsumoto, Y., (2003) “Chinese word sense disambiguation by combining pseudo
training data”, Proceedings of The International Conference on Natural Language Processing
and Knowledge Engineering, Pp. 138-143.
[5] Navigli, R. (2009) “Word Sense Disambiguation: a Survey”, ACM Computing Surveys, Vol. 41,
No.2, ACM Press, Pp. 1-69.
[6] Gaizauskas, R., (1997) “Gold Standard Datasets for Evaluating Word Sense Disambiguation
Programs”, Computer Speech and Language, Vol. 12, No. 3, Special Issue on Evaluation of
Speech and Language Technology, Pp. 453-472.
[7] Seo, H., Chung, H., Rim, H., Myaeng, S. H., Kim, S., (2004) “Unsupervised word sense
disambiguation using WordNet relatives”, Computer Speech and Language, Vol. 18, No. 3, Pp.
253- 273.
[8] G. Miller, (1991) “WordNet: An on-line lexical database”, International Journal of Lexicography,
Vol.3, No. 4.
[9] Kolte, S.G., Bhirud, S.G., (2008) “Word Sense Disambiguation Using WordNet Domains”, First
International Conference on Digital Object Identifier, Pp. 1187-1191.
[10] Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007) “Using WordNet to Disambiguate Word
Senses for Text Classification”, Proceedings of the 7th International Conference on
Computational Science, Springer-Verlag, Pp. 781 - 789.
[11] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J., (1990) “WordNet An on-line
Lexical Database”, International Journal of Lexicography, 3(4): 235-244.
[12] Miller, G.A., (1993) “WordNet: A Lexical Database”, Comm. ACM, Vol. 38, No. 11, Pp. 39-
41.
[13] Cañas, A.J., A. Valerio, J. Lalinde-Pulido, M. Carvalho, and M. Arguedas, (2003) “Using
WordNet for Word Sense Disambiguation to Support Concept Map Construction”, String
Processing and Information Retrieval, Pp. 350-359.
[14] http://en.wikipedia.org/wiki/Naive_bayes
[15] Dash, N.S., (2007) Indian scenario in language corpus generation. In, Dash, Ni.S., P. Dasgupta
and P. Sarkar (Eds.) Rainbow of Linguistics: Vol. I. Kolkata: T. Media Publication. Pp. 129-
162.
[16] Dash, N.S., (1999) “Corpus oriented Bangla language processing”, Jadavpur Journal of
Philosophy. 11(1): 1-28.
[17] Dash, N.S., (2000) “Bangla pronouns-a corpus based study”, Literary and Linguistic
Computing. 15(4): 433-444.
[18] Dash, N.S., (2004) Language Corpora: Present Indian Need, Indian Statistical Institute, Kolkata,
http://www. elda. org/en/proj/scalla/SCALLA2004/dash. pdf.
[19] Dash, N.S. (2005) Methods in Madness of Bengali Spelling: A Corpus-based Investigation,
South Asian Language Rewiew, Vol. XV, No. 2.
[20] Dash, N.S., (2012), From KCIE to LDC-IL: Some Milestones in NLP Journey in Indian
Multilingual Panorama. Indian Linguistics. 73(1-4): 129-146.
[21] Dash, N.S. and B.B. Chaudhuri, (2001) “A corpus based study of the Bangla language”, Indian
Journal of Linguistics. 20: 19-40.
[22] Dash, N.S. and B.B. Chaudhuri, (2001) “Corpus-based empirical analysis of form, function and
frequency of characters used in Bangla”, Published in Rayson, P. , Wilson, A. , McEnery, T. ,
Hardie, A. , and Khoja, S. , (eds.) Special issue of the Proceedings of the Corpus Linguistics
2001 Conference, Lancaster: Lancaster University Press. UK. 13: 144-157. 2001.
[23] Dash, N.S. and B.B. Chaudhuri., (2002) Corpus generation and text processing, International
Journal of Dravidian Linguistics. 31(1): 25-44.
[24] Dash, N.S. and B.B. Chaudhuri, (2002) “Using Text Corpora for Understanding Polysemy in
Bangla”, Proceedings of the Language Engineering Conference (LEC'02) IEEE.
[25] Dolamic, L. and J. Savoy, (2010) “Comparative Study of Indexing and Search Strategies for the
Hindi, Marathi and Bengali Languages”, ACM Transactions on Asian Language Information
Processing, 9(3): 1-24.
[26] Jurafsky, D. and J.H. Martin, (2000) Speech and Language Processing, ISBN 81-7808-594-1,
Pearson Education Asia, page no: 604.
[27] http://www.isical.ac.in/~lru/wordnetnew/
WORD SENSE DISAMBIGUATION: A SURVEY
Alok Ranjan Pal1
and Diganta Saha2
1
Dept. of Computer Science and Engg., College of Engg. and Mgmt, Kolaghat
2
Dept. of Computer Science and Engg., Jadavpur University, Kolkata
ABSTRACT
In this paper, we made a survey on Word Sense Disambiguation (WSD). Near about in all major
languages around the world, research in WSD has been conducted upto different extents. In this
paper, we have gone through a survey regarding the different approaches adopted in different
research works, the State of the Art in the performance in this domain, recent works in different
Indian languages and finally a survey in Bengali language. We have made a survey on different
competitions in this field and the bench mark results, obtained from those competitions.
Keywords:
Natural Languages Processing, Word Sense Disambiguation
For More Details: http://airccse.org/journal/ijctcm/papers/5315ijctcm01.pdf
Volume Link: http://airccse.org/journal/ijctcm/vol5.html
REFERENCES
[1] Ide, N., Véronis, J., (1998) “Word Sense Disambiguation: The State of the
Art”,Computational Linguistics, Vol. 24, No. 1, Pp. 1-40.
[2] Cucerzan, R.S., C. Schafer, and D. Yarowsky, (2002) “Combining classifiers for word sense
disambiguation”, Natural Language Engineering, Vol. 8, No. 4, Cambridge University
Press,Pp. 327- 341.
[3] Nameh, M. S., Fakhrahmad, M., Jahromi, M.Z., (2011) “A New Approach to Word Sense
Disambiguation Based on Context Similarity”, Proceedings of the World Congress on
Engineering,Vol. I.
[4] Xiaojie, W., Matsumoto, Y., (2003) “Chinese word sense disambiguation by combining
pseudo training data”,Proceedings of The International Conference on Natural Language
Processing and Knowledge Engineering, Pp. 138-143.
[5] Navigli, R. (2009) “Word Sense Disambiguation: a Survey”, ACM Computing Surveys, Vol.
41,No.2, ACM Press, Pp. 1-69.
[6] Word Sense Disambiguation; Algorithms and Applications, Edited by Eneko Agirre and
Philip Edmonds, Springer, VOLUME 33.
[7] Seo, H., Chung, H., Rim, H., Myaeng, S. H., Kim, S., (2004) “Unsupervised word sense
disambiguation using WordNet relatives”, Computer Speech and Language, Vol. 18, No. 3,
Pp. 253-273.
[8] Miller, G., (1991) “WordNet: An on-line lexical database”, International Journal of
Lexicography,Vol.3,No. 4.
[9] Kolte, S.G., Bhirud, S.G., (2008) “Word Sense Disambiguation Using WordNet Domains”,
First International Conference on Digital Object Identifier, Pp. 1187-1191.
[10] Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007) “Using WordNet to Disambiguate Word
Senses for Text Classification”, Proceedings of the 7th International Conference on
Computational Science, Springer-Verlag, Pp. 781 - 789.
[11] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J., (1990) “WordNet An on-
line Lexical Database”, International Journal of Lexicography, 3(4): 235-244.
[12] Miller, G.A., (1993) “WordNet: A Lexical Database”, Comm. ACM, Vol. 38, No. 11, Pp. 39-
41.
[13] Cañas, A.J., A. Valerio, J. Lalinde-Pulido, M. Carvalho, and M. Arguedas, (2003) “Using
WordNet for Word Sense Disambiguation to Support Concept Map Construction”, String
Processing and Information Retrieval, Pp. 350-359.
[14] Marine, C., Dekai, W.U.,(2005) “Word Sense Disambiguation vs. Statistical Machine
Translation”, Proceedings of the 43rd Annual Meeting of the ACL , Ann Arbor, June 2005,
pages 387–394.
[15] http://www.ling.gu.se/~sl/Undervisning/StatMet11/wsd-mt.pdf date: 14/05/2015
[16] http://nlp.cs.nyu.edu/sk-symposium/note/P-28.pdf date: 14/05/2015
[17] Yee, S. C., Hwee, T. N., David, C., (2007) “Word Sense Disambiguation Improves Statistical
Machine Translation”, Proceedings of the 45th Annual Meeting of the Association of
Computational Linguistics, pages 33–40, Prague, Czech Republic, June 2007.
[18] Sanderson, M.,(1994) “Word Sense Disambiguation and Information Retrieval”, Proceedings
of the 17th Annual International ACM SIGIR conference on Research and Development in
Information Retrieval, SIGIR’94, July 03-06, Dublin, Ireland, Springer, New York, pp 142-
151.
[19] Christopher, S., Michael, P. O., John, T.,(2003) “Word Sense Disambiguation in Information
Retrieval Revisited”, SIGIR’03, July 28–August 1, 2003, Toronto, Canada.
[20] http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.65.6828&rep=rep1&type=pdf
date:4/05/2015
[21] http://www.aclweb.org/anthology/P12-1029date: 14/05/2015
[22] https://www.comp.nus.edu.sg/~nght/pubs/esair11.pdf date: 14/05/2015
[23] http://cui.unige.ch/isi/reports/2008/CLEF2008-LNCS.pdf date: 14/05/2015
[24] Banerjee, S., Pedersen, T.,(2002) "An adapted Lesk algorithm for word sense disambiguation
using WordNet", In Proceedings of the Third International Conference on Intelligent Text
Processing and Computational Linguistics, Mexico City, February.
[25] Lesk, M.,(1986) "Automatic Sense Disambiguation Using Machine Readable Dictionaries:
How to Tell a Pine Cone from an Ice Cream Cone", Proceedings of SIGDOC.
[26] http://www.dlsi.ua.es/projectes/srim/publicaciones/CICling-2002.pdf date: 14/05/2015
[27] Mittal, K. and Jain, A.,(2015)“WORD SENSE DISAMBIGUATION METHOD USING
SEMANTIC SIMILARITY MEASURES AND OWA OPERATOR”, ICTACT JOURNAL
ON SOFT COMPUTING: SPECIAL ISSUE ON SOFT –COMPUTING THEORY,
APPLICATION AND IMPLICATIONS IN ENGINEERING AND TECHNOLOGY,
JANUARY, 2015, VOLUME: 05, ISSUE: 02.
[28] http://www.d.umn.edu/~tpederse/Pubs/cicling2003-3.pdf%date: 14/05/2015
[29] http://www.aclweb.org/anthology/U04-1021date: 14/05/2015
[30] http://www.aclweb.org/anthology/C10-2142date: 14/05/2015
[31] Diana, M.C., Carroll, J., “Disambiguating Nouns, Verbs, and Adjectives Using Automatically
Acquired Selectional Preferences”, Computational Linguistics, Volume 29, Number 4, pp.
639-654.
[32] Patrick, Y. and Timothy, B.,(2006) “Verb Sense Disambiguation Using Selectional
Preferences Extracted with a State-of-the-art Semantic Role Labeler”, Proceedings of the
2006 Australasian Language Technology Workshop (ALTW2006), pages 139–148.
[33] http://link.springer.com/article/10.1023/A%3A1002674829964#page-1 date: 14/05/2015

More Related Content

DOCX
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
PDF
11.development of a feature extraction technique for online character recogni...
PDF
Development of a feature extraction technique for online character recognitio...
PDF
Trends of machine learning in 2020 - International Journal of Artificial Inte...
PDF
Top 10 Natural Language Processing Trends in 2020 - International Journal on ...
PDF
11.development of a writer independent online handwritten character recogniti...
DOCX
MOST CITED NATURAL LANGUAGECOMPUTING ARTICLESIN 2017
11.development of a feature extraction technique for online character recogni...
Development of a feature extraction technique for online character recognitio...
Trends of machine learning in 2020 - International Journal of Artificial Inte...
Top 10 Natural Language Processing Trends in 2020 - International Journal on ...
11.development of a writer independent online handwritten character recogniti...

What's hot (6)

PDF
김문형-Curriculum Vitae
DOCX
Dr Khdour CV
PDF
Curriculum Vitae
PDF
Ijetcas14 371
PDF
The International Journal of Network Security & Its Applications (IJNSA) -- ...
DOCX
SATANJEEV BANERJEE
김문형-Curriculum Vitae
Dr Khdour CV
Curriculum Vitae
Ijetcas14 371
The International Journal of Network Security & Its Applications (IJNSA) -- ...
SATANJEEV BANERJEE
Ad

Similar to April 2020 most read artilce in contro theory & computer controlling (20)

PDF
Automatic classification of bengali sentences based on sense definitions pres...
PDF
2-IJCSE-00536
PDF
2-IJCSE-00536
PDF
Word sense disambiguation a survey
PDF
Ny3424442448
PDF
228-SE3001_2
PDF
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
PDF
Word sense disambiguation a survey
PPT
Advances in word sense disambiguation
PDF
A Survey on Word Sense Disambiguation
PPTX
Module 4.1 of chennai's slides wo hanve dot do thhopps otps
PDF
International Journal of Engineering and Science Invention (IJESI)
PDF
IJNLC 2013 - Ambiguity-Aware Document Similarity
PDF
AMBIGUITY-AWARE DOCUMENT SIMILARITY
PDF
A supervised word sense disambiguation method using ontology and context know...
PDF
English to punjabi machine translation system using hybrid approach of word s
PDF
Malayalam Word Sense Disambiguation using Machine Learning Approach
DOC
Doc format.
PPTX
Word Sense Disambiguation - Algorithms for WSD.pptx
PDF
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
Automatic classification of bengali sentences based on sense definitions pres...
2-IJCSE-00536
2-IJCSE-00536
Word sense disambiguation a survey
Ny3424442448
228-SE3001_2
AN EMPIRICAL STUDY OF WORD SENSE DISAMBIGUATION
Word sense disambiguation a survey
Advances in word sense disambiguation
A Survey on Word Sense Disambiguation
Module 4.1 of chennai's slides wo hanve dot do thhopps otps
International Journal of Engineering and Science Invention (IJESI)
IJNLC 2013 - Ambiguity-Aware Document Similarity
AMBIGUITY-AWARE DOCUMENT SIMILARITY
A supervised word sense disambiguation method using ontology and context know...
English to punjabi machine translation system using hybrid approach of word s
Malayalam Word Sense Disambiguation using Machine Learning Approach
Doc format.
Word Sense Disambiguation - Algorithms for WSD.pptx
A N H YBRID A PPROACH TO W ORD S ENSE D ISAMBIGUATION W ITH A ND W ITH...
Ad

Recently uploaded (20)

PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
My India Quiz Book_20210205121199924.pdf
PDF
advance database management system book.pdf
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
What if we spent less time fighting change, and more time building what’s rig...
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PPTX
TNA_Presentation-1-Final(SAVE)) (1).pptx
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
Trump Administration's workforce development strategy
PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
20th Century Theater, Methods, History.pptx
PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PDF
HVAC Specification 2024 according to central public works department
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Chinmaya Tiranga quiz Grand Finale.pdf
Practical Manual AGRO-233 Principles and Practices of Natural Farming
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
History, Philosophy and sociology of education (1).pptx
My India Quiz Book_20210205121199924.pdf
advance database management system book.pdf
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
AI-driven educational solutions for real-life interventions in the Philippine...
What if we spent less time fighting change, and more time building what’s rig...
B.Sc. DS Unit 2 Software Engineering.pptx
TNA_Presentation-1-Final(SAVE)) (1).pptx
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
Unit 4 Computer Architecture Multicore Processor.pptx
Trump Administration's workforce development strategy
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
20th Century Theater, Methods, History.pptx
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
HVAC Specification 2024 according to central public works department
202450812 BayCHI UCSC-SV 20250812 v17.pptx

April 2020 most read artilce in contro theory & computer controlling

  • 1. “April 2020: Top Read Articles in Control Theory and Computer Modelling” International Journal of Control Theory and Computer Modelling (IJCTCM) ISSN : 2249-1155 [Online]; 2319 - 4138 [print]. http://airccse.org/journal/ijctcm/ijctcm.html
  • 2. AUTOMATIC CLASSIFICATION OF BENGALI SENTENCES BASED ON SENSE DEFINITIONS PRESENT IN BENGALI WORDNET Alok Ranjan Pal1 , Diganta Saha2 and Niladri Sekhar Dash3 1 Dept. of Computer Science and Eng., College of Engineering and Management, Kolaghat 2 Dept. of Computer Science and Eng., Jadavpur University, Kolkata 3 Linguistic Research Unit, Indian Statistical Institute, Kolkata ABSTRACT Based on the sense definition of words available in the Bengali WordNet, an attempt is made to classify the Bengali sentences automatically into different groups in accordance with their underlying senses. The input sentences are collected from 50 different categories of the Bengali text corpus developed in the TDIL project of the Govt. of India, while information about the different senses of particular ambiguous lexical item is collected from Bengali WordNet. In an experimental basis we have used Naive Bayes probabilistic model as a useful classifier of sentences. We have applied the algorithm over 1747 sentences that contain a particular Bengali lexical item which, because of its ambiguous nature, is able to trigger different senses that render sentences in different meanings. In our experiment we have achieved around 84% accurate result on the sense classification over the total input sentences. We have analyzed those residual sentences that did not comply with our experiment and did affect the results to note that in many cases, wrong syntactic structures and less semantic information are the main hurdles in semantic classification of sentences. The applicational relevance of this study is attested in automatic text classification, machine learning, information extraction, and word sense disambiguation KEYWORDS Natural Language Processing, Bengali Word Sense Disambiguation, Bengali WordNet, Naïve Bayes Classification. For More Details: http://airccse.org/journal/ijctcm/papers/5115ijctcm01.pdf Volume Link: http://airccse.org/journal/ijctcm/vol5.html
  • 3. REFERENCES [1] Ide, N., Véronis, J., (1998) “Word Sense Disambiguation: The State of the Art”, Computational Linguistics, Vol. 24, No. 1, Pp. 1-40. [2] Cucerzan, R.S., C. Schafer, and D. Yarowsky, (2002) “Combining classifiers for word sense disambiguation”, Natural Language Engineering, Vol. 8, No. 4, Cambridge University Press, Pp. 327- 341. [3] Nameh, M., S., Fakhrahmad, M., Jahromi, M.Z., (2011) “A New Approach to Word Sense Disambiguation Based on Context Similarity”, Proceedings of the World Congress on Engineering, Vol. I. [4] Xiaojie, W., Matsumoto, Y., (2003) “Chinese word sense disambiguation by combining pseudo training data”, Proceedings of The International Conference on Natural Language Processing and Knowledge Engineering, Pp. 138-143. [5] Navigli, R. (2009) “Word Sense Disambiguation: a Survey”, ACM Computing Surveys, Vol. 41, No.2, ACM Press, Pp. 1-69. [6] Gaizauskas, R., (1997) “Gold Standard Datasets for Evaluating Word Sense Disambiguation Programs”, Computer Speech and Language, Vol. 12, No. 3, Special Issue on Evaluation of Speech and Language Technology, Pp. 453-472. [7] Seo, H., Chung, H., Rim, H., Myaeng, S. H., Kim, S., (2004) “Unsupervised word sense disambiguation using WordNet relatives”, Computer Speech and Language, Vol. 18, No. 3, Pp. 253- 273. [8] G. Miller, (1991) “WordNet: An on-line lexical database”, International Journal of Lexicography, Vol.3, No. 4. [9] Kolte, S.G., Bhirud, S.G., (2008) “Word Sense Disambiguation Using WordNet Domains”, First International Conference on Digital Object Identifier, Pp. 1187-1191. [10] Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007) “Using WordNet to Disambiguate Word Senses for Text Classification”, Proceedings of the 7th International Conference on Computational Science, Springer-Verlag, Pp. 781 - 789. [11] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J., (1990) “WordNet An on-line Lexical Database”, International Journal of Lexicography, 3(4): 235-244. [12] Miller, G.A., (1993) “WordNet: A Lexical Database”, Comm. ACM, Vol. 38, No. 11, Pp. 39- 41. [13] Cañas, A.J., A. Valerio, J. Lalinde-Pulido, M. Carvalho, and M. Arguedas, (2003) “Using WordNet for Word Sense Disambiguation to Support Concept Map Construction”, String Processing and Information Retrieval, Pp. 350-359. [14] http://en.wikipedia.org/wiki/Naive_bayes
  • 4. [15] Dash, N.S., (2007) Indian scenario in language corpus generation. In, Dash, Ni.S., P. Dasgupta and P. Sarkar (Eds.) Rainbow of Linguistics: Vol. I. Kolkata: T. Media Publication. Pp. 129- 162. [16] Dash, N.S., (1999) “Corpus oriented Bangla language processing”, Jadavpur Journal of Philosophy. 11(1): 1-28. [17] Dash, N.S., (2000) “Bangla pronouns-a corpus based study”, Literary and Linguistic Computing. 15(4): 433-444. [18] Dash, N.S., (2004) Language Corpora: Present Indian Need, Indian Statistical Institute, Kolkata, http://www. elda. org/en/proj/scalla/SCALLA2004/dash. pdf. [19] Dash, N.S. (2005) Methods in Madness of Bengali Spelling: A Corpus-based Investigation, South Asian Language Rewiew, Vol. XV, No. 2. [20] Dash, N.S., (2012), From KCIE to LDC-IL: Some Milestones in NLP Journey in Indian Multilingual Panorama. Indian Linguistics. 73(1-4): 129-146. [21] Dash, N.S. and B.B. Chaudhuri, (2001) “A corpus based study of the Bangla language”, Indian Journal of Linguistics. 20: 19-40. [22] Dash, N.S. and B.B. Chaudhuri, (2001) “Corpus-based empirical analysis of form, function and frequency of characters used in Bangla”, Published in Rayson, P. , Wilson, A. , McEnery, T. , Hardie, A. , and Khoja, S. , (eds.) Special issue of the Proceedings of the Corpus Linguistics 2001 Conference, Lancaster: Lancaster University Press. UK. 13: 144-157. 2001. [23] Dash, N.S. and B.B. Chaudhuri., (2002) Corpus generation and text processing, International Journal of Dravidian Linguistics. 31(1): 25-44. [24] Dash, N.S. and B.B. Chaudhuri, (2002) “Using Text Corpora for Understanding Polysemy in Bangla”, Proceedings of the Language Engineering Conference (LEC'02) IEEE. [25] Dolamic, L. and J. Savoy, (2010) “Comparative Study of Indexing and Search Strategies for the Hindi, Marathi and Bengali Languages”, ACM Transactions on Asian Language Information Processing, 9(3): 1-24. [26] Jurafsky, D. and J.H. Martin, (2000) Speech and Language Processing, ISBN 81-7808-594-1, Pearson Education Asia, page no: 604. [27] http://www.isical.ac.in/~lru/wordnetnew/
  • 5. WORD SENSE DISAMBIGUATION: A SURVEY Alok Ranjan Pal1 and Diganta Saha2 1 Dept. of Computer Science and Engg., College of Engg. and Mgmt, Kolaghat 2 Dept. of Computer Science and Engg., Jadavpur University, Kolkata ABSTRACT In this paper, we made a survey on Word Sense Disambiguation (WSD). Near about in all major languages around the world, research in WSD has been conducted upto different extents. In this paper, we have gone through a survey regarding the different approaches adopted in different research works, the State of the Art in the performance in this domain, recent works in different Indian languages and finally a survey in Bengali language. We have made a survey on different competitions in this field and the bench mark results, obtained from those competitions. Keywords: Natural Languages Processing, Word Sense Disambiguation For More Details: http://airccse.org/journal/ijctcm/papers/5315ijctcm01.pdf Volume Link: http://airccse.org/journal/ijctcm/vol5.html
  • 6. REFERENCES [1] Ide, N., Véronis, J., (1998) “Word Sense Disambiguation: The State of the Art”,Computational Linguistics, Vol. 24, No. 1, Pp. 1-40. [2] Cucerzan, R.S., C. Schafer, and D. Yarowsky, (2002) “Combining classifiers for word sense disambiguation”, Natural Language Engineering, Vol. 8, No. 4, Cambridge University Press,Pp. 327- 341. [3] Nameh, M. S., Fakhrahmad, M., Jahromi, M.Z., (2011) “A New Approach to Word Sense Disambiguation Based on Context Similarity”, Proceedings of the World Congress on Engineering,Vol. I. [4] Xiaojie, W., Matsumoto, Y., (2003) “Chinese word sense disambiguation by combining pseudo training data”,Proceedings of The International Conference on Natural Language Processing and Knowledge Engineering, Pp. 138-143. [5] Navigli, R. (2009) “Word Sense Disambiguation: a Survey”, ACM Computing Surveys, Vol. 41,No.2, ACM Press, Pp. 1-69. [6] Word Sense Disambiguation; Algorithms and Applications, Edited by Eneko Agirre and Philip Edmonds, Springer, VOLUME 33. [7] Seo, H., Chung, H., Rim, H., Myaeng, S. H., Kim, S., (2004) “Unsupervised word sense disambiguation using WordNet relatives”, Computer Speech and Language, Vol. 18, No. 3, Pp. 253-273. [8] Miller, G., (1991) “WordNet: An on-line lexical database”, International Journal of Lexicography,Vol.3,No. 4. [9] Kolte, S.G., Bhirud, S.G., (2008) “Word Sense Disambiguation Using WordNet Domains”, First International Conference on Digital Object Identifier, Pp. 1187-1191. [10] Liu, Y., Scheuermann, P., Li, X., Zhu, X. (2007) “Using WordNet to Disambiguate Word Senses for Text Classification”, Proceedings of the 7th International Conference on Computational Science, Springer-Verlag, Pp. 781 - 789. [11] Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J., (1990) “WordNet An on- line Lexical Database”, International Journal of Lexicography, 3(4): 235-244. [12] Miller, G.A., (1993) “WordNet: A Lexical Database”, Comm. ACM, Vol. 38, No. 11, Pp. 39- 41. [13] Cañas, A.J., A. Valerio, J. Lalinde-Pulido, M. Carvalho, and M. Arguedas, (2003) “Using WordNet for Word Sense Disambiguation to Support Concept Map Construction”, String Processing and Information Retrieval, Pp. 350-359.
  • 7. [14] Marine, C., Dekai, W.U.,(2005) “Word Sense Disambiguation vs. Statistical Machine Translation”, Proceedings of the 43rd Annual Meeting of the ACL , Ann Arbor, June 2005, pages 387–394. [15] http://www.ling.gu.se/~sl/Undervisning/StatMet11/wsd-mt.pdf date: 14/05/2015 [16] http://nlp.cs.nyu.edu/sk-symposium/note/P-28.pdf date: 14/05/2015 [17] Yee, S. C., Hwee, T. N., David, C., (2007) “Word Sense Disambiguation Improves Statistical Machine Translation”, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 33–40, Prague, Czech Republic, June 2007. [18] Sanderson, M.,(1994) “Word Sense Disambiguation and Information Retrieval”, Proceedings of the 17th Annual International ACM SIGIR conference on Research and Development in Information Retrieval, SIGIR’94, July 03-06, Dublin, Ireland, Springer, New York, pp 142- 151. [19] Christopher, S., Michael, P. O., John, T.,(2003) “Word Sense Disambiguation in Information Retrieval Revisited”, SIGIR’03, July 28–August 1, 2003, Toronto, Canada. [20] http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.65.6828&rep=rep1&type=pdf date:4/05/2015 [21] http://www.aclweb.org/anthology/P12-1029date: 14/05/2015 [22] https://www.comp.nus.edu.sg/~nght/pubs/esair11.pdf date: 14/05/2015 [23] http://cui.unige.ch/isi/reports/2008/CLEF2008-LNCS.pdf date: 14/05/2015 [24] Banerjee, S., Pedersen, T.,(2002) "An adapted Lesk algorithm for word sense disambiguation using WordNet", In Proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, February. [25] Lesk, M.,(1986) "Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone", Proceedings of SIGDOC. [26] http://www.dlsi.ua.es/projectes/srim/publicaciones/CICling-2002.pdf date: 14/05/2015 [27] Mittal, K. and Jain, A.,(2015)“WORD SENSE DISAMBIGUATION METHOD USING SEMANTIC SIMILARITY MEASURES AND OWA OPERATOR”, ICTACT JOURNAL ON SOFT COMPUTING: SPECIAL ISSUE ON SOFT –COMPUTING THEORY, APPLICATION AND IMPLICATIONS IN ENGINEERING AND TECHNOLOGY, JANUARY, 2015, VOLUME: 05, ISSUE: 02. [28] http://www.d.umn.edu/~tpederse/Pubs/cicling2003-3.pdf%date: 14/05/2015 [29] http://www.aclweb.org/anthology/U04-1021date: 14/05/2015 [30] http://www.aclweb.org/anthology/C10-2142date: 14/05/2015
  • 8. [31] Diana, M.C., Carroll, J., “Disambiguating Nouns, Verbs, and Adjectives Using Automatically Acquired Selectional Preferences”, Computational Linguistics, Volume 29, Number 4, pp. 639-654. [32] Patrick, Y. and Timothy, B.,(2006) “Verb Sense Disambiguation Using Selectional Preferences Extracted with a State-of-the-art Semantic Role Labeler”, Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pages 139–148. [33] http://link.springer.com/article/10.1023/A%3A1002674829964#page-1 date: 14/05/2015