Relieving Polysemy Problem for Synonymy Detection

Dias, Gaël; Moraliyski, Rumen

doi:10.1007/978-3-642-04686-5_50

Gaël Dias²³ &
Rumen Moraliyski²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5816))

Included in the following conference series:

Portuguese Conference on Artificial Intelligence

1405 Accesses

Abstract

In order to automatically identify noun synonyms, we propose a new idea which opposes classical polysemous representations of words to monosemous representations based on the “one sense per discourse” hypothesis. For that purpose, we apply the attributional similarity paradigm on two levels: corpus and document. We evaluate our methodology on well-known standard multiple choice synonymy question tests and evidence that it steadily outperforms the baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Expert Assessment of Synonymic Rows in RuWordNet

A cascaded framework for identification and extraction of antonym for Turkish language

Article 01 August 2018

A Study on Chinese Synonyms: From the Perspective of Collocations

References

Landauer, T., Dumais, S.: A solution to plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review 104(2), 211–240 (1997)
Article Google Scholar
Freitag, D., Blume, M., Byrnes, J., Chow, E., Kapadia, S., Rohwer, R., Wang, Z.: New experiments in distributional representations of synonymy. In: Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL), Ann Arbor, Michigan, pp. 25–32 (2005)
Google Scholar
Miller, G.A., Chodorow, M., Landes, S., Leacock, C., Thomas, R.G.: Using a semantic concordance for sense identification. In: HLT 1994: Proceedings of the workshop on Human Language Technology, Morristown, NJ, USA, pp. 240–243. Association for Computational Linguistics (1994)
Google Scholar
Gale, W., Church, K.W., Yarowsky, D.: One sense per discourse. In: HLT 1991: Proceedings of the workshop on Speech and Natural Language, Morristown, NJ, USA, pp. 233–237 (1992)
Google Scholar
Moraliyski, R., Dias, G.: One sense per discourse for synonymy extraction (2006)
Google Scholar
Terra, E., Clarke, C.: Frequency estimates for statistical word similarity measures. In: Proceedings of HTL/NAACL 2003, Edmonton, Canada, pp. 165–172 (2003)
Google Scholar
Weeds, J., Weir, D., McCarthy, D.: Characterising measures of lexical distributional similarity. In: Proceedings of COLING 2004, Geneva, Switzerland (2004)
Google Scholar
Ehlert, B.: Making accurate lexical semantic similarity judgments using word-context co-occurrence statistics. Master’s thesis, University of California, San Diego (2003)
Google Scholar
Lin, D.: An information-theoretic definition of similarity. In: Proceedings of the 15th International Conference on Machine Learning, pp. 296–304. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Turney, P.D.: Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS (LNAI), vol. 2167, pp. 491–502. Springer, Heidelberg (2001)
Chapter Google Scholar
Turney, P.D., Littman, M.L., Bigham, J., Shnayder, V.: Combining independent modules in lexical multiple-choice problems. In: Recent Advances in Natural Language Processing III: Selected Papers from RANLP 2003, pp. 101–110 (2003)
Google Scholar
Jarmasz, M., Szpakowicz, S.: Roget’s thesaurus and semantic similarity. In: Proceedings of Conference on Recent Advances in Natural Language Processing (RANLP), Borovets, Bulgaria, pp. 212–219 (2004)
Google Scholar
Curran, J.R., Moens, M.: Improvements in automatic thesaurus extraction. In: Proceedings of the Workshop of the ACL Special Interest Group on the Lexicon (SIGLEX), Philadelphia, USA, pp. 59–66 (2002)
Google Scholar
Rapp, R.: Word sense discovery based on sense descriptor dissimilarity. In: Proceedings of the Ninth Machine Translation Summit, pp. 315–322 (2003)
Google Scholar
Fellbaum, C. (ed.): WordNet: an electronic lexical database. The MIT Press, Cambridge (1998)
MATH Google Scholar
Sahlgren, M., Karlgren, J.: Vector-based semantic analysis using random indexing for cross-lingual query expansion. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 169–176. Springer, Heidelberg (2002)
Chapter Google Scholar
Lewis, D.D., Yang, Y., Rose, T.G., Li, F.: Rcv1: A new benchmark collection for text categorization research. Journal of Machine Learning Research 5, 361–397 (2004)
Google Scholar
Liu, H.: Montylingua: An end-to-end natural language processor with common sense (2004), http://web.media.mit.edu/~hugo/montylingua
Weeds, J., Weir, D.: Co-occurrence retrieval: A flexible framework for lexical distributional similarity. Computational Linguistic 31(4), 439–475 (2005)
Article MATH Google Scholar
McCarthy, D., Koeling, R., Weeds, J., Carroll, J.: Unsupervised acquisition of predominant word senses. Comput. Linguist. 33(4), 553–590 (2007)
Article Google Scholar
Fisher, R.A.: Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population. Biometrika 10, 507–521 (1915)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Beira Interior, Covilhã, 6201-001, Portugal
Gaël Dias & Rumen Moraliyski

Authors

Gaël Dias
View author publications
You can also search for this author in PubMed Google Scholar
Rumen Moraliyski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IEETA/Department of Electronics & Telecommunication, University of Aveiro, Campus Santiago, P.O. Box, 3810-153, Aveiro, Portugal
Luís Seabra Lopes
LSE-IEETA/DETI, Universidade de Aveiro, Portugal
Nuno Lau
Universidade de Aveiro, Aveiro, Portugal
Pedro Mariano
School of Informatics, Indiana University, Bloomington, IN, USA, and Computational Biology Collaboratorium, Instituto ulbenkian da Ciencia, Portugal
Luís M. Rocha

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dias, G., Moraliyski, R. (2009). Relieving Polysemy Problem for Synonymy Detection. In: Lopes, L.S., Lau, N., Mariano, P., Rocha, L.M. (eds) Progress in Artificial Intelligence. EPIA 2009. Lecture Notes in Computer Science(), vol 5816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04686-5_50

Download citation

DOI: https://doi.org/10.1007/978-3-642-04686-5_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04685-8
Online ISBN: 978-3-642-04686-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Relieving Polysemy Problem for Synonymy Detection

Abstract

Access this chapter

Preview

Similar content being viewed by others

Expert Assessment of Synonymic Rows in RuWordNet

A cascaded framework for identification and extraction of antonym for Turkish language

A Study on Chinese Synonyms: From the Perspective of Collocations

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Relieving Polysemy Problem for Synonymy Detection

Abstract

Access this chapter

Preview

Similar content being viewed by others

Expert Assessment of Synonymic Rows in RuWordNet

A cascaded framework for identification and extraction of antonym for Turkish language

A Study on Chinese Synonyms: From the Perspective of Collocations

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.