Representation of indirect senses of adjectives in WordNet and RuWordNet
Abstract
Much research has been devoted to the ways polysemous words are represented in electronic linguistic databases, ontologies and thesauri. This study reports on an ongoing project (since 2018) that aims to provide a Complex Analysis of the Structure and Content of the RuWordNet Thesaurus. The paper focuses on figurative language, in particular the indirect meanings of polysemous adjectives, and compares how they are represented in the English WordNet and the Russian RuWordNet. We use the term ‘indirect sense’ to denote non-literal meanings that have developed from a word's primary meaning. To collect the data, we applied the continuous sampling method, extracted 20 polysemous adjectives from WordNet and traced their equivalents in RuWordNet. The research data was limited to two groups of adjectives: color terms and adjectives describing the weather. Comparing the adjectives in the two lexical databases showed that indirect senses make up 77.8% of all senses in WordNet and 48.9% in RuWordNet. The analysis indicates that the two databases present the indirect senses of polysemous adjectives differently, and that the lower ratio for RuWordNet is explained by its use of hypernym/hyponym relations. The study also showed that the color terms presented in both databases reflect certain differences between the English and Russian linguistic world views.
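The ratio reported in the abstract (indirect senses as a share of all senses) can be illustrated with a minimal sketch. It assumes each sense has already been hand-labeled as direct or indirect; the `Sense` class, the `indirect_ratio` function and the toy entries are hypothetical placeholders for illustration and are not taken from the study's data or from either database.

```python
from dataclasses import dataclass


@dataclass
class Sense:
    gloss: str       # short definition of the sense
    indirect: bool   # True if an annotator judged the sense non-literal (assumed manual label)


def indirect_ratio(entries: dict[str, list[Sense]]) -> float:
    """Return indirect senses as a percentage of all senses across all adjectives."""
    total = sum(len(senses) for senses in entries.values())
    if total == 0:
        raise ValueError("no senses to count")
    indirect = sum(1 for senses in entries.values() for s in senses if s.indirect)
    return 100.0 * indirect / total


# Toy, made-up annotations (NOT the study's data): 5 senses, 3 labeled indirect.
toy_entries = {
    "cold": [
        Sense("having a low temperature", indirect=False),
        Sense("lacking warmth of feeling", indirect=True),
        Sense("no longer fresh, as of a trail", indirect=True),
    ],
    "green": [
        Sense("of the color between blue and yellow", indirect=False),
        Sense("inexperienced", indirect=True),
    ],
}

print(f"{indirect_ratio(toy_entries):.1f}%")
```

Pooling the counts over all adjectives, as done here, is one possible reading of "ratio of indirect senses to the total number of senses"; averaging per-adjective ratios would generally give a different figure.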
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
ISSN 1305-578X (Online)
Copyright © 2005-2022 by Journal of Language and Linguistic Studies