Scientific Journal

Applied Aspects of Information Technology

METHOD OF SEARCHING TERM INTERPRETATIONS FOR DOMAIN DICTIONARIES, USED FOR DEVELOPING SOFTWARE
Abstract:

In this paper, a method of forming definitions of terms for a vocabulary of a subject domain using existing explanatory dictionaries is proposed. It is shown that with a combined search for terms and their interpretations, it is possible to find about ten percent of definitions, which is clearly not enough. A method of automated search for the interpretation of terms is proposed, involving the use of existing explanatory dictionaries. A mathematical model of the subject domain dictionary entry is proposed. A mathematical model of an explanatory dictionary entry is proposed, taking into account the headword, a variety of interpretations of the word, litters and stable phrases. A mechanism has been developed for extracting definitions of a term from an explanatory dictionary depending on the structure of its dictionary entry. An algorithm for automated search for definitions for single-word terms has been developed. An algorithm has been developed for the automated search for definitions for verbose terms, based on the selection of nouns from the term. A mechanism for assessing the quality of possible interpretations, depending on the occurrence of terms from the subject domain, is proposed. A mechanism has been developed for the choice of definitions, when the terms from the vocabulary of the subject domain and the explanatory dictionary are coinciding incompletely, which is based on the procedure of term decomposition, the search for partial interpretations and the synthesis of the resulting interpretation. The software developed that allows to organize the search for interpretations of terms both in local explanatory dictionaries (previously loaded into the system), and in online dictionaries. The expert’s task includes the evaluation the interpretations found and possibly editorial correction of them. Experimental evaluation of the effectiveness of the use of a software product showed a reduction in the expert’s working time compared to the “manual mode” by approximately four times.

 

Authors:
Keywords
DOI
//10.15276/aait.02.2019.1
References
1. Kungurcev, A. B., Potochnyak, Ya. V., & Silyaev, D. A. (2015). “Method of automated construction of explanatory dictionary of subject area”, Technology audit and production reserves, Vol. 2, Issue 2 (22), pp. 58-63, doi: https://doi.org/10.15587/2312-8372.2015.40895.
2. Liubchenko V., & Sulimova, I. (2017). “Examining the attributes of transitions between team roles in the software development project”. Eastern-European Journal of Enterprise Technologies, 1/3 (85). pp. 12-17, https://doi.org/10.15587/1729-4061.2017.91597.
3. (2006). “Best Practices for Data Dictionary Definitions and Usage Version 1.1.” URL: https://www.pnamp.org/sites/default/files/best_practices_for_data_dictionary_definitions_and_usage_version_1.1_2006-11-14.pdf.
4. Califf, M. E., & Mooney, R. J. (2003). “Bottom-up relational learning of pattern matching rules for information extraction”, Journal of Machine Learning Research, 4, pp. 177-210.
5. Hartmann, R. R. K. (2003). “Lexicography: Dictionaries, compilers, critics, and users. Routledge”, 412 p.
6. (2018). “Ways Data Dictionary Increases Software Developers Productivity”. URL: https://dataedo.com/blog/ways-data-dictionary-increases-software-developers-productivity.
7. Bessmertny, I. A., Nugumanova, A. B., Mansurova, M. Ye., & Baiburin, Ye. M. (2017). “Method of rare term contrastive extraction from natural language texts”, Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 17(1), 81-91. https://doi.org/10.17586/2226-1494-2017-17-1-81-91.
8. Dal, V. I. (2015). “Explanatory Dictionary of Russian language. Part 1. Russian”, 636 p.
9. Ozhegov, S. I. (2011). Tolkovyy slovar' russkogo yazyka, [Dictionary of the Russian language], Moscow, Russian Federation, Obrazovaniye, Oniks, 736 p. (in Russian).
10. Tolkovyy slovar' russkogo yazyka, [Dictionary of the Russian language] (in Russian). – Available at: http://vedu.ru/expdic. – Active link : 8.10.2018.
11. “Oxford Dictionaries Online, 2018, Oxford University Press”. – Available at : https://www.oxforddictionaries.com/. – Active link: 18.10.2018.
12. Slovar' na sayte Instituta yazykoznaniya im. A. A. Potebni, [The dictionary on the website of the Institute of Linguistics. A. A. Potebni] (in Ukrainian). – Available at : http://www.inmo.org.ua/sum.html. – Active link : 9.10.2018.
13. Hasan, K. (2014). “Automatic key phrase extraction: a survey of the state of the art”, Proc. 52nd Annual Meeting of the Association for Computational Linguistics, No. 1, pp. 1262-1273, doi: https://doi.org/10.3115/v1/p14-1119.
14. (2017). A. Melnikov et al., “On usage of machine learning for natural language processing tasks as illustrated by educational content mining”. Ontology of designing, Vol. 7(1), pp. 34-47, https://doi.org/10.18287/2223-9537-2017-7-1-34-47.
15. Braslavskii, P. & Sokolov, E. (2006). “Comparison of four methods for automatic two-word term extraction. Computational Linguistics and Intellectual Technologies”, proc. Int. Conf. Dialog, Moscow, Russian Federation, pp. 88-94
16. Vavilenkova, A. (2017). “Methods of identifying logical connections between parts of text documents”, Bulletin of the National Technical University “KhPI” Series: New solutions in modern technologies. Issue 7 (1229), pp. 118-122, doi: https://doi.org/ 10.20998/2413-4295.2017.07.16.
17. Chainikova, G. R. (2014). “Development of learner’s electronic dictionary of a thesaurus type as means of forming internal foreign-language lexicon”. Bulletin of the Perm National Research Polytechnic University, pp. 70-84.
18. Institut programmnykh sistem RAN. Issledovatel'skiy tsentr iskusstvennogo intellekta, slovar' prostykh slovosochetaniy, [Institute of Software Systems RAS. Artificial Intelligence Research Center, a dictionary of simple phrases], – Available at : http://ai-center.botik.ru/Airec/index. php/ru/resources/dictionaries/36-ru-collocation-dict – Active link : 12.11.2018 (in Russian).
19. Ushakov, D. N. (2009). Tolkovyy slovar' sovremennogo russkogo yazyka, [Explanatory Dictionary of Modern Russian], Izdatel'stvo: Al'ta-print; DOM XXI Vek. Moskva, Russian Federation, 1239 p. (in Russian).
20. Kungurtsev, O., Kovalchuk, S., Potochniak, Ia., & Shirokostup, M. (2016). “Creating the domain vocabulary on the basis of automated analysis of Ukrainian texts”, Technical Sciences and Technologies, No. 3 (5), pp. 164-174.
21. Kungurtsev, O., Zinovatnaya, S., Potochniak, Ia., & Kutasevych, M. (2018). “Development of information technology of term extraction from documents in natural language”, Eastern-European Journal of Enterprise Technologies, Vol 6, No. 2 (96), pp. 44-51, doi: https://doi.org/10.15587/1729-4061.2018.147978.
22. (2018). Slovnyk ukrayinsʹkoyi movy, [Dictionary of the Ukrainian language], – Available at: http:// http://sum.in.ua/. – Active link: 20.10.2018
(in Ukrainian).
23. Nielsen, Sandro. (2008). “The Effect of Lexicographical Information Costs on Dictionary Making and Use”, Lexicons, 18 (1). University of Aarhus, рр. 170-189.
24. Navigli, R., & Velardi, P. (2008). “From Glossaries to Ontologies: Extracting Semantic Structure from Textual Definitions”, IOS Press, pp. 71-87.
25. John P. Considine. (2009). “Dictionaries in Early Modern Europe: Lexicography and the Making of Heritage”, Cambridge University Press. Vol 19. pp. 509-515, https://doi.org/10.5788/19-0-452.
26. Bol'shakova, Ye. I., Klyshinskiy, E. S., Lande, D. V., Noskov, A. A., Peskova, O. V., & Yagunova Ye. V. (2011). Avtomaticheskaya obrabotka tekstov na yestestvennom yazyke i komp'yuternaya lingvistika, [Automatic natural language processing and computational linguistics], Moscow, Russian Federation, MIEM, 272 p. (in Russian).
27. Joanna, Olechno-Wasiluk. (2015). Struktura slovarnoy stat'i v slovare “Rossiya. Bol'shoy lingvostranovedcheskiy slovar, [The structure of an article entry in the dictionary Russian], Yezhegodnik Russko-pol'skogo Instituta, No. 1 (8) (in Poland).
28. (2006). “Cognitive Technologies. Software package of syntactic analysis and machine translation “Cognitive Dwarf””. – Available at : http://www.cognitive.ru. – Active link : 12.09.2018.
29. “Interpretatio”. – Available at: http://www.softholm.com/download-software-free16427.htm. – Active link : 2.10.2018.
Published:
Last download:
9 Dec 2019

[ © KarelWintersky ] [ All articles ] [ All authors ]
[ © Odessa National Polytechnic University, 2018.]