APPLICATION OF INFORMATION TECHNOLOGIES FOR SEMANTIC TEXT PROCESSING

O. Kravchenko, Zh. Plakasova, M. Gladka, A. Karapetyan, S. Besedina

Abstract


An expert system for text analysis based on the heuristic knowledge of an expert linguist is proposed. The work further develops methods of computer-aided linguistic analysis of text and belongs to the field of computational linguistics, where it helps to automate text analysis. The main purpose of the research is to improve the machine’s understanding of the semantic structure of a text by identifying the connections between the principal parts of the sentence, the connections between its secondary parts, the best-fitting concept for the current word, and the function of the current word in the sentence. The software solution uses semantic networks and was built with the NetBeans IDE 8.1 Java development environment and the CLIPS expert system shell. The article gives the algorithm of the system’s operation, describes the sequence of steps in the text analysis process, and outlines the main logical connections and structure of the program. The methods and relations are considered, and the data verified, on the example of the Germanic language group. The languages of this group are similar in that they share a direct word order: subject + predicate + subordinate clauses. Thus, to reflect the structure of the Germanic languages, it is sufficient to consider one of them, namely English, because it is the most widely spoken (1.5 billion people), serves as an international language, has the largest vocabulary in the group (500 thousand words) and, in our opinion, is the most complex.

Keywords


computational linguistics, semantics, text, method, expert system, machine text analysis.

(P): 2707-9031
(E): 2707-904X


Nur-Sultan
EXPO Business Center, Block C.1
Kazakhstan, 010000

sjaitu@astanait.edu.kz