Software systems and computational methods
Reference:

Simankov V.S., Tolkachev D.M. Development of information-analytical system for obtaining relevant data and knowledge on the Internet

Abstract: The article is devoted to the development of algorithms and methodological provisions for obtaining relevant data and knowledge on the Internet. By relevant data and knowledge the authors mean the information needed to solve a particular problem or task. The article examines issues of semantic data compression, ensuring the semantic coherence of a text, determining the semantic similarity of texts or phrases, and automatically finding brief and accurate answers to questions. The research takes into account the peculiarities of the Internet as a source of huge amounts of unstructured information. The study uses a systems approach, the theory of algorithms, the algebra of logic, set theory and comparative analysis. The article presents a general algorithm for problem-oriented auto-reviewing. The authors address the problem of finding semantic relationships between sentences. The article describes techniques for generating an integrated review and for determining the semantic similarity of two texts. The authors developed an algorithm for finding answers to questions and present the results of building an information-analytical system for obtaining relevant data and knowledge on the Internet.


Keywords:

data, knowledge, Internet, search engines, problem-oriented auto-reviewing, semantic connections, pronominal anaphors, regular expressions, semantic similarity, ternary expression



This article is written in Russian.