Menu                
                
            Publications                
            2025
                    
                                        
                        2024
                    
                                        
                        2023
                    
                                        
                        2022
                    
                                        
                        2021
                    
                                        
                        2020
                    
                                        
                        2019
                    
                                        
                        2018
                    
                                        
                        2017
                    
                                        
                        2016
                    
                                        
                        2015
                    
                                        
                        2014
                    
                                        
                        2013
                    
                                        
                        2012
                    
                                        
                        2011
                    
                                        
                        2010
                    
                                        
                        2009
                    
                                        
                        2008
                    
                                        
                        2007
                    
                                        
                        2006
                    
                                        
                        2005
                    
                                        
                        2004
                    
                                        
                        2003
                    
                                        
                        2002
                    
                                        
                        2001
                    
                                Editor-in-Chief                
             
                    Nikiforov
Vladimir O.
D.Sc., Prof.
Partners                
            doi: 10.17586/2226-1494-2018-18-5-863-869
	FEATURES OF NON-LOCAL SEMANTIC LINKS IN RUSSIAN TEXTS
Read the full article
 ';
';
					
	
	        Article in  Russian
		
For citation:
		        
Abstract
 
		
For citation:
	Boyarsky K.K., Kanevsky E.A. Features of non-local semantic links in Russian texts. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2018, vol. 18, no. 5, pp. 863–869 (in Russian). doi: 10.17586/2226-1494-2018-18-5-863-869
Abstract
	Subject of Research. One of the ways of automatic text analysis is the construction of subordination trees, in which the words of a sentence are connected with each other by semantic-syntactic links. The field of research is Russian-language texts, which have a general political, artistic and highly specialized character. Special attention is paid to the cases when the words are connected being far from each other at a considerable distance. Method. The subordination trees were built with the help of semantic-syntactical parser.Then the calculation of the distribution of links of different types by lengths was performed. The appearance frequencies of nonlocal links are studied. Main Results. It is shown that the fraction of non-local connections depending on the type can reach up to tens of percent. This is especially important for links coming from predicate nodes (subject, adverbial, etc.), as well as for anaphoric ones. It is noted that publicly available semantic classifiers and thesaurus have limited applicability for solving the problem of correct linking of remoted words in a sentence. Practical Relevance. It is shown that when solving the problem of extracting information that is ontological or scenario-based, as well as coreference, the long syntactic links that form the non-local semantic context cannot be neglected. The conclusion is drawn that the analysis of n-grams only is insufficient for the adequate selection of information from the text that is ontological or scenario. In this regard, there is a need to compile micro-dictionaries, focused on certain syntactic structures.
	        Keywords: semantic-syntactical analysis, syntactical links, subordination tree, n-grams, coreference		        
References
    
        References
- 
		Barsegyan A.A., Kupriyanov M.S., Stepanenko V.V., Kholod I.I. Data Analysis Technologies:Data Mining, Visual Mining, Text Mining, OLAP. 2nd ed. St. Petersburg, BKhV-Peterburg Publ., 2007. (in Russian)
- 
		Bol'shakova E.I., Baeva N.V., Bordachenkova E.A., Vasil'eva N.E., Morozov S.S.Lexicosyntactic patterns for automatic text processing. Proc. Int. Conf. Dialogue2007. Moscow, 2007, pp. 70–75. (in Russian)
- 
		Kormacheva D., Pivovarova L., Kopotev M. Automatic collocation extraction and classification of automatically obtained bigrams. Proc. Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations. Tubingen, Germany, 2014, pp. 27–33.
- 
		Enikeeva E.V., Mitrofanova O.A. Russian collocation extraction based on word embeddings. Proc. Int. Conf. Dialogue 2017. Moscow, 2017, pp. 52–64.
- 
		Khomitsevich O., Boyarsky K., Kanevsky E., Bulusheva A., Mendelev V.S. Flexible context extraction for keywords in Russian automatic speech recognition results. Communications in Computer and Information Science, 2017, vol. 661, pp. 145–154. doi: 10.1007/978-3-319-52920-2_14
- 
		Dybina A. Development of a textual base on the basis of the analysis of the structure of the scientific text. International Journal Information Technologies & Knowledge, 2012, vol. 6, no. 1, pp. 93–99. (in Russian)
- 
		BoyarskyK., Kanevsky E. SemSin semantic and syntactic parser.Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2015, vol. 15, no. 5, pp. 869–876. (in Russian) doi: 10.17586/2226-1494-2015-15-5-869-876
- 
		Curti O. Modelli Navali. Enciclopedia del Modellismo Navale. Milano, 1980.
- 
		Rоммe М. L'Art de la Marine, оu Principes еt Préceptes Generaux dеl'Art de Construire, d'Armer, de Manœuvrer et de Conduire dеs Vasseaux. La Rochelle, 1787, 542 p.
- 
		Pivovarova L., Pronoza E., Yagunova E., Pronoza A. ParaPhraser: Russian paraphrase corpus and shared task. Communications in Computer and Information Science, 2018, vol. 789, pp. 211–225. doi: 10.1007/978-3-319-71746-3_18
- 
		BoyarskyK., Kanevsky E. Effect of semantic parsing depth on the identification of paraphrases in Russian texts. Communications in Computer and Information Science, 2018, vol. 789, pp. 226–241. doi: 10.1007/978-3-319-71746-3_19
- 
		Kobzareva T.Yu. Вuilding and use of projective fragments of attributive noun and prepositionalphrases.Proc. Int. Conf. Dialogue2007. Moscow, 2007, pp. 242–249. (in Russian)
- 
		Rogozhnikova R.P. Explanatory Dictionary of Combinations Equivalent to Word. Moscow, Astrel'-AST Publ., 2003, 416 p. (in Russian)
- 
		Lukashevich N.V. Thesaurus in Information Retrieval Problems. Moscow, MSU Publ., 2011, 512 p. (in Russian)
- 
		Tuzov V.A. Computer Semantics of Russian Language. St. Petersburg, SPbSU Publ., 2004, 400 p. (in Russian)
 
        
 
                         
                         
                         
                         
                         
                         
                         
                         
                         
                        

