DOI: 10.17586/2226-1494-2015-15-1-155-162


SERVICES OF FULL-TEXT SEARCHING IN A DISTRIBUTED INFORMATION ENVIRONMENT (PROJECT HUMANITARIANA)

S. K. Lyapin, A. V. Kukovyakin, I. A. Mbogo, I. I. Tolstikova, A. V. Chugunov


Article in Russian

For citation: Lyapin S.Kh., Kukovyakin A.V., Mbogo I.A., Tolstikova I.I., Chugunov A.V. Services of full-text searching in a distributed information environment (project Humanitariana). Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2015, vol. 15, no. 1, pp. 155–162

Abstract

Problem statement. We justify the applicability of full-text search services in both universal and specialized (in terms of resource base) digital libraries for extracting and analyzing contextual knowledge in the humanities. We describe the architecture and services of a virtual information and resource center for extracting knowledge from humanities texts, created within the «Humanitariana» project. We consider the functional integration of resources and services for full-text search in a distributed decentralized environment, organized in an Internet/Intranet architecture in which the client (user) browser controls access to a variety of independent servers. An algorithm for implementing a distributed full-text query is described.
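The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea of a client-controlled distributed query: the same query is sent to several independent servers and the hits are merged on the client side. The server names, the in-memory title lists, and the functions `search_server` and `distributed_search` are all hypothetical; a real client would send requests to each server's own search endpoint.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for independent library servers; a real client
# would issue requests to each server's search endpoint instead.
SERVERS = {
    "library_a": ["On loneliness", "On freedom"],
    "library_b": ["Loneliness in culture"],
}

def search_server(name, query):
    """Return (server, title) hits whose title contains the query."""
    return [
        (name, title)
        for title in SERVERS[name]
        if query.lower() in title.lower()
    ]

def distributed_search(query):
    """Send the same query to every server in parallel and merge the hits."""
    with ThreadPoolExecutor() as pool:
        per_server = pool.map(lambda name: search_server(name, query), SERVERS)
    # Merge on the client side; sorting gives a stable, server-independent order.
    return sorted(hit for hits in per_server for hit in hits)

results = distributed_search("loneliness")
```

Merging on the client keeps the servers independent of one another, which matches the decentralized setting described above; ranking across servers is deliberately left out of this sketch.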

Methods. Frequency-ranked and paragraph-oriented full-text queries are combined: the former are used for preliminary analysis of a subject area or of a combined work (explication of the "vertical" context, or macro context), the latter for explication of the "horizontal" context, or micro context, within an author's paragraph. The results of the frequency-ranked queries are used to compile the paragraph-oriented queries.
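One possible reading of this two-stage method can be sketched as follows. It assumes that a frequency-ranked query orders documents by how often a term occurs in them and that a paragraph-oriented query returns paragraphs in which all given terms co-occur. The function names, the toy library, and the blank-line paragraph delimiter are illustrative assumptions, not the T-Libra implementation.

```python
import re

def tokenize(text):
    """Lowercase word tokens; a real system would also handle morphology."""
    return re.findall(r"\w+", text.lower())

def frequency_ranked(library, term):
    """Rank documents by how often `term` occurs in them (macro context)."""
    counts = {title: tokenize(text).count(term) for title, text in library.items()}
    ranked = [(title, n) for title, n in counts.items() if n > 0]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)

def paragraph_oriented(text, terms):
    """Return paragraphs of `text` that contain every term (micro context)."""
    return [p for p in text.split("\n\n") if set(terms) <= set(tokenize(p))]

# Toy library: paragraphs are assumed to be separated by blank lines.
library = {
    "essay_a": "Loneliness shapes thought.\n\nLoneliness and freedom are linked.",
    "essay_b": "Freedom is rare.\n\nLoneliness is mentioned once here.",
    "essay_c": "Nothing relevant in this text.",
}

# Stage 1: the frequency-ranked query picks the most relevant document.
ranking = frequency_ranked(library, "loneliness")
top_title = ranking[0][0]

# Stage 2: the paragraph-oriented query is run inside that document.
hits = paragraph_oriented(library[top_title], ["loneliness", "freedom"])
```

Here the first stage narrows the search to `essay_a`, and the second stage isolates the single paragraph in which both terms occur together.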

Results. Results of textual research are presented for the topics "The question of fact in Russian philosophy" and "The question of loneliness in Russian philosophy and culture". About 50 pieces of contextual knowledge, drawn from a total resource base of about 2,500 full-text resources, have been explicated and briefly described for further expert investigation.

Practical significance. The proposed technology (advanced full-text search services in a distributed information environment) can be used for information support of research and education in the humanities, for functional integration of the resources and services of various organizations, and for interdisciplinary research.


Keywords: full-text searching, contextual knowledge explication, "horizontal" context, "vertical" context, functional integration of resources, decentralized distributed environment, metasearch engine 

Acknowledgements. This work was supported by a grant from the Russian Foundation for Humanities (RGNF grant No. 14-03-12017). We thank ITMO University (Saint Petersburg) for the opportunity to use its information and telecommunication infrastructure, and Konstanta LLC (Arkhangelsk) for providing specialized application software: the multifunctional information system T-Libra with advanced full-text search services.





This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Copyright 2001-2018 ©
Scientific and Technical Journal
of Information Technologies, Mechanics and Optics.
All rights reserved.
