Abstract
Elasticsearch is one of the most popular full-text search and analytics engine. It can store, search, and analyze big volumes of data in near real time. Before searching, Elasticsearch will build multiple inverted indexes for the data. The analyzer plays a crucial role in this process. An appropriate analyzer can segment text into semantically meaningful words, which can significantly improve the query accuracy. However, the default analyzer has limited performance in the Chinese context. Existing methods generally replace the default analyzer with manual configuration to optimize the query effect. The cost of service downtime caused by manually updating analyzers is often unacceptable in production environments. Based on Elasticsearch’s Restful-API, we have implemented a framework for dynamic configuration of analyzers in a cluster environment. The framework supports common Chinese analyzers and provides a visual interface. Experiments show that the framework proposed in this paper reduces the update and maintenance time cost of the analyzer in the online environment by 94% compared to manual update. At the same time, compared with the default analyzer configuration of Elasticsearch, the accuracy of the system based on this framework is improved by 30%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Elasticsearch Guide. https://www.elastic.co/guide/index.html.. Accessed 19 Mar 2022
Divya, M.S., Goyal, S.K.: ElasticSearch: an advanced and quick search technique to handle voluminous data [J]. Compusoft 2(6), 171 (2013)
Qiu, Q., Xie, Z., Liang W., Li, W.: DGeoSegmenter: a dictionary-based Chinese word segmenter for the geoscience domain[J]. Comput. Geosci. 121, 1–11 2018
Huihui, S., Xin. N.: Research and Implementation of Chinese Automatic Word Segmentation System Based on Complex Network Features [J]. Wireless Commun. Mobile Comput. 2022, 1–10 (2022)
Lu, H., Hong, Y., Yang, Y., Duan, L., Badar, N.: Towards user-oriented RBAC model. In: Wang, L., Shafiq, B. (eds.) DBSec 2013. LNCS, vol. 7964, pp. 81–96. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39256-6_6
Takase, W., Nakamura, T., Watase, Y., et al.: A solution for secure use of Kibana and Elasticsearch in multi-user environment[J]. arXiv preprint arXiv:1706.10040 (2017)
Wei, B., Dai, J., Deng, L., et al.: An Optimization Method for Elasticsearch Index Shard Number[C]. In: 2020 16th International Conference on Computational Intelligence and Security (CIS). IEEE, pp. 191–195 (2020)
Reelsen A. Using elasticsearch, logstash and kibana to create realtime dashboards J]
Varis, J.: Enhancing a product’s development and debugging with supporting product development [J] (2020)
Jibiao, J., Xinghua, Q.I.: Design and realization of online query system of Chinese-English terminology of Chinese medicine [J]. China Terminol. 24(2), 92 (2022)
Concepcion, A.I., Zeigler, B.P.: DEVS formalism: a framework for hierarchical model development[J]. IEEE Trans. Softw. Eng. 14(2), 228–241 (1988)
Cruz, J.P., Kaji, Y., Yanai, N.: RBAC-SC: role-based access control using smart contract [J]. Ieee Access 6, 12240–12251 (2018)
William Stallings. Role-Based Access Control in Computer Security. https://www.informit.com/articles/article.aspx?p=782116.. Accessed 19 Mar 2022
Pi, C., Nie, P., Feng, Y., Xu, L.: Design and implementation of annotation system for character behavior & event based on spring boot. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds.) WISA 2021. LNCS, vol. 12999, pp. 756–763. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87571-8_66
Chinotan. SpringBoot Actuator. https://my.oschina.net/u/3266761/blog/2960. Accessed 19 Mar 2022
Dhulavvagol, P.M., Bhajantri, V.H., Totad, S.G.: Performance Analysis of Distributed Processing System using Shard Selection Techniques on Elasticsearch [J]. Procedia Comput. Sci 167, 126–136 (2020)
Fan Zhang., F.: Design and Implementation of Physical Education Video Teaching System Based on Spring MVC Architecture [C]. Durham University, Suffolk University. In: Proceedings of the 4th International Conference on Information and Education Innovations (ICIEI 2019. Durham University, Suffolk University: SCIence and Engineering Institute (SCIEI), pp. 117–120 (2019)
Coronel, J.B., Mock, S.: Designsafe: using elasticsearch to share and search data on a science web portal[M]. In: Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact, pp. 1–3 (2017)
Arora, M., Kanjilal, U., Varshney, D.: Evaluation of information retrieval: precision and recall [J]. Int. J. Indian Cult. Bus. Manage. 12(2), 224–236 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sun, J., Nie, P., Xu, L., Zhang, H. (2022). Design and Implementation of Analyzer Management System Based on Elasticsearch. In: Zhao, X., Yang, S., Wang, X., Li, J. (eds) Web Information Systems and Applications. WISA 2022. Lecture Notes in Computer Science, vol 13579. Springer, Cham. https://doi.org/10.1007/978-3-031-20309-1_22
Download citation
DOI: https://doi.org/10.1007/978-3-031-20309-1_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20308-4
Online ISBN: 978-3-031-20309-1
eBook Packages: Computer ScienceComputer Science (R0)