Implementation of Named Entity Recognition for Developing Question Answering System: Case Study Merapi Volcano Museum

Arfiani Nur Khusna(1*), Okhy Kharisma Putri(2), Dimas Chaerul Ekty Saputra(3)

(1) Universitas Ahmad Dahlan
(2) Universitas Ahmad Dahlan
(3) Universitas Gadjah Mada
(*) Corresponding Author


The Merapi Volcano Museum is one of the places used as a means of knowledge and information about the mountain with the website address, namely Generally, the information provided causes website visitors to be dissatisfied with the information. The number of visitors who are dissatisfied with the information on the website is evidenced by the results of a questionnaire from 40 respondents, 50.55% of visitors do not get information that is not in accordance with what is desired. Therefore, a system is implemented using the Question Answering System (QAS) with the Named Entity Recognition (NER) method. The implementation of the system uses a telegram based on the NER methodology. Testing using White Box Testing. The results of testing and analysis of tests carried out with white box testing the system has 3 regions and 3 independent path, with path 1 = 1-2-3-4-11, path 2 = 1-2-3- 4-5 -6-7-8-11, and path 3 = 1-2-3-4-5-6-7-9-10-11. The 3 paths are able to return the right answer after being tested using test scenarios for each independent path.


Named Entity Recognitionl; Question Answering System; Museum; White-Box Testing; Dissatisfied Information

Full Text:



Azmi, N. S. A., Singkaravanit-Ogawa, S., Ikeda, K., Kitakura, S., Inoue, Y., Narusaka, Y., Shirasu, K., Kaido, M., Mise, K., & Takano, Y. (2018). Inappropriate expression of an NLP effector in Colletotrichum orbiculare impairs infection on cucurbitaceae cultivars via plant recognition of the C-terminal region. Molecular Plant-Microbe Interactions, 31(1), 101–111.

Bougar, M., & Ziyati, E. H. (2019). Stemming algorithm for arabic text using a parallel data processing. Advances in Intelligent Systems and Computing, 797(July), 261–268.

Brown, K., & Mairesse, F. (2018). The definition of the museum through its social role. Curator: The Museum Journal, 61(4), 525–539.

Garousi, V., Bauer, S., & Felderer, M. (2020). NLP-assisted software testing: A systematic mapping of the literature. Information and Software Technology, 126, 1–29.

Gusmita, R. H., Durachman, Y., Harun, S., Firmansyah, A. F., Sukmana, H. T., & Suhaimi, A. (2014). A rule-based question answering system on relevant documents of Indonesian Quran Translation. 2014 International Conference on Cyber and IT Service Management, CITSM 2014, 104–107.

Khusna, A. N., & Agustina, I. (2018). Implementation of Information Retrieval Using TF-IDF Weighting Method On Detik.Com’s Website. TSSA-IEEE.

Ladani, D. J., & Desai, N. P. (2020). Stopword Identification and Removal Techniques on TC and IR applications: A Survey. 2020 6th International Conference on Advanced Computing and Communication Systems, ICACCS 2020, 466–472.

Luan, Y., Wadden, D., He, L., Shah, A., Ostendorf, M., & Hajishirzi, H. (2019). A general framework for information extraction using dynamic span graphs. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 1, 3036–3046.

Pramanik, S., & Hussain, A. (2019). Text normalization using memory augmented neural networks. Speech Communication, 109, 15–23.

Prasad, G. N. R. (2021). Identification of Bloom ’ s Taxonomy level for the given Question paper using NLP Tokenization technique Turkish Journal of Computer and Mathematics Education Research Article Identification of Cognitive level of Question. Turkish Journal of Computer and Mathematics Education, 12(13), 1872–1875.

Project, M. D. (2019). N-Grams as a Measure of Naturalness and Complexity. Department of computer science and media technology (CM), Digitala Vetenskapliga Arkivet.

Qiu, M., Housh, M., & Ostfeld, A. (2020). A two-stage LP-NLP methodology for the least-cost design and operation of water distribution systems. Water (Switzerland), 12(5), 1–21.

Ramos-Merino, M., Álvarez-Sabucedo, L. M., Santos-Gago, J. M., & Sanz-Valero, J. (2018). A BPMN Based Notation for the Representation of Workflows in Hospital Protocols. Journal of Medical Systems, 42(10).

Sapitri, A. I., & Al-faraby, S. (2018). Analisis Metode Pattern Based Approach Question Answering System Pada Dataset Hukum Islam Berbasis Bahasa Indonesia. Media Informatika Budidarma (MIB), 2(4), 159–164.

Syaikhuddin, M. M., Anam, C., Rinaldi, A. R., & Conoras, M. E. B. (2018). Conventional Software Testing Using White Box Method. Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, 3(1), 65–72.

Wick, C., & Puppe, F. (2018). Fully Convolutional Neural Networks for Page Segmentation of Historical Document Images of Historical Document Images. In 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), 287–292.

Yadav, V., & Bethard, S. (2019). A survey on recent advances in named entity recognition from deep learning models. ArXiv Preprint ArXiv:1910.11470.

Yu, W., Wu, L., Deng, Y., Mahindru, R., Zeng, Q., Guven, S., & Jiang, M. (2020). A Technical Question Answering System with Transfer Learning. 92–99.

Zhang, N., Chen, X., Xie, X., Deng, S., Tan, C., Chen, M., Huang, F., Si, L., & Chen, H. (2021). Document-level Relation Extraction as Semantic Segmentation. 3999–4006.

Article Metrics

Abstract view : 14 times
PDF - 3 times



  • There are currently no refbacks.

Editorial Office of Journal of Intelligent Computing and Health Informatics (JICHI)

Universitas Muhammadiyah Semarang FT-FMIPA Building, 7nd Floor. 
Department of Informatics
Jl. Kedungmundu Raya No. 18, Kota Semarang, Prov. Jawa Tengah, Indonesia 50273 |
Facebook: (in progress)
Twitter: (in progress)

(024) 8445768
+6288215427973 (Whatsapp/SMS)
Web Analytics Made Easy - StatCounter
View My Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.