Tourism Information QA Datasets for Smart Tourism Chatbot
DOI:
https://doi.org/10.15379/ijmst.v10i1.1451
Keywords:
Tourism Information QA, Machine Reading Comprehension, Smart Tourism Chatbot, Pre-trained language model, Mobile App.
Abstract
Smart tourism uses artificial intelligence (AI) technology to provide easy and convenient travel services to tourists. A task-oriented chatbot system can deliver travel services that were previously offered through websites or apps more efficiently. In this paper, we develop a question answering (QA) dataset for an AI-based tourism information chatbot system. The tourism information QA dataset follows the KLUE MRC JSON format and is built from two sources: the tourism information database built for smart tourism apps and the tourism knowledge base built for rule-based chatbot services. So that the QA model can be applied to the smart tourism chatbot system together with the dialogue state tracking (DST) and named entity recognition (NER) models, we design the QA dataset to take into account the previously developed tourism information NER dataset and smart tourism DST dataset. We evaluate the tourism information QA dataset with the koBigBird model, which can handle sequences of up to 4,096 tokens, and obtain EM (Exact Match) and F1 scores of 96.85 and 98.84, respectively.
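As a rough illustration of the dataset format and the two metrics mentioned in the abstract, the following Python sketch builds one MRC-style record and scores a predicted answer. It is not the authors' code. The field names (`title`, `paragraphs`, `context`, `qas`, `answers`, `answer_start`) assume the SQuAD-style layout that KLUE MRC follows, the helper names `make_record`, `exact_match`, and `token_f1` are hypothetical, and the whitespace tokenization used for F1 is an assumption, since the abstract does not say how the scores were normalized or tokenized. The sample passage is invented for illustration.

```python
import json
from collections import Counter


def make_record(title, context, question, answer_text):
    """Build one SQuAD-style record (assumed layout, hypothetical helper)."""
    start = context.find(answer_text)
    if start < 0:
        raise ValueError("answer text must appear verbatim in the context")
    return {
        "title": title,
        "paragraphs": [{
            "context": context,
            "qas": [{
                "question": question,
                # answer_start is the character offset of the answer span
                "answers": [{"text": answer_text, "answer_start": start}],
            }],
        }],
    }


def exact_match(pred, gold):
    """1.0 if the stripped strings are identical, else 0.0."""
    return float(pred.strip() == gold.strip())


def token_f1(pred, gold):
    """Token-overlap F1 over whitespace tokens (tokenization is an assumption)."""
    p, g = pred.split(), gold.split()
    common = sum((Counter(p) & Counter(g)).values())
    if common == 0:
        return 0.0
    precision = common / len(p)
    recall = common / len(g)
    return 2 * precision * recall / (precision + recall)


# Invented sample passage, not taken from the dataset.
context = "The museum opens at 9 am and closes at 6 pm on weekdays."
record = make_record("Sample Museum", context,
                     "When does the museum open?", "9 am")
dataset = {"version": "sample", "data": [record]}
print(json.dumps(dataset, ensure_ascii=False, indent=2))
```

A prediction that contains the gold span plus extra tokens scores zero on EM but still earns partial credit on F1, which is why F1 is typically the higher of the two numbers.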