Training Specialised Chatbots on Ukrainian Scientific Text Corpora Using Transfer Learning

Liashenko, Roman O.; Semerikov, Serhiy O.

Будь ласка, використовуйте цей ідентифікатор, щоб цитувати або посилатися на цей матеріал: http://ds.knu.edu.ua/jspui/handle/123456789/9016

Назва:	Training Specialised Chatbots on Ukrainian Scientific Text Corpora Using Transfer Learning
Автори:	Liashenko, Roman O. Semerikov, Serhiy O.
Ключові слова:	chatbots transfer learning language models fine-tuning Ukrainian language scientific texts
Дата публікації:	8-тра-2025
Видавництво:	IEEE
Бібліографічний опис:	Training Specialised Chatbots on Ukrainian Scientific Text Corpora Using Transfer Learning / Roman O. Liashenko, Serhiy O. Semerikov // 2024 IEEE 19th International Conference on Computer Science and Information Technologies (CSIT), Lviv, Ukraine, 2024, pp. 1-4, doi: 10.1109/CSIT65290.2024.10982573
Короткий огляд (реферат):	This paper presents an extensive experimental study on applying transfer learning techniques to fine-tune pre-trained language models for creating chatbots specialised in Ukrainian scientific texts. The study involves curating two diverse text corpora from research publications, selecting appropriate base models, and fine-tuning them on the prepared datasets using the Hugging Face transformers library. The fine-tuned models are tested for generating chatbot responses to domain-specific user queries. Detailed analyses of the datasets, model architectures, fine-tuning process, and evaluation results are provided. The results demonstrate the feasibility and effectiveness of transfer learning for adapting language models to Ukrainian scientific texts and creating chatbots capable of meaningful dialogue in specialised domains. Challenges and future research directions are discussed.
Опис:	[1] R. Liashenko and S. Semerikov, “The Determination and Visualisation of Key Concepts Related to the Training of Chatbots,” in Information Technology for Education, Science, and Technics: Proceedings of ITEST 2024, Volume 2, ser. Lecture Notes on Data Engineering and Communications Technologies. Springer Cham, 2024, vol. 222. [Online]. Available: https://doi.org/10.1007/978-3-031-71804-5 8 [2] S. Patil, V. Mudaliar, and P. Kamat, “LSTM based Ensemble Network to enhance the learning of Long-term Dependencies in Chatbot,” International Journal of Automation and Smart Technology, vol. 12, no. 1, pp. 2286–2286, Jan. 2022. [Online]. Available: https://doi.org/10.5875/ausmt.v12i1.2286 [3] Z. Ji, “A Multi-modal Seq2seq Chatbot Framework,” in Proceeding of 2021 International Conference on Wireless Communications, Networking and Applications. Singapore: Springer Nature, 2022, pp. 225–233. [Online]. Available: https://doi.org/10.1007/978-981-19-2456-9 24 [4] D. Dharrao and S. Gite, “TherapyBot: a chatbot for mental well-being using transformers,” International Journal of Advances in Applied Sciences, vol. 13, no. 1, pp. 1–12, Mar. 2024. [Online]. Available: http://dx.doi.org/10.11591/ijaas.v13.i1.pp1-12 [5] T.-L. Chou and Y.-L. Hsueh, “A Task-oriented Chatbot Based on LSTM and Reinforcement Learning,” in Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval, ser. NLPIR ’19. New York, NY, USA: Association for Computing Machinery, 2019, p. 87–91. [Online]. Available: https://doi.org/10.1145/3342827.3342844 [6] A. Kansal, “Finetuning: The Theory,” in Building Generative AI-Powered Apps: A Hands-on Guide for Developers. Berkeley, CA: Apress, 2024, pp. 77–100. [Online]. Available: https://doi.org/10.1007/979-8-8688-0205-8 5 [7] Q.-D. L. Tran, A.-C. Le, and V.-N. Huynh, “Enhancing Conversational Model With Deep Reinforcement Learning and Adversarial Learning,” IEEE Access, vol. 11, pp. 75 955–75 970, 2023. [8] V. Ilievski, C. Musat, A. Hossman, and M. Baeriswyl, “Goal-Oriented Chatbot Dialog Management Bootstrapping with Transfer Learning,” in Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18. International Joint Conferences on Artificial Intelligence Organization, 7 2018, pp. 4115–4121. [Online]. Available: https://doi.org/10.24963/ijcai.2018/572 [9] T. Taulli, AI-Assisted Programming: Better Planning, Coding, Testing, and Deployment. Sebastopol, CA: O’Reilly Media, Inc., 2024. [Online]. Available: https://www.oreilly.com/library/view/ai-assisted-programming/9781098164553/ [10] A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever, “Language Models are Unsupervised Multitask Learners,” Feb. 2019. [Online]. Available: https://cdn.openai.com/better-language-models/language models are unsupervised multitask learners.pdf
URI (Уніфікований ідентифікатор ресурсу):	http://ds.knu.edu.ua/jspui/handle/123456789/9016
Розташовується у зібраннях:	Кафедра професійної та соціально-гуманітарної освіти

Файли цього матеріалу:

Файл	Опис	Розмір	Формат
183.pdf		197.46 kB	Adobe PDF	Переглянути/Відкрити

Показати повний опис матеріалу Перегляд статистики

Усі матеріали в архіві електронних ресурсів захищені авторським правом, всі права збережені.

DSpace JSPUI

DSpace зберігає і дозволяє легкий і відкритий доступ до всіх видів цифрового контенту, включаючи текст, зображення, анімовані зображення, MPEG і набори даних