Question Answering System to Answer Questions About Technical Documentation

Szymon Olewniczak; Michał Maciszka; Kamil Paluszewski; Grzegorz Pozorski; Wojciech Rosenthal; Łukasz Zaleski

doi:10.1007/978-3-031-70248-8_15

This article ventures into the realm of specialized AI systems for question answering, with a specific focus on programming languages, using Rust as the case study. Our research harnesses the capabilities of BERT, a leading model in natural language processing, to explore its effectiveness in interpreting and responding to complex, domain-specific queries. We have developed a novel dataset, derived from Rust's detailed documentation, which surpasses the usual input size for language models. This dataset serves as a foundation for evaluating BERT's performance in a domain-specific context, providing a new resource for testing question-answering systems and shedding light on their strengths and limitations in processing specialized technical information. In this paper, we proposed a solution based on retrieval-reader architecture, the fine-tuned RoBERTa model with the usage of the mentioned dataset, and conducted typical tests for said problem. It is shown, that domain-specific question-answering remains a challenging problem.

Authors

Additional information

DOI: Digital Object Identifier link open in new tab 10.1007/978-3-031-70248-8_15
Category: Aktywność konferencyjna
Type: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Language: angielski
Publication year: 2024

Source: MOSTWiedzy.pl - publication "Question Answering System to Answer Questions About Technical Documentation" link open in new tab

link open in new tab

Publications Repository - Gdańsk University of Technology

Treść strony