A framework for question answering system using dynamic co-attention networks / by Swetha Busireddy.

Author/creator Busireddy, Swetha author.
Other author Gudivada, Venkat N. degree supervisor.
Other author East Carolina University. Department of Computer Science.
Format Theses and dissertations
Publication[Greenville, N.C.] : [East Carolina University], 2020.
Description42 pages : illustrations (some color)
Supplemental ContentAccess via ScholarShip
Subjects

Summary Question answering (QA) systems have evolved exponentially over the past few years and have reached a reliable human standard. Attention mechanisms, as well as other methods of deep learning, paved the way for this development. But, because of their single-pass nature, they are incapable of recovering from local maxima matching to incorrect answers. Dynamic coattention network (DCN) is used to answer this issue. But as it has only one layer, the ability of the DCN to write diverse input representations is limited. We proposed a few modifications to DCN to overcome these findings. First, we used a bidirectional long short-term memory network (biLSTM) to encode the question and document. Next, we applied the concept of self-attention to DCN by using multiple coattention layers. This helps the encoder to generate more profuse input representations. Lastly, we combine outputs from these layers; this improves the long-range dependencies. We built a question answering system based on this multiattention DCN and tested on one of our course documents. On Stanford question answering dataset (SQuAD), this system improves the F1 mean on validation to 79.9% from its previous state of art at 75.6%.
General notePresented to the faculty of the Department of Computer Science.
General noteAdvisor: Venkat Gudivada
General noteTitle from PDF t.p. (viewed March 8, 2021).
Dissertation noteComputer Science East Carolina University 2020.
Bibliography noteIncludes bibliographical references.
Technical detailsSystem requirements: Adobe Reader.
Technical detailsMode of access: World Wide Web.