SeminarsView all Seminars | Download ICal for this event
Novel Neural Architecture for Multi-Hop Question Answering
Series: M.Tech (Research) Colloquium
Speaker: Mr. G P Shrivatsa Bhargav M.Tech. (Research) Student Dept. of CSA
Date/Time: Jan 21 11:00:00
Location: CSA Seminar Hall (Room No. 254, First Floor)
Faculty Advisor: Prof. Shirish K Shevade
Natural language understanding has been one of the key drivers responsible for advancing the field of AI. To this end, automated Question Answering (QA) has served as an effective way of measuring the language understanding capabilities of AI systems. Our focus in this thesis is on Reading Comprehension style Question Answering (RCQA) task. Reading comprehension is the ability to understand natural language text and answer questions over it. Specifically, we focus on complex questions that require multi-hop reasoning over facts spread across multiple passages. Recently, there has been a surge in the research activities surrounding RCQA task, primarily due to the emergence of large-scale public datasets. For single-hop RCQA datasets, majority of the proposed solutions are based on massively pre-trained Transformer-style models such as BERT. Some of these solutions have exhibited human level performance. Similar solutions have been proposed for the multi-hop RCQA datasets and they have also improved the state-of-the-art. However, we believe that the core challenges involved in the multi-hop RCQA task have not been addressed effectively by existing solutions and hence there is an opportunity to advance the state-of-the-art. We present a novel deep neural architecture, called TAP (Translucent Answer Prediction), to identify answers and evidence (in the form of supporting facts) in an RCQA task requiring multi-hop reasoning. TAP comprises two loosely coupled networks -- Local and Global Interaction eXtractor (LoGIX) and Answer Predictor (AP). LoGIX predicts supporting facts, whereas AP consumes these predicted supporting facts to predict the answer span. The design of LoGIX is inspired by two key design desiderata -- local context and global interaction -- that we identified by analyzing examples of multi-hop RCQA task. The loose coupling between LoGIX and the AP reveals the set of sentences used by the AP in predicting an answer. Therefore, answer predictions of TAP can be interpreted in a translucent manner. We conduct extensive evaluations and analyses on the HotpotQA dataset to understand the characteristics of TAP. TAP achieved state-of-the-art performance on the distractor setting of the HotpotQA dataset.