Seminars

View all Seminars  |  Download ICal for this event

Explainable and Efficient Neural models for Natural Language to Bash Command Translation

Series: M.Tech (Research)Thesis Defence -ONLINE

Speaker: Mr. Shikhar Bharadwaj, M.Tech (Research) student,Dept. of CSA

Date/Time: Jun 22 14:00:00

Location: Microsoft Teams - ON-LINE

Faculty Advisor: Prof. Shirish K Shevade

Abstract:
One of the key goals of Natural Language Processing is to make computers understand natural language. Semantic Parsing has been one of the driving tasks for Natural Language Understanding. It is formally defined as the task of generating meaning representation from natural language input. In this work, we focus on using the Bash command as the meaning representation. Bash is a Unix command language used for interacting with the Operating System. Recent works on natural language to Bash command translation have made significant advances on this problem. The best performing solutions employ a neural network architecture called the Transformer. In this work, we explore the aspects of explainability and efficiency for this task and use the Transformer as one of the baselines for comparing the proposed approaches.
In the first part, we utilize documentation data from Linux manual pages and the Abstract Syntax Tree for Bash to generate explanations for the translated Bash command. We propose a novel architecture that incorporates tree structure information in the Transformer and provides explanations for its predictions via alignment matrices between user invocation and manual page text. We find that the proposed method performs on par with the Transformer performance. Our method performs better than fine-tuned T5, a Transformer-based neural model pre-trained on a large amount of text data in a self-supervised manner.
In the second part, we use the problems inherent synchronous structure and propose the Segmented Invocation Transformer (SIT) that utilizes the information from the constituency parse tree of the natural language invocation. Our method is motivated by the alignment between segments in the natural language text and Bash command components. By utilizing this structure, the proposed method outperforms the state-of-the-art approach while achieving a 1.8x improvement in the inference time (as measured on a CPU) and a 5x reduction in model parameters. We also conduct an attribution analysis using Integrated Gradients to empirically confirm the identified structure and the ability of SIT to capture it.
Microsoft teams link:
https://teams.microsoft.com/l/meetup-join/19%3ameeting_MjZhOWViYzMtZmQ5MC00NzMwLWI3MjktODlhOGU4YjkxZGYz%40thread.v2/0?context=%7b%22Tid%22%3a%226f15cd97-f6a7-41e3-b2c5-ad4193976476%22%2c%22Oid%22%3a%229c3e2bfe-1b0f-4d7b-a589-832878069dc6%22%7d

Speaker Bio:

Host Faculty: