The Impact of Lexical and Grammatical Processing on Generating Code from Natural Language

Title: The Impact of Lexical and Grammatical Processing on Generating Code from Natural Language
Publication Type: Conference Paper
Year of Conference: 2022
Authors: Beau, Nathanaël, and Benoît Crabbé
Conference Name: Findings of the Association for Computational Linguistics: ACL 2022
Pagination: 2204–2214
Date Published: 05-2022
Publisher: Association for Computational Linguistics
Conference Location: Dublin, Ireland
Abstract

Considering the seq2seq architecture of Yin and Neubig (2018) for natural language to code translation, we identify four key components of importance: grammatical constraints, lexical preprocessing, input representations, and copy mechanisms. To study the impact of these components, we use a state-of-the-art architecture that relies on a BERT encoder and a grammar-based decoder for which a formalization is provided. The paper highlights the importance of the lexical substitution component in current natural language to code systems.