| Title | The Impact of Lexical and Grammatical Processing on Generating Code from Natural Language |
| Publication Type | Article dans des actes |
| Année de la conférence | 2022 |
| Authors | Beau, Nathanaël, and Benoît Crabbé |
| Nom de la conférence | Findings of the Association for Computational Linguistics: ACL 2022 |
| Pagination | 2204–2214 |
| Date de publication | 05-2022 |
| Publisher | Association for Computational Linguistics |
| Conference Location | Dublin, Ireland |
| Abstract | Considering the seq2seq architecture of Yin and Neubig (2018) for natural language to code translation, we identify four key components of importance: grammatical constraints, lexical preprocessing, input representations, and copy mechanisms. To study the impact of these components, we use a state-of-the-art architecture that relies on BERT encoder and a grammar-based decoder for which a formalization is provided. The paper highlights the importance of the lexical substitution component in the current natural language to code systems. |