2024

  1. On Instruction-Finetuning Neural Machine Translation Models Ninth Conference on Machine Translation (WMT) [ abstract ]
  2. SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window Vikas Raunak, Tom Kocmi, Matt Post Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) [ abstract ] [ paper ]

2023

  1. Dissecting In-Context Learning of Translations in Large Language Models Vikas Raunak, Hany Hassan Awadalla, Arul Menezes Findings of the Association for Computational Linguistics: Empirical Methods in Natural Language Processing [ abstract ] [ paper ]
  2. Leveraging GPT-4 for Automatic Translation Post-Editing Vikas Raunak, Amr Sharaf, Hany Hassan Awadalla, Arul Menezes Findings of the Association for Computational Linguistics: Empirical Methods in Natural Language Processing [ abstract ] [ paper ]
  3. Evaluating Metrics for Document-context Evaluation in Machine Translation Vikas Raunak, Tom Kocmi, Matt Post Eight Conference on Machine Translation (WMT) [ abstract ] [ paper ]
  4. Do GPTs Produce Less Literal Translations? Vikas Raunak, Arul Menezes, Matt Post, Hany Hassan Awadalla The 61st Annual Meeting of the Association for Computational Linguistics (ACL) [ abstract ] [ paper ] [ code ]
  5. How Good are GPT Models at Machine Translation? A Comprehensive Evaluation Amr Hendy, Mohamed Abdelrehim, Amr Sharaf, Vikas Raunak, Mohamed Gabr, Hitokazu Matsushita, Young Jin Kim, Mohamed Afify, Hany Hassan Awadalla [ abstract ] [ paper ] [ code ]

2022

  1. Rank-One Editing of Encoder-Decoder Models Vikas Raunak, Arul Menezes 2nd Workshop on Interactive Learning for Natural Language Processing (InterNLP), NeurIPS [ abstract ] [ paper ]
  2. Operationalizing Specifications, In Addition to Test Sets for Evaluating Constrained Generative Models Vikas Raunak, Matt Post, Arul Menezes NeurIPS Workshop on Human Evaluation of Generative Models (HGEM) [ abstract ] [ paper ] [ talk ]
  3. Finding Memo: Extractive Memorization in Constrained Sequence Generation Tasks Vikas Raunak, Arul Menezes Findings of the Association for Computational Linguistics: Empirical Methods in Natural Language Processing [ abstract ] [ paper ] [ code ] [ talk ]
  4. SALTED: A Framework for SAlient Long-Tail Translation Error Detection. Vikas Raunak, Matt Post, Arul Menezes Findings of the Association for Computational Linguistics: Empirical Methods in Natural Language Processing [ abstract ] [ paper ] [ talk ] [ slides ]
  5. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model BigScience Workshop, Vikas Raunak, et al. Under Submission [ abstract ] [ paper ] [ code ]
  6. GEMv2: Multilingual NLG Benchmarking in a Single Line of Code. Sebastian Gehrmann, Vikas Raunak, et al. Empirical Methods in Natural Language Processing (EMNLP): System Demonstrations [ abstract ] [ paper ] [ code ]
  7. Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models. Aarohi Srivastava, Vikas Raunak, et al. Transactions on Machine Learning Research (TMLR) [ abstract ] [ paper ] [ code ]

2021

  1. NL-Augmenter: A framework for task-sensitive natural language augmentation. Kaustubh Dhole, Vikas Raunak, et al. Northern European Journal of Language Technology (NEJLT) [ abstract ] [ paper ] [ code ]
  2. The Curious Case of Hallucinations in Neural Machine Tanslation. Vikas Raunak, Arul Menezes, Marcin Junczys-Dowmunt Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) [ abstract ] [ paper ] [ code ] [ talk ] [ slides ] [ poster ]
  3. Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks. Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze, Shinji Watanabe Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) [ abstract ] [ paper ] [ code ] [ slides ] [ poster ]
  4. The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics Sebastian Gehrmann, Vikas Raunak, et al. 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM) [ abstract ] [ paper ] [ code ] [ media ]

2020

  1. On Long-Tailed Phenomena in Neural Machine Translation. Vikas Raunak, Siddharth Dalmia, Vivek Gupta, Florian Metze Findings of the Association for Computational Linguistics: Empirical Methods in Natural Language Processing [ abstract ] [ paper ] [ code ] [ talk ] [ slides ] [ poster ]
  2. Ranking Clarification Questions via Natural Language Inference. Vikas Raunak, Vaibhav Kumar, Jamie Callan 29th ACM International Conference on Information and Knowledge Management (CIKM) [ abstract ] [ paper ] [ talk ] [ slides ]
  3. On Dimensional Linguistic Properties of the Word Embedding Space. Vikas Raunak, Vaibhav Kumar, Vivek Gupta, Florian Metze 5th Workshop on Representation Learning for NLP (RepL4NLP) [ abstract ] [ paper ] [ code ] [ talk ] [ slides ]

2019

  1. On Compositionality in Neural Machine Translation. Vikas Raunak, Vaibhav Kumar, Florian Metze Context and Compositionality in Biological and Artificial Neural Systems Workshop, NeurIPS [ abstract ] [ paper ] [ slides ] [ poster ]
  2. On Leveraging the Visual Modality for Neural Machine Translation. Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze The 12th International Conference on Natural Language Generation (INLG) [ abstract ] [ paper ] [ slides ] [ poster ]
  3. On Leveraging Visual Modality for ASR Error Correction Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze Workshop on The How2 Challenge: New Tasks for Vision and Language, ICML [ abstract ] [ paper ]
  4. Learned in Speech Recognition: Contextual Acoustic Word Embeddings. Vikas Raunak, Shruti Palaskar, Florian Metze IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) [ abstract ] [ paper ] [ slides ]
  5. Effective Dimensionality Reduction for Word Embeddings Vikas Raunak, Vivek Gupta, Florian Metze 4th Workshop on Representation Learning for NLP (RepL4NLP) [ abstract ] [ paper ] [ code ]