
Google at ACL 2025
Google at ACL 2025
The 63rd annual meeting of the Association for Computational Linguistics (ACL) is taking place in Vienna, Austria from July 27 to August 1. Google is proud to be a Diamond Level sponsor of ACL 2025, where researchers from Google Research, Google Deepmind and more will be contributing at all levels. As a leader in natural language processing and understanding, Google will showcase the latest research in the field with over 35 publications, and active involvement in a variety of workshops, orals, a keynote speaker, and several in-booth demo sessions.
Attending ACL 2025 in person? We hope that you’ll visit the Google booth (9D) to learn more about the projects at Google that go into solving interesting problems for billions of people. Visit the @GoogleResearch X and Google Research LinkedIn accounts for announcements about Google booth activities (e.g., demos and Q&A sessions, which are also listed below).
Continue below to learn more about how Google researchers are engaged at ACL 2025 (Google affiliations highlighted in bold).
All session times are provided in CET.
*Date, time and session location may be subject to change.
Google Booth Activities
*Dates and times may be subject to change. Stop by the Google booth (9D) for more details.
-
Tuesday, July 29 | 10:00 AM - 10:30 AM
ECLeKTic: A Benchmark for Evaluating Cross-Lingual Knowledge TransferPresenters: Omer Goldman, Reut Tsarfaty
-
Tuesday, July 29 | 3:30 PM -4:00 PM
Gemini CanvasPresenters: Heidi Zhang, Naman Goyal
Keynotes
Tue, Jul 29 | 9:00AM — 10:00AM, Level 2 Hall A
Whose Gold? Re-imagining Alignment for Truly Beneficial AI
Speaker: Verena Rieser
Panels
Mon, Jul 28 | 4:30PM — 5:30PM, Level 2 Hall A
Generalization on NLP Models
Panelist: Mirella Lapata
Orals
-
Mon, Jul 28 | 2:00PM — 3:30PM, Hall A
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark DatasetTobi Olatunji, Charles Nimo, Abraham Owodunni, Tassallah Abdullahi, Emmanuel Ayodele, Mardhiyah Sanni, Chinemelu Aka, Folafunmi Omofoye, Foutse Yuehgoh, Timothy Faniran, Bonaventure F. P. Dossou, Moshood Yekini, Jonas Kemp, Katherine Heller, Jude Chidubem Omeke, Chidi Asuzu MD, Naome A. Etori, Aimérou Ndiaye, Ifeoma Okoh, Evans Doe Ocansey, Wendy Kinara, Michael Best, Irfan Essa, Stephen Edward Moore, Chris Fourie, Mercy Nyamewaa Asiedu
-
Mon, Jul 28 | 2:00PM — 3:30PM, Hall M.2 (Session 3: Ethics, Bias, and Fairness) 5
Amplifying Trans and Nonbinary Voices: A Community-Centred Harm Taxonomy for LLMsEddie L. Ungless*, Sunipa Dev, Cynthia L. Bennett, Rebecca Gulotta, Jasmijn Bastings, Remi Denton
-
Tue, Jul 29 | 10:30AM — 12:00PM, Hall L (Session 7: IND Orals) 4
User Feedback Alignment for LLM-powered Exploration in Large-Scale Recommendation SystemsJianling Wang, Yifan Liu, Yinghao Sun, Xuejian Ma, Yueqi Wang, He Ma, Zhengyang Su, Minmin Chen, Mingyan Gao, Onkar Dalal, Ed H. Chi, Lichan Hong, Ningren Han, Haokai Lu
-
Tue, Jul 29 | 2:00PM — 3:30PM, Room 1.15-16 (Session 9: Multilingualism and Cross-Lingual NLP) 2
Data Quality Issues in Multilingual Speech Datasets: The Need for Sociolinguistic Awareness and Proactive Language PlanningMingfei Lau, Qian Chen, Yeming Fang, Tingting Xu, Tongzhou Chen, Pavel Golik
Accepted papers
Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models
Fei Wang*, Xingchen Wan, Ruoxi Sun, Jiefeng Chen, Sercan Ö. Arik
Bias in Language Models: Beyond Trick Tests and Towards RUTEd Evaluation
Kristian Lum, Jacy Reese Anthis, Kevin Robinson, Chirag Nagpal, Alexander D’Amour
BIG-Bench Extra Hard
Mehran Kazemi, Bahare Fatemi, Hritik Bansal, John Palowitch, Chrysovalantis Anastasiou, Sanket Vaibhav Mehta, Lalit K. Jain, Virginia Aglietti, Disha Jindal, Peter Chen, Nishanth Dikkala, Gladys Tyen, Xin Liu, Uri Shalit, Silvia Chiappa, Kate Olszewska, Yi Tay, Vinh Q. Tran, Quoc V. Le, Orhan Firat
Confidence Improves Self-Consistency in LLMs
Amir Taubenfeld, Tom Sheffer, Eran Ofek, Amir Feder, Ariel Goldstein, Zorik Gekhman, Gal Yona
ConSim: Measuring Concept-Based Explanations’ Effectiveness with Automated Simulatability
Antonin Poché, Alon Jacovi, Agustin Martin Picard, Victor Boutin, Fanny Jourdan
DARE: Diverse Visual Question Answering with Robustness Evaluation
Hannah Sterz, Jonas Pfeiffer, Ivan Vulić
Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling
Maximillian Chen*, Ruoxi Sun, Sercan Ö. Arik
Debiasing Online Preference Learning via Preference Feature Preservation
Dongyoung Kim, Jinsung Yoon, JinwooShin, Jaehyung Kim
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Sohee Yang, Nora Kassner, Elena Gribovskaya, Sebastian Riedel, Mor Geva
EditInspector: A Benchmark for Evaluation of Text-Guided Image Edits
Ron Yosef, Moran Yanuka, Yonatan Bitton, Dani Lischinski
Embedding-Converter: A Unified Framework for Cross-Model Embedding Transformation
Jinsung Yoon, Sercan Ö. Arik
Enhancing Human Evaluation in Machine Translation with Comparative Judgment
Yixiao Song*, Parker Riley, Daniel Deutsch, Markus Freitag
Entailed Between the Lines: Incorporating Implication into NLI
Shreya Havaldar*, Hamidreza Alvari, John Palowitch, Mohammad Javad Hosseini, Senaka Buthpitiya, Alex Fabrikant
Few-Shot Multilingual Open-Domain QA from 5 Examples
Fan Jiang, Tom Drummond, Trevor Cohn
FRACTAL: Fine-Grained Scoring from Aggregate Text Labels
Yukti Makhija, Priyanka Agrawal, Rishi Saket, Aravindan Raghuveer
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them
Abhilasha Ravichander, Shrusti Ghela, David Wadden, Yejin Choi
Help Me Write a Story: Evaluating LLMs’ Ability to Generate Writing Feedback
Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata
In Prospect and Retrospect: Reflective Memory Management for Long-Term Personalized Dialogue Agents
Zhen Tan*, Jun Yan, I-Hung Hsu, Rujun Han, Zifeng Wang, Long T. Le, Yiwen Song, Yanfei Chen, Hamid Palangi, George Lee, Anand Iyer, Tianlong Chen, Huan Liu, Chen-Yu Lee, Tomas Pfister
Magnet: Multi-Turn Tool-Use Data Synthesis and Distillation via Graph Translation
Fan Yin, Zifeng Wang, I-Hung Hsu, Jun Yan, Ke Jiang, Yanfei Chen, Jindong Gu, Long T. Le, Kai-Wei Chang, Chen-Yu Lee, Hamid Palangi, Tomas Pfister
MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation
Ching-Wen Yang, Zhi-Quan Feng, Ying-Jia Lin, Che Wei Chen, Kun-da Wu, Hao Xu, Jui-Feng Yao, Hung-Yu Kao
MDCure: A Scalable Pipeline for Multi-Document Instruction-Following
Gabrielle Kaili-May Liu, Bowen Shi, Avi Caciularu, Idan Szpektor, Arman Cohan
Optimizing Pre-Training Data Mixtures with Mixtures of Data Expert Models
Lior Belenki, Alekh Agarwal, Tianze Shi, Kristina Toutanova
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
Jaydeep Borkar, Matthew Jagielski, Katherine Lee, Niloofar Mireshghallah, David A. Smith, Christopher A. Choquette-Choo
Revisiting In-Context Learning with Long Context Language Models
Jinheon Baek*, Sun Jae Lee, Prakhar Gupta, Geunseob (GS) Oh, Siddharth Dalmia, Prateek Kolhar
R3Mem: Bridging Memory Retention and Retrieval via Reversible Compression
Xiaoqiang Wang, Suyuchen Wang, Yun Zhu*, Bang Liu
Self-play Through Computational Runtimes Improves Chart Reasoning
Tautvydas Misiunas, Hassan Mansoor, Jasper Uijlings, Oriana Riva, Victor Carbune
SIKeD: Self-guided Iterative Knowledge Distillation for Mathematical Reasoning
Shivam Adarsh, Kumar Shridhar, Caglar Gulcehre, Nicholas Monath, Mrinmaya Sachan
SkillVerse: Assessing and Enhancing LLMs with Tree Evaluation
Yufei Tian*, Jiao Sun, Nanyun Peng, Zizhao Zhang
Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions
Taedong Yun, Eric Yang, Mustafa Safdari, Jong Ha Lee, Vaishnavi Vinod Kumar, S. Sara Mahdavi, Jonathan Amar, Derek Peyton, Reut Aharony, Andreas Michaelides, Logan Schneider, Isaac Galatzer-Levy, Yugang Jia, John Canny, Arthur Gretton, Maja Matarić
Spectra 1.1: Scaling Laws and Efficient Inference for Ternary Language Models
Tejas Vaidhya, Ayush Kaushal, Vineet Jain, Francis Couture-Harpin, Prashant Shishodia, Majid Behbahani, Irina Rish, Yuriy Nevmyvaka
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs
Zhicheng Guo, Sijie Cheng, Yuchen Niu, Hao Wang, Sicheng Zhou, Wenbing Huang, Yang Liu
Substance Over Style: Evaluating Proactive Conversational Coaching Agents
Vidya Srinivas*, Xuhai Xu, Xin Liu, Kumar Ayush, Isaac Galatzer-Levy, Shwetak Patel, Daniel McDuff, Tim Althoff
Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications
Yanxiang Zhang, Zheng Xu, Shanshan Wu, Yuanbo Zhang, Daniel Ramage
TANQ: An Open Domain Dataset of Table Answered Questions
Mubashara Akhtar*, Chenxi Pang, Andreea Marzoca, Yasemin Altun, Julian Eisenschlos
Towards Geo-Culturally Grounded LLM Generations
Piyawat Lertvittayakumjorn, David Kinney, Vinodkumar Prabhakaran, Donald Martin Jr., Sunipa Dev
TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge
Cheng-Han Chiang, Hung-yi Lee, Michal Lukasik
WikiMixQA: A Multimodal Benchmark for Question Answering Over Tables and Charts
Negar Foroutan, Angelika Romanou, Matin Ansaripour, Julian Martin Eisenschlos, Karl Aberer, Rémi Lebret
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects
Daniel Deutsch, Eleftheria Briakou, Isaac Caswell, Mara Finkelstein, Rebecca Galor, Juraj Juraska, Geza Kovacs, Alison Lui, Ricardo Rei, Jason Riesa, Shruti Rijhwani, Parker Riley, Elizabeth Salesky, Firas Trabelsi, Stephanie Winkler, Biao Zhang, Markus Freitag
Workshops
-
Thu, Jul 31 — Fri, Aug 1 | 9:00AM — 5:30PM, Hall C
GEM2 Workshop: Generation, Evaluation & MetricsOrganizer: Oyvind Tafjord
-
Thu, Jul 31 — Fri, Aug 1 | 9:00AM — 6:00PM, Hall N2
The International Conference on Spoken Language Translation (IWSLT 2025)Organizer: Elizabeth Salesky
-
Thu, Jul 31 | 9:00AM — 5:45PM, Room 2.17
Natural Language Processing meets Climate Change WorkshopOrganizer: Markus Leippold
-
Thu, Jul 31 | 9:00AM — 6:00PM, Hall N1
NLP for Positive Impact WorkshopSpeaker: Vinodkumar Prabhakaran
-
Thu, Jul 31 | 8:45AM — 5:15PM, Room 1.61-62
REALM: Research on Agent Language Models WorkshopSpeaker: Roberta Raileanu
Organizer: Shikhar Murty
-
Thu, Jul 31 | 8:30AM — 6:00PM, Room 1.14
SDP 2025: Scholarly Document Processing WorkshopSpeaker: James A. Evans
-
Thu, Jul 31 | 9:00AM — 5:00PM, Room 2.15
Table Representation Learning WorkshopSpeaker: Ruoxi Sun
Organizer: Wenhu Chen
-
Fri, Aug 1 | 9:00AM — 5:00PM, Room 1.31-32
Large Language Model Memorization (L2M2) WorkshopSpeaker: Reza Shokri
Organizer: Yangsibo Huang
-
Fri, Aug 1 | 9:00AM — 5:30PM, Hall N1
Towards Knowledgeable Foundation Models WorkshopOrganizer: Mor Geva Pipek
Birds of a Feather and Affinity Group Events
Tue, Jul 29 | 2:00PM — 3:30PM, Room 1.31-1.32
Southeast Asian NLP Community, Projects, and Beyond
Organizer: Alham Fikri Aji
Board & Organizing Committee
-
Pushkar Mishra
- Board Member
-
Shruti Rijhwani
- Board Member
-
Joshua Maynez
- Program Committee
-
Verena Rieser
- Program Committee
-
Mor Geva Pipek
- Program Committee
-
Jasmijn Bastings
- Program Committee
-
Kalpesh Krishna
- Program Committee
-
Ranjay Krishna
- Program Committee
-
Fangyu Liu
- Program Committee
-
Emanuele Bugliarello
- Program Committee
-
Jonathan Berant
- Program Committee
-
Ivan Vulic
- Program Committee
-
Raj Dabre
- Program Committee
-
Mohammad Javad Hosseini
- Program Committee