news

May 07, 2025 New paper out from my Meta internship on preventing LLM hallucinations with a model-specific finetuning method.
May 01, 2025 Two papers accepted at ICML 2025! The first, with Felipe, is on attributing LLM answers to either finetuning or pretraining. The second is on RL for quantum physics, out of a great collaboration with Jan and Aniket.
Nov 04, 2024 Started my internship at Google DeepMind in Zurich. Working on LLM reasoning in the Gemini Post-Training Team with Vikas Yadav, Eric Malmi and Aliaksei Severyn.
Jun 15, 2024 Started my internship at Meta AI in Seattle. Working in the Llama Post-Training Safety Team with Yuning Mao, Luke Zettlemoyer and Madian Khabsa.
Feb 09, 2024 Recent paper on a live LLM benchmark based on Twitter Community Notes and Wikipedia page edits accepted to ACL 2024. Great collab with Suny.
Feb 09, 2024 Linas's paper on rethinking out-of-distribution detection in RL will be at AAMAS 2024.
Jan 08, 2024 Two papers accepted at ICLR 2024! One is on adversarial robustness in RL; the other is on imitating desired behaviors from multi-agent observations.
Sep 23, 2023 Joint paper with Felipe on extracting reward functions from diffusion models accepted at NeurIPS 2023!
Sep 20, 2022 Will be presenting my recent paper on Cross-Domain Imitation Learning at NeurIPS 2022.
Jun 14, 2022 Workshop paper on adversarial robustness accepted at ICML 2022.
Jun 10, 2022 New website is live!