May 07, 2025 | New paper out from my Meta internship on preventing LLM Hallucinations with a model-specific finetuning method. |
May 01, 2025 | Two papers accepted at ICML 2025! One paper with Felipe on attributing LLM answers to either finetuning or pretraining. Second paper is on RL for Quantum Physics, out of a great collaboration with Jan and Aniket. |
Nov 04, 2024 | Started my internship at Google DeepMind in Zurich. Working on LLM reasoning in the Gemini Post-Training Team with Vikas Yadav, Eric Malmi and Aliaksei Severyn. |
Jun 15, 2024 | Started my internship at Meta AI in Seattle. Working in the LLama Post-Training Safety Team with Yuning Mao, Luke Zettlemoyer and Madian Khabsa. |
Feb 09, 2024 | Recent paper on a live LLM benchmark based on Twitter Community notes and Wikipedia Page edits accepted to ACL 2024. Great colab with Suny. |
Feb 09, 2024 | Linas paper on rethinking out-of-distribution detection in RL will be at AAMAS 2024. |
Jan 08, 2024 | Two papers accepted at ICLR 2024! One paper on adversarial robustness in RL, the other paper is on imitating desired behaviors from multi-agent observations. |
Sep 23, 2023 | Joint paper with Felipe on extracting reward functions from Diffusion Models accepted at Neurips 2023! |
Sep 20, 2022 | Will be presenting my recent paper on Cross-Domain Imitation Learning at NeurIPS 2022. |
Jun 14, 2022 | Workshop paper on adversarial robustness accepted at ICML 2022. |
Jun 10, 2022 | New website is live! |