Tim Franzmeyer

Hi! I am a fourth-year PhD student at Oxford University, interested in multi-agent systems, reinforcement learning, imitation learning, AI safety and LLMs. I am lucky to be working with Philip Torr, Joao Henriques, and Jakob Foerster.
I’m always interested in meeting new people and in new collaborations. If you’d like to get in touch with me, please email me at frtim at robots dot ox dot ac dot uk.
news
May 07, 2025 | New paper out from my Meta internship on preventing LLM Hallucinations with a model-specific finetuning method. |
---|---|
May 01, 2025 | Two papers accepted at ICML 2025! One paper with Felipe on attributing LLM answers to either finetuning or pretraining. Second paper is on RL for Quantum Physics, out of a great collaboration with Jan and Aniket. |
Nov 04, 2024 | Started my internship at Google DeepMind in Zurich. Working on LLM reasoning in the Gemini Post-Training Team with Vikas Yadav, Eric Malmi and Aliaksei Severyn. |
Jun 15, 2024 | Started my internship at Meta AI in Seattle. Working in the LLama Post-Training Safety Team with Yuning Mao, Luke Zettlemoyer and Madian Khabsa. |
Feb 09, 2024 | Recent paper on a live LLM benchmark based on Twitter Community notes and Wikipedia Page edits accepted to ACL 2024. Great colab with Suny. |
Feb 09, 2024 | Linas paper on rethinking out-of-distribution detection in RL will be at AAMAS 2024. |
Jan 08, 2024 | Two papers accepted at ICLR 2024! One paper on adversarial robustness in RL, the other paper is on imitating desired behaviors from multi-agent observations. |
Sep 23, 2023 | Joint paper with Felipe on extracting reward functions from Diffusion Models accepted at Neurips 2023! |
Sep 20, 2022 | Will be presenting my recent paper on Cross-Domain Imitation Learning at NeurIPS 2022. |
Jun 14, 2022 | Workshop paper on adversarial robustness accepted at ICML 2022. |
Jun 10, 2022 | New website is live! |
selected publications
- Measuring the Contribution of Fine-Tuning to Individual Responses of LLMsInternational Conference on Machine Learning (ICML), 2025
- Reinforcement Learning for Quantum Control under Physical ConstraintsInternational Conference on Machine Learning (ICML), 2025
- HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia editsAssociation for Computational Linguistics (ACL), 2024
- Select to Perfect: Imitating desired behavior from large multi-agent dataIn International Conference on Learning Representations (ICLR), 2024
- Illusory Attacks: Information-theoretic detectability matters in adversarial attacksIn International Conference on Learning Representations (ICLR), 2024
- Rethinking out-of-distribution detection for reinforcement learning: Advancing methods for evaluation and detectionarXiv preprint arXiv:2404.07099, 2024
- Extracting Reward Functions from Diffusion ModelsIn Advances in Neural Information Processing Systems (NeurIPS), 2023
- Learn what matters: cross-domain imitation learning with task-relevant embeddingsIn Advances in Neural Information Processing Systems (NeurIPS), 2022
- Learning Altruistic Behaviours in Reinforcement Learning without External RewardsInternational Conference on Learning Representations (ICLR), 2022