publications | Tim Franzmeyer

2025

High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

Tim Franzmeyer, Archie Sravankumar, Lijuan Liu, Yuning Mao, Rui Hou, Sinong Wang, Jakob N Foerster, Luke Zettlemoyer, and Madian Khabsa

arXiv preprint arXiv:2506.04051, 2025

Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs

Felipe Pinto Coelho Nuti, Tim Franzmeyer^†, and João F Henriques^†

International Conference on Machine Learning (ICML), 2025

Reinforcement Learning for Quantum Control under Physical Constraints

Jan Ole Ernst^*, Aniket Chatterjee^*, Tim Franzmeyer^*, and Axel Kuhn

International Conference on Machine Learning (ICML), 2025

2024

HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits

Tim Franzmeyer^*, Aleksandar Shtedritski^*, Samuel Albanie, Philip Torr, João F Henriques, and Jakob N Foerster

Association for Computational Linguistics (ACL), 2024

Select to Perfect: Imitating desired behavior from large multi-agent data

Tim Franzmeyer, Edith Elkind, Philip HS Torr, Jakob N Foerster^†, and João F Henriques^†

International Conference on Learning Representations (ICLR), 2024

Illusory Attacks: Information-theoretic detectability matters in adversarial attacks

Tim Franzmeyer, Stephen McAleer, João F Henriques, Jakob N Foerster, Philip HS Torr, Adel Bibi, and Christian Schroeder de Witt

International Conference on Learning Representations (ICLR), 2024

Rethinking out-of-distribution detection for reinforcement learning: Advancing methods for evaluation and detection

Linas Nasvytis, Kai Sandbrink, Jakob Foerster, Tim Franzmeyer^†, and Christian Schroeder de Witt^†

arXiv preprint arXiv:2404.07099, 2024