
This user has not added any information to their profile yet.
Publications
- FaST: Feature-aware Sampling and Tuning for personalized preference alignment with limited data
 The Conference on Empirical Methods in Natural Language Processing (EMNLP), Suzhou, China, 4-9 November 2025
 
- Compositional preference models for aligning LMs
 The International Conference on Learning Representations (ICLR), Vienna, Austria, 7-11 May, 2024
 
- Compositional preference models for alignment with scalable oversight
 Socially Responsible Language Modelling Research (SoLaR)at NeurIPS 2023, 16 December 2023
 
- Aligning language models with preferences through 𝑓-divergence minimization
 International Conference on Machine Learning (ICML), 23-29 July 2023, Hawaii, USA. PMLR 202:11546-11583, 2023.
 
- Should you marginalize over possible tokenizations?
 61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, 9-14 July, 2023
 
- disco: a toolkit for DIStributional COntrol of generative models
 61st Annual Meeting of the Association for Computational Linguistics (ACL), Toronto, Canada, 9-14 July, 2023
 
- Aligning language models with preferences through f-divergence minimization
 arXiv:2302.08215
 
- On reinforcement learning and distribution matching for fine-tuning language models with no catastrophic forgetting
 Neural Information Processing Systems (NeurIPS), hybrid event, New Orleans, Louisiana, USA, 28 November – 9 December, 2022
 
- Beyond the imitation game: quantifying and extrapolating the capabilities of language models
 arXiv:2206.04615
 
- BLOOM: a 176B-parameter open-access multilingual language model
 arXiv:2211.05100
 
