Please refer to Google Scholar for the latest updates.
- Position: Auditing Is Not Evaluating; LLM Audit Requires Dynamic, Contextual, Budget-Aware and Reliable Evidence
Cléa Chataigner, Pablo Piantanida, Golnoosh Farnadi
Preprint
[pdf]
- Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework
Cléa Chataigner, Rebecca Ma, Prakhar Ganesh, Yuhao Chen, Afaf Taïk, Elliot Creager, Golnoosh Farnadi
To appear in EACL26 main proceedings, selected for oral presentation
[arxiv]
- Multilingual Hallucination Gaps in Large Language Models
Cléa Chataigner, Afaf Taïk, Golnoosh Farnadi
Proceedings of Machine Learning Research 279, 133-155, 2024
[arxiv] [pdf] [poster]
