[05/14/2024] LLM as an Evaluator

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (arXiv.org)

Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement (arXiv.org)

