Chatbot Arena.
An open platform for evaluating LLMs by human preference.
Large Language Models (LLMs) have unlocked new capabilities and applications; however, evaluating their alignment with human preferences still poses significant challenges.