Code and Dataset for "Evaluating Large Language Models’ Ability to Generate Interpretive Arguments"
advancing-machine-human-reasoning-lab / llm-evaluation