AgentBench: Evaluating LLMs as Agents

arxiv.org

Xiao Liu AgentBench: Evaluating LLMs as Agents
