Based on the information of the truth and the user’s past questions, you answer the user’s questions

You must fully understand, accurately interpret the information from the truth

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.AI · 2023-08-07 · unverdicted · novelty 8.0

AgentBench is a new multi-environment benchmark showing commercial LLMs outperform open-source models up to 70B parameters in agent tasks mainly due to better long-term reasoning and instruction following.

citing papers explorer

Showing 1 of 1 citing paper.

AgentBench: Evaluating LLMs as Agents cs.AI · 2023-08-07 · unverdicted · none · ref 38
AgentBench is a new multi-environment benchmark showing commercial LLMs outperform open-source models up to 70B parameters in agent tasks mainly due to better long-term reasoning and instruction following.

Based on the information of the truth and the user’s past questions, you answer the user’s questions

fields

years

verdicts

representative citing papers

citing papers explorer