Comments (1)
Hi, @Tangent-90C this is expected. It stems from some issues in the Mind2Web environment itself and can be ignored for now.
from agentbench.
Related Issues (20)
- [Bug/Assistance] When testing the kg-std task, every status in the output file is task limit reached HOT 1
- [Bug/Assistance] Questions in the runs.jsonl file from a kg-std run cannot be found in the dataset HOT 4
- [Feature] Use for benchmarking agents like AutoGPT? HOT 1
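- How can I solve this problem? I am running mind2web and am not sure how to operate this task; could you give some concrete guidance? Thanks HOT 17
- The Card_Game task fails to run HOT 4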
- Benchmark for mistral models HOT 1
- Connection error HOT 3
- Add evaluation for Claude3 HOT 2
- [Bug/Assistance] - Reproducing Results on Alfworld (HH) (vs. ReAct paper) HOT 4
- OS std test set results HOT 1
- Excellent Job! Well, no offense, it seems LLM-Bench rather than AgentBench in essence. HOT 1
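- Is testing with OpenAI's tool_call interface supported? HOT 1
- How can I test with a local llama-2-hf model? I would appreciate some clear guidance! [Bug/Assistance] HOT 1
- [Feature] How is the score for each task calculated? For example, the OS task yields only an accuracy, but Table 3 in the paper lists a score for each task. I could not find the mapping between the two in the paper; could you give a hint? HOT 1
- Would llama3, wizardlm2, and other recent models be tested and published on the leaderboard? (Request to add test results for llama3, wizardlm, and other large models from April-May 2024) HOT 3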
- [Feature] Add a LICENSE to the project HOT 2
- Are the trajectories publicly available?
- urgent - if one of the problems throws an error, why does overall.json not show up??
- DBbench-std task with error "Can't connect to MySQL server" HOT 1