Smolagent GAIA Evaluation Runner
Instructions:
- Log in to your Hugging Face account using the button below.
- Click 'Process Questions' to run the agent on all questions and save answers.
- After processing is complete, click 'Submit Answers' to submit the answers to the evaluation server.
Note: Processing questions will take time as the agent processes each question. The agent is specifically configured to format answers according to the GAIA benchmark requirements:
- Numbers: No commas, no units
- Strings: No articles, no abbreviations
- Lists: Comma-separated values following the above rules
Separating processing and submission helps avoid losing work due to rate limiting or other errors.
Questions and Agent Answers