Smolagent GAIA Evaluation Runner

Instructions:

  1. Log in to your Hugging Face account using the button below.
  2. Click 'Process Questions' to run the agent on all questions and save answers.
  3. After processing is complete, click 'Submit Answers' to submit the answers to the evaluation server.

Note: Processing takes time because the agent runs on every question. The agent is configured to format answers according to the GAIA benchmark requirements:

  • Numbers: No commas, no units
  • Strings: No articles, no abbreviations
  • Lists: Comma-separated values following the above rules
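
As a rough illustration of the three rules above, a post-processing step might look like the sketch below. It is not the runner's actual code: the names `normalize_number`, `normalize_string`, and `normalize_answer`, and the explicit `kind` argument, are hypothetical. It also does not expand abbreviations, which would need a lookup table or the agent's own prompt.

```python
import re

# English articles to drop from string answers (assumed list).
ARTICLES = {"a", "an", "the"}


def normalize_number(text: str) -> str:
    """Drop thousands separators, then keep only the first numeric token."""
    cleaned = text.replace(",", "")
    match = re.search(r"-?\d+(?:\.\d+)?", cleaned)
    # Fall back to the stripped input if no number is found.
    return match.group(0) if match else text.strip()


def normalize_string(text: str) -> str:
    """Remove articles and collapse whitespace; abbreviations are left as-is."""
    words = [w for w in text.split() if w.lower() not in ARTICLES]
    return " ".join(words)


def normalize_answer(text: str, kind: str) -> str:
    """Apply the rule for the given answer kind: 'number', 'string', or 'list'."""
    if kind == "number":
        return normalize_number(text)
    if kind == "string":
        return normalize_string(text)
    if kind == "list":
        items = []
        for item in text.split(","):
            # Heuristic: an item starting with a digit (optionally after a
            # sign or a symbol such as '$') is treated as a number.
            looks_numeric = re.match(r"\s*[^\w\s]?-?\d", item) is not None
            items.append(
                normalize_number(item) if looks_numeric else normalize_string(item)
            )
        return ", ".join(items)
    raise ValueError(f"unknown answer kind: {kind!r}")


print(normalize_answer("1,234 kg", "number"))                 # 1234
print(normalize_answer("The Eiffel Tower", "string"))         # Eiffel Tower
print(normalize_answer("the cat, 1.5 m, An apple", "list"))   # cat, 1.5, apple
```

The caller has to state the answer kind because "1,000" is ambiguous between a number and a two-item list. The article filter is also blunt: it would drop the "A" in "Vitamin A", so a real pipeline would likely need more care.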

Separating processing and submission helps avoid losing work due to rate limiting or other errors.
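
One way to get this separation is to checkpoint answers to disk after every question, so a failed run or a rejected submission does not force reprocessing. The sketch below assumes a JSON file and a callable agent. The function names and the `task_id` / `question` / `submitted_answer` field names are assumptions for illustration, not necessarily what this runner or the evaluation server uses.

```python
import json
from pathlib import Path


def load_answers(path: Path) -> list[dict]:
    """Return previously saved answers, or an empty list if none exist."""
    if not path.exists():
        return []
    return json.loads(path.read_text(encoding="utf-8"))


def save_answers(answers: list[dict], path: Path) -> None:
    """Write all answers collected so far to disk."""
    path.write_text(json.dumps(answers, indent=2), encoding="utf-8")


def process_questions(questions: list[dict], agent, path: Path) -> list[dict]:
    """Run the agent on unanswered questions, saving after each one."""
    answers = load_answers(path)
    done = {a["task_id"] for a in answers}
    for q in questions:
        if q["task_id"] in done:
            continue  # answered in an earlier run
        answers.append(
            {"task_id": q["task_id"], "submitted_answer": agent(q["question"])}
        )
        save_answers(answers, path)  # checkpoint so a crash loses at most one answer
    return answers
```

Submission can then read the saved file with `load_answers` and send it in a separate step, retrying on rate limits without touching the agent again.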

Questions and Agent Answers
