Welcome to Quizbowl Bot! This is a tool for creating and testing quizbowl agents.
User Submissions and Examples
Create a pipeline that takes in the following input variables and outputs the following output variables:
- Input Variables:
question_text
- Output Variables:
answer
,confidence
Stop if already buzzed
Model Outputs
User Submissions and Examples
Create a pipeline that takes in the following input variables and outputs the following output variables:
- Input Variables:
leadin
,part
- Output Variables:
answer
,confidence
,explanation
Model Outputs
Build an open-domain QA system that can answer any question posed by humans! For more: https://sites.google.com/view/qanta/home
mock-claude | 0.3694 | 0.3993 | 0.3688 |
mock-claude | 0.3694 | 0.3993 | 0.3688 |
mock-gemini | 0.3556 | 0.323 | 0.2875 |
mock-gpt-4o | 0.371 | 0.3868 | 0.3312 |
mock-gpt4.5 | 0.3558 | 0.3 | 0.2604 |
mock-gpt4 | 0.3694 | 0.3473 | 0.2812 |
mock-grok-3 | 0.3633 | 0.4125 | 0.345 |
mock-grok | 0.3646 | 0.4 | 0.3625 |
Quick Start
- Choose between Tossup or Bonus mode
- Design your pipeline
- Test on example questions
- Submit for evaluation
Competition Rules
🧠 Tossup Questions
- Format: Individual questions progressively revealed. Questions get easier as they are revealed.
- Scoring:
- Correct early buzz: +10 points
- Incorrect early buzz: -5 points
- Correct after full read: +10 points
- Required Outputs:
answer
: Your predicted answerconfidence
: Score between 0-1buzzer
: When to attempt answering- Configure with confidence threshold (0.0-1.0)
- Optional token probability threshold for more control
- Combine thresholds using AND/OR logic (buzz when both/either condition met)
🎁 Bonus Questions
- Format:
- Consists of a
leadin
paragraph that introduces the topic - Followed by three related
parts
(A, B, C) that test specific knowledge - Each part is worth 10 points
- Consists of a
- Scoring: +10 points per correct part (max 30)
- Required Outputs:
answer
: Your predicted answerconfidence
: Score between 0-1explanation
: Brief justification for human collaboration
Building Your First Pipeline
1. Simple Pipeline (Recommended for First Submission)
- Single model step
- Configure:
- Model selection
- Temperature (0.0-1.0)
- System prompt
- Required outputs
2. Using Demo Pipelines
- Load existing demo pipelines as starting points
- Modify configurations:
- Adjust model parameters
- Update system prompts
- Change confidence thresholds
- Add/remove pipeline steps
- Save modified versions as new pipelines
- Test changes incrementally
3. Testing Your Pipeline
- Select an example question
- For Tossup:
- Set buzz threshold (0.5-1.0)
- Enable early stopping
- Run and check:
- Answer accuracy
- Confidence scores
- Performance metrics
4. Evaluation
- Test on multiple questions
- Monitor:
- Accuracy
- Confidence patterns
- Response times
5. Submission
- Log in to Hugging Face
- Name your model
- Add description
- Submit for evaluation
Tips for Success
- Start with simple pipeline
- Test thoroughly before submission
- Use appropriate temperature (0.3-0.7 recommended)
- Monitor confidence scores
- Check example submissions
Need Help?
- Review example submissions
- Check documentation
- Contact support