Vizpy

Big-Bench Hard

27 challenging reasoning subtasks

Big-Bench Hard (BBH)

Difficulty: Advanced | File: examples/03_bbh.py

Collection of 27 challenging reasoning tasks.

Usage

python examples/03_bbh.py --lm anthropic/claude-haiku-4-5-20251001 --demo

On this page