Trim to a budget, keep the variety.

When you shrink a trajectory set to a fixed budget, the common move is to keep the shortest runs. But short traces are the least diverse: you throw away procedural variety. procgrep selects for diversity instead, preserving far more of the dataset's distinct procedures at the same budget.
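To make the contrast concrete, here is a minimal sketch of the two strategies. It is not procgrep's actual selection algorithm, which this page does not describe. It assumes a trajectory is a list of action names, that a "procedure" can be approximated by the set of adjacent action pairs (bigrams), and that the budget is counted in total steps. The function names `select_shortest`, `select_diverse`, and `coverage` are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Trajectory = List[str]  # assumption: a trajectory is a list of action names


def bigrams(traj: Trajectory) -> Set[Tuple[str, str]]:
    """Adjacent action pairs, used here as a stand-in for 'procedures'."""
    return set(zip(traj, traj[1:]))


def select_shortest(trajs: Dict[str, Trajectory], budget: int) -> List[str]:
    """Baseline: take the shortest runs first until the step budget is spent."""
    chosen, used = [], 0
    for tid in sorted(trajs, key=lambda t: (len(trajs[t]), t)):
        if used + len(trajs[tid]) <= budget:
            chosen.append(tid)
            used += len(trajs[tid])
    return chosen


def select_diverse(trajs: Dict[str, Trajectory], budget: int) -> List[str]:
    """Greedy coverage: repeatedly take the run that adds the most
    not-yet-seen bigrams per step, as long as it still fits the budget."""
    chosen: List[str] = []
    covered: Set[Tuple[str, str]] = set()
    used = 0
    remaining = set(trajs)
    while remaining:
        best, best_gain = None, 0.0
        for tid in sorted(remaining):  # sorted for deterministic tie-breaking
            cost = len(trajs[tid])
            if cost == 0 or used + cost > budget:
                continue
            gain = len(bigrams(trajs[tid]) - covered) / cost
            if gain > best_gain:
                best, best_gain = tid, gain
        if best is None:  # nothing fits, or nothing adds new bigrams
            break
        chosen.append(best)
        covered |= bigrams(trajs[best])
        used += len(trajs[best])
        remaining.remove(best)
    return chosen


def coverage(trajs: Dict[str, Trajectory], ids: List[str]) -> int:
    """Number of distinct bigrams across the selected runs."""
    covered: Set[Tuple[str, str]] = set()
    for tid in ids:
        covered |= bigrams(trajs[tid])
    return len(covered)
```

On a toy set with three identical two-step runs and one six-step run, a budget of six steps lets the baseline keep all three short runs, which share a single bigram, while the greedy variant keeps the one long run and its five distinct bigrams.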

How each model actually behaves.

The same dataset, broken down by the model that produced each trajectory: how long its runs are, how much it repeats itself, and the mix of actions it favours.
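A per-model breakdown of this kind could be computed roughly as follows. This is a sketch under stated assumptions, not procgrep's own code: each record is assumed to be a dict with a `model` string and an `actions` list, "repetition" is taken to mean the share of steps that repeat the immediately preceding action, and `profile_by_model` is a hypothetical name.

```python
from collections import Counter, defaultdict
from typing import Any, Dict, Iterable, List


def profile_by_model(records: Iterable[Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Group trajectories by model and summarise length, repetition, action mix."""
    grouped: Dict[str, List[List[str]]] = defaultdict(list)
    for rec in records:
        # Assumed record shape: {"model": str, "actions": [str, ...]}
        grouped[rec["model"]].append(rec["actions"])

    profiles: Dict[str, Dict[str, Any]] = {}
    for model, runs in grouped.items():
        total_steps = sum(len(r) for r in runs)
        # A step counts as a repeat when it equals the step right before it.
        repeats = sum(
            1 for r in runs for prev, cur in zip(r, r[1:]) if prev == cur
        )
        counts = Counter(a for r in runs for a in r)
        profiles[model] = {
            "runs": len(runs),
            "mean_length": total_steps / len(runs),
            "repeat_rate": repeats / total_steps if total_steps else 0.0,
            "action_mix": (
                {a: n / total_steps for a, n in counts.items()}
                if total_steps
                else {}
            ),
        }
    return profiles
```

Other definitions of repetition, such as repeated n-grams or loops over longer windows, would fit the same structure by swapping the `repeats` computation.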

The agent-trace ecosystem.

Every public trajectory dataset procgrep discovered on Hugging Face, and what it can parse today. Most non-parseable entries aren't failures. They're benchmarks (task definitions, no trajectories), corpora, or out-of-scope domains.

The index lists each dataset with its status, downloads, likes, and last-updated date.
