Reference

Goal taxonomy

The 12 goal types Kalibr uses for classification and routing. Input type, output type, and cognitive load determine the goal_id. The goal_id determines the default path order and the success contract.

Classification table

goal_idInput to OutputLoadDefault path orderSuccess contract
web_scrapingURL to rowslowDeepSeek, Llama, Mixtral, gpt-4o-minifield_completeness >= 0.8, min 1 row
data_enrichmentrows to rowslowDeepSeek, Llama, Qwen, gpt-4o-mininull_rate_after < null_rate_before
lead_scoringtext to scorelowDeepSeek, Llama, Mixtral, gpt-4o-miniscore numeric, in [0, 100]
classificationtext to labellowDeepSeek, Llama, Qwen, gpt-4o-minilabel in allowed_labels
summarizationtext to proselowDeepSeek, Llama, Mixtral, claude-haikucompression ratio 0.05 to 0.4
data_pipelinedata to rowslowDeepSeek, Llama, Qwen, gpt-4o-minirows_out > 0, no exception
researchtext to synthesismediumLlama, DeepSeek, deepseek-r1, claude-sonnetstructural: min 200 chars, no error markers + float judge 20%
outreach_generationrows to contentmediumLlama, DeepSeek, Mixtral, claude-sonnetstructural: subject + body present, 50-2000 chars + float judge 20%
code_generationany to codehighSonnet, GPT-4o, o3-mini, deepseek-r1AST parse passes or tests_pass = True
code_reviewcode to prosehighSonnet, GPT-4o, deepseek-r1, o3-minimin 50 chars of structured feedback
system_designany to prosehighSonnet, deepseek-r1, GPT-4o, o3-minimin 200 chars of structured output
agent_orchestrationmulti to coordinateshighSonnet, GPT-4o, deepseek-r1, o3-minisubtasks_completed = True, no timeout

Default path ordering

The listed order is the cold-start default: cheapest capable model first. Thompson Sampling shifts traffic away from this order based on actual outcomes in your production environment. After roughly 50 outcomes per goal_id, the router has enough signal to start making confident routing decisions. The default order only matters during the cold-start phase.

Skip routing for

Conversational replies, status checks, config changes, memory operations, and simple lookups. These carry no signal worth routing.

Eval rules

Next

Path 2: Agent orchestrator

How routing works