Braintrust Integration
Everruns integrates with Braintrust to provide LLM observability, evaluation, and trace visualization for your agentic workflows.
What You Get
Section titled “What You Get”- Trace Visualization: See complete agent turns as hierarchical traces
- Token Usage Tracking: Monitor input/output tokens and prompt cache efficiency
- Performance Metrics: Time-to-first-token, LLM call duration, tool execution times
- Debug Workflows: Inspect multi-step agent reasoning with tool calls
Quick Start
Section titled “Quick Start”1. Get Your API Key
Section titled “1. Get Your API Key”- Sign up at braintrust.dev
- Go to Settings → API Keys
- Create a new API key
2. Configure Everruns
Section titled “2. Configure Everruns”Set environment variables:
# Requiredexport BRAINTRUST_API_KEY=sk-bt-your-api-key
# Recommended: specify your project nameexport BRAINTRUST_PROJECT_NAME="My Project"| Variable | Required | Default | Description |
|---|---|---|---|
BRAINTRUST_API_KEY | Yes | - | API key from Braintrust settings |
BRAINTRUST_PROJECT_NAME | No | My Project | Project name for organizing traces |
BRAINTRUST_PROJECT_ID | No | - | Direct project UUID (skips name lookup) |
BRAINTRUST_API_URL | No | https://api.braintrust.dev | API base URL |
3. View Traces
Section titled “3. View Traces”- Open the Braintrust dashboard
- Navigate to your project
- Go to Logs to see incoming traces
Trace Hierarchy
Section titled “Trace Hierarchy”Each agent turn creates a trace with the following structure:
agent turn (root span)├── reason (iteration 1)│ └── llm.generation (gpt-4o)├── act (iteration 1)│ ├── tool.call (search)│ └── tool.call (fetch)├── reason (iteration 2)│ └── llm.generation (gpt-4o)└── (no more tool calls - turn complete)Span Types
Section titled “Span Types”| Span | Type | Description |
|---|---|---|
| Agent Turn | task | Root span for the entire user request |
| Reason | task | LLM reasoning phase (may iterate) |
| Act | task | Tool execution phase |
| LLM Generation | llm | Individual LLM API call |
| Tool Call | tool | Individual tool execution |
Metrics Captured
Section titled “Metrics Captured”LLM Generations
Section titled “LLM Generations”prompt_tokens- Input token countcompletion_tokens- Output token countcache_read_tokens- Tokens read from prompt cache (Claude)cache_creation_tokens- Tokens written to prompt cache (Claude)time_to_first_token- Time until first token receivedduration_ms- Total LLM call duration
Tool Calls
Section titled “Tool Calls”status- Success/failureduration_ms- Execution timeerror- Error message (on failure)
Troubleshooting
Section titled “Troubleshooting”Traces Not Appearing
Section titled “Traces Not Appearing”- Check API key: Verify
BRAINTRUST_API_KEYis set correctly - Check logs: Look for “Braintrust listener initialized” at startup
- Project name mismatch: If
BRAINTRUST_PROJECT_NAMEdoesn’t match an existing project in Braintrust, logs will be silently dropped. Verify the project name exists in your Braintrust dashboard.