HoneyHive Docs

Charts

Define and manage saved charts. Charts are visualizations that aggregate metrics over time with bucketing, filters, and groupings. (5 commands)

Manage individual records inside datasets, including batch creation and mapping to source events. (6 commands)

Curate collections of datapoints used as test sets for evaluations and experiments. (6 commands)

Read and write trace events. Events are the spans that capture every step of an AI application’s execution. (4 commands)

Run, retrieve, and compare evaluation runs to measure how prompt or configuration changes affect agent performance. (11 commands)

Snapshot, list, and deploy versions of a metric’s definition so changes can be reviewed and rolled back without losing history. (3 commands)

Define and run evaluators, i.e. automated quality checks that score traces against criteria like accuracy, safety, or correctness. (5 commands)

Group related trace events into sessions, the top-level container for a multi-step or multi-service AI interaction. (2 commands)

⌘I