Analysis

Once an evaluation has run, Inspect provides a number of tools for inspecting, analysing, and reviewing the results:

Log Files Read, view, and work with evaluation log files for developing, debugging, and analysing evaluations.
Log Dataframes Extract dataframes of evals, samples, messages, and events from log files.
Scanning Review transcripts to find issues like misconfigured environments, refusals, and evaluation awareness.
Inspect Viz Create high quality, interactive visualisations from Inspect evaluation logs.
Task Views Customise how a task’s samples, scores, and scanner results render in the log viewer.