inspect score

Score a previous evaluation run.

Usage

inspect score [OPTIONS] LOG_FILE

Options

| Name | Type | Description | Default |
|------|------|-------------|---------|
| --scorer | text | Scorer to use for scoring | None |
| -S | text | One or more scorer arguments (e.g. -S arg=value) | None |
| --action | choice (append, overwrite) | Whether to append or overwrite the existing scores. | None |
| --overwrite | boolean | Overwrite log file with the scored version | False |
| --output-file | file | Output file to write the scored log to. | None |
| --stream | text | Stream the samples through the scoring process instead of reading the entire log into memory. Useful for large logs. Set to an integer to limit the number of concurrent samples being scored. | False |
| --log-level | choice (debug, trace, http, info, warning, error, critical, notset) | Set the log level (defaults to 'warning') | warning |
| --log-dir | text | Directory for log files. | ./logs |
| --display | choice (full, conversation, rich, plain, log, none) | Set the display type (defaults to 'full') | full |
| --traceback-locals | boolean | Include values of local variables in tracebacks (note that this can leak private data, e.g. API keys, so it should typically only be enabled for targeted debugging). | False |
| --env | text | Define an environment variable, e.g. --env NAME=value (--env can be specified multiple times) | None |
| --debug | boolean | Wait to attach debugger | False |
| --debug-port | integer | Port number for debugger | 5678 |
| --debug-errors | boolean | Raise task errors (rather than logging them) so they can be debugged. | False |
| --help | boolean | Show this message and exit. | False |
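
As a rough illustration of how these options combine, the sketch below assembles an `inspect score` command line in Python. The helper `build_score_command`, the log paths, the scorer name `match`, and the scorer argument `location=end` are illustrative assumptions rather than anything taken from this reference. It also assumes `--overwrite` is passed as a bare flag and that options may precede `LOG_FILE`, as the usage line suggests.

```python
import shlex
from typing import Mapping, Optional


def build_score_command(
    log_file: str,
    scorer: Optional[str] = None,
    scorer_args: Optional[Mapping[str, str]] = None,
    action: Optional[str] = None,
    output_file: Optional[str] = None,
    overwrite: bool = False,
) -> list[str]:
    """Assemble an argv list for `inspect score` (hypothetical helper)."""
    # --action is documented as a choice between these two values.
    if action is not None and action not in ("append", "overwrite"):
        raise ValueError("action must be 'append' or 'overwrite'")

    cmd = ["inspect", "score"]
    if scorer is not None:
        cmd += ["--scorer", scorer]
    # Each scorer argument becomes its own `-S name=value` pair.
    for name, value in (scorer_args or {}).items():
        cmd += ["-S", f"{name}={value}"]
    if action is not None:
        cmd += ["--action", action]
    if output_file is not None:
        cmd += ["--output-file", output_file]
    if overwrite:
        # Assumed to be a bare boolean flag that takes no value.
        cmd.append("--overwrite")
    # LOG_FILE is the positional argument and goes last.
    cmd.append(log_file)
    return cmd


# Illustrative values only: paths, scorer name, and scorer argument are assumptions.
cmd = build_score_command(
    "logs/example.eval",
    scorer="match",
    scorer_args={"location": "end"},
    action="append",
    output_file="logs/example-rescored.eval",
)
print(shlex.join(cmd))
```

To actually run the command, the list could be handed to `subprocess.run(cmd, check=True)`, assuming the `inspect` executable is on the `PATH`. Building an argv list instead of a single string avoids shell quoting issues when log paths contain spaces.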