Run and inspect model evaluations. List eval definitions, view run groups, and check results across models.
Alias: inf evals

List Eval Run Groups

Display eval run groups for a given eval definition.
inf eval list --eval-id <eval-definition-id>
Alias: inf eval ls

Options

Flag                     Required  Description
--eval-id <id>           Yes       The eval definition ID to list runs for
--eval-version-id <id>   No        Filter by a specific eval version

The output table shows the run group ID, version, model count, derived status, and creation date.
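For scripting, the command and its optional version filter can be wrapped in a small shell function. This is a minimal sketch, not part of the CLI: the function name list_eval_runs and the DRY_RUN variable are hypothetical, and only the documented --eval-id and --eval-version-id flags are used.

```shell
# Hypothetical helper (not part of the inf CLI).
# Usage: list_eval_runs <eval-definition-id> [eval-version-id]
list_eval_runs() {
  eval_id="${1:-}"
  version_id="${2:-}"

  # --eval-id is required, so fail early when it is missing.
  if [ -z "$eval_id" ]; then
    echo "usage: list_eval_runs <eval-definition-id> [eval-version-id]" >&2
    return 2
  fi

  # Add --eval-version-id only when a version ID was supplied.
  if [ -n "$version_id" ]; then
    set -- inf eval list --eval-id "$eval_id" --eval-version-id "$version_id"
  else
    set -- inf eval list --eval-id "$eval_id"
  fi

  # With DRY_RUN set (an assumption of this sketch), print the command
  # instead of running it.
  if [ -n "${DRY_RUN:-}" ]; then
    echo "$*"
  else
    "$@"
  fi
}
```

Calling list_eval_runs with an eval definition ID runs the plain listing; passing a second argument narrows it to one eval version. Setting DRY_RUN=1 beforehand shows the assembled command without contacting the service.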

Get Eval Run Group Details

View detailed information about a specific eval run group.
inf eval get <id>

Arguments

Argument  Required  Description
id        Yes       The eval run group ID

Displays a detail view with:
  • Run group ID, eval definition, and version
  • Dataset and judge configuration (provider, model)
  • Model count and creation date
The detail view is followed by a table of individual runs showing the run ID, provider, model, status, average score, and progress (completed/total).
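When several run groups need checking, the get command can be called once per ID. The loop below is a sketch under stated assumptions: show_run_groups and DRY_RUN are hypothetical names, and the run group IDs are whatever you copied from the output of inf eval list.

```shell
# Hypothetical helper (not part of the inf CLI).
# Usage: show_run_groups <run-group-id> [<run-group-id> ...]
show_run_groups() {
  for id in "$@"; do
    if [ -n "${DRY_RUN:-}" ]; then
      # Dry run (an assumption of this sketch): print the command only.
      echo "inf eval get $id"
    else
      # Stop at the first run group that fails to load.
      inf eval get "$id" || return 1
    fi
  done
}
```

Each ID produces its own detail view and runs table, in the order the IDs were given.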

List Eval Definitions

Display all eval definitions in the active project.
inf eval definitions
Alias: inf eval defs

Options

Flag                Description                   Default
--include-archived  Include archived definitions  Off

The output table shows the definition ID, name, latest version number, version count, and creation date.

List Eval Datasets

Display datasets available for evaluations.
inf eval datasets

Options

Flag                Description                Default
-l, --limit <n>     Maximum number of results  50
--include-archived  Include archived datasets  Off

The output table shows the dataset ID, name, inference count, and creation date.
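The two options can be combined in one call. The sketch below wraps them in a function that checks the limit locally before invoking the CLI. The names list_eval_datasets and DRY_RUN, the "archived" keyword, and the digits-only check are assumptions of this example, not behavior of inf itself.

```shell
# Hypothetical helper (not part of the inf CLI).
# Usage: list_eval_datasets [limit] [archived]
list_eval_datasets() {
  limit="${1:-50}"     # 50 mirrors the documented default
  archived="${2:-}"    # pass the word "archived" to add --include-archived

  # Local sanity check: accept digits only.
  case "$limit" in
    *[!0-9]*)
      echo "limit must be a whole number" >&2
      return 2
      ;;
  esac

  set -- inf eval datasets --limit "$limit"
  if [ "$archived" = "archived" ]; then
    set -- "$@" --include-archived
  fi

  # With DRY_RUN set (an assumption of this sketch), print the command
  # instead of running it.
  if [ -n "${DRY_RUN:-}" ]; then
    echo "$*"
  else
    "$@"
  fi
}
```

For example, list_eval_datasets 10 archived requests at most 10 datasets and includes archived ones, while calling it with no arguments falls back to the documented default limit.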