Execute agent-tool-trajectory evaluator

curl --request POST \
  --url https://api.traceloop.com/v2/evaluators/execute/agent-tool-trajectory \
  --header 'Authorization: Bearer <api-key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "input": {
    "executed_tool_calls": "[{\"name\": \"search\", \"input\": {\"query\": \"weather\"}}]",
    "expected_tool_calls": "[{\"name\": \"search\", \"input\": {\"query\": \"weather\"}}]"
  },
  "config": {
    "input_params_sensitive": true,
    "mismatch_sensitive": false,
    "order_sensitive": false,
    "threshold": 0.5
  }
}
'

{
  "reason": "Tool calls match the expected trajectory",
  "result": "pass",
  "score": 0.85
}


Compares the tool calls an agent actually made against a list of expected (reference) tool calls.

Request Body:

input.executed_tool_calls (string, required): JSON array, encoded as a string, of the tool calls the agent actually made
input.expected_tool_calls (string, required): JSON array, encoded as a string, of the expected (reference) tool calls
config.threshold (float, optional): Score threshold for pass/fail determination (default: 0.5)
config.mismatch_sensitive (bool, optional): Whether tool calls must match exactly (default: false)
config.order_sensitive (bool, optional): Whether order of tool calls matters (default: false)
config.input_params_sensitive (bool, optional): Whether to compare input parameters (default: true)
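
Both `input` fields are strings that contain JSON, not nested arrays, so the tool-call lists have to be serialized before they go into the body. The following is a minimal Python sketch of that step. The helper name `build_payload` is hypothetical and not part of any Traceloop SDK; its keyword defaults simply mirror the defaults listed above.

```python
import json


def build_payload(executed, expected, *, threshold=0.5, mismatch_sensitive=False,
                  order_sensitive=False, input_params_sensitive=True):
    """Assemble the request body (hypothetical helper, not an SDK function)."""
    return {
        "input": {
            # Each field is a JSON array encoded as a string, not a nested list.
            "executed_tool_calls": json.dumps(executed),
            "expected_tool_calls": json.dumps(expected),
        },
        "config": {
            "threshold": threshold,
            "mismatch_sensitive": mismatch_sensitive,
            "order_sensitive": order_sensitive,
            "input_params_sensitive": input_params_sensitive,
        },
    }


calls = [{"name": "search", "input": {"query": "weather"}}]
payload = build_payload(calls, calls)
body = json.dumps(payload)  # string suitable for use as the POST body
```

Serializing the lists with `json.dumps` avoids hand-escaping the inner quotes, which is the part of the curl example that is easiest to get wrong.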

POST /evaluators/execute/agent-tool-trajectory


Authorizations

Authorization (string, header, required): The word "Bearer", followed by a space and your JWT token.
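
As an illustration of the header described above, the sketch below prepares the POST request with Python's standard library without sending it. The helper name `build_request` and the placeholder key `YOUR_API_KEY` are assumptions made for this example; the URL is the one used in the curl example.

```python
import json
import urllib.request

URL = "https://api.traceloop.com/v2/evaluators/execute/agent-tool-trajectory"


def build_request(api_key, payload):
    """Prepare (but do not send) the POST request; hypothetical helper."""
    return urllib.request.Request(
        URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # "Bearer", a space, then the token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


tool_calls = json.dumps([{"name": "search", "input": {"query": "weather"}}])
request = build_request(
    "YOUR_API_KEY",  # placeholder, replace with a real key
    {"input": {"executed_tool_calls": tool_calls,
               "expected_tool_calls": tool_calls}},
)
# To send it, pass the object to urllib.request.urlopen(request).
```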

Body

Content type: application/json

input (object, required)
config (object, optional)

Response

reason (string): Example: "Tool calls match the expected trajectory"
result (string): Example: "pass"
score (number): Example: 0.85
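
The three response fields can be read into a small typed structure. The sketch below is one possible way to do that in Python; the names `TrajectoryResult` and `parse_result` are hypothetical, and treating `"pass"` as the success value is an assumption based only on the example response shown here.

```python
import json
from dataclasses import dataclass


@dataclass
class TrajectoryResult:
    reason: str
    result: str
    score: float

    @property
    def passed(self):
        # Assumption: "pass" marks success, as in the example response.
        return self.result == "pass"


def parse_result(raw):
    """Parse a response body (JSON text) into a TrajectoryResult."""
    data = json.loads(raw)
    return TrajectoryResult(
        reason=data["reason"],
        result=data["result"],
        score=float(data["score"]),
    )


example = (
    '{"reason": "Tool calls match the expected trajectory", '
    '"result": "pass", "score": 0.85}'
)
parsed = parse_result(example)
```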
