Compare actual tool calls against expected reference tool calls
Request Body:
input.executed_tool_calls (string, required): JSON array of actual tool calls made by the agentinput.expected_tool_calls (string, required): JSON array of expected/reference tool callsconfig.threshold (float, optional): Score threshold for pass/fail determination (default: 0.5)config.mismatch_sensitive (bool, optional): Whether tool calls must match exactly (default: false)config.order_sensitive (bool, optional): Whether order of tool calls matters (default: false)config.input_params_sensitive (bool, optional): Whether to compare input parameters (default: true)