Concept: Agent Evaluation | Python:
client.agent.evaluate_tool_call()Parameters
Name of the tool being called.
Parameters being passed to the tool.
The agent’s role and current task. Object with optional fields:
goal (string), prior_tool_calls (array of objects), agent_id (string), turn_index (integer).Domain hint used to tune evaluation (e.g.
finance, healthcare).Evaluation depth:
basic or deep.Frameworks to check the call against (e.g.
india_dpdp, gdpr).Override decision thresholds. Object with optional fields:
block_below (number 0-10), flag_below (number 0-10), dimension_minimums (object of dimension → minimum score).Request
Response
"ALLOW", "FLAG", or "BLOCK". A BLOCK decision returns HTTP 403; ALLOW/FLAG return 200.Human-readable explanation for the decision.
score (0-10), confidence (0-1), and a summary string.Safer parameter values when the engine can propose them, otherwise
null.