← Back to catalog

One-shot Tool Call

Quick tool-call benchmark for gpt-5.4: 10 of 10 tests passed across Bash Execution and File Operations with no misfires and 100% first-attempt accuracy.

OK

Files