What is Call Simulation?
Testing voice AI manually is slow, incomplete, and nobody’s favourite task. Call Simulation runs real voice calls with your agent so you know it’s ready, before launch, after every update, and every time something changes. You define what your agent does. Tuner builds the scenarios that stress test it, connects a real voice call to your agent, and scores every call against the same Evals you’ve configured for production. Simulation works in both call directions: inbound (Tuner dials your agent over SIP) and outbound (Tuner triggers your endpoint, and your agent dials Tuner’s simulation agent).
How It Works
Connect your agent
A one-time setup that depends on your agent’s call direction, pick yours:
- Inbound (your agent receives calls)
- Outbound (your agent places calls)
Tuner dials into your agent over SIP, like a real customer calling in. You paste your agent’s SIP URI (and credentials, if any) into Agent Settings → SIP Settings.
Full setup steps: Inbound Simulation, SIP setup.

Configure your Simulation Mix
Open Run Simulation and set the number of calls (5–20 per batch), adjust the Routine Callers vs Pressure Tests mix, and narrow which intents or evals each side targets.

Choose language and accent
Click Change next to the language row to pick which language and accent the simulation agents use. Test how your agent handles different locales without waiting for live traffic from each region.

Select Pressure Test targets
Click Narrow down under Pressure tests target to focus on specific failure modes, hallucination, scope boundary, escalation handling, offensive language, personal data, and more. Unselected evals won’t generate scenarios.

The Simulation Mix
You control what gets tested. Run a full health check or narrow down to a single Intent, Call Outcome, or eval. Every run is yours to configure.| Routine Callers | Pressure Tests | |
|---|---|---|
| What they do | Simulation agents that play your everyday customers, cooperative, on-topic, working through your agent’s normal workflows | Simulation agents that play the difficult ones, unhappy customers, edge cases, people trying to bypass your process or ask for something outside scope |
| What you learn | How your agent performs on the happy path when everything goes the way it’s supposed to | Where your agent has a breaking point, hallucination, scope boundary, escalation handling, and other failure modes |
| How they’re generated | From your agent’s configured Intents and Call Outcomes | Auto-generated from your configured evals |
Why volume matters
In voice, the same intent said ten different ways produces ten different conversations. Rushed, hesitant, overly detailed, unclear, small differences in how a customer speaks can completely change how your agent responds. The more simulations you run, the more of these variations you cover.Languages and accents
In the Run Simulation dialog, click Change on the language row to choose the language and accent for your simulation agents. Supported languages include Arabic, English, French, German, Italian, and Spanish. For English, you can also pick a regional accent: AU, UK, or US. Use this to stress-test multilingual agents or confirm your agent still performs when customers speak with a different accent than your default setup.Your Production Evals Run Every Simulation
No separate scoring. No extra configuration. Every simulation call is evaluated against the exact same Evals, Call Outcomes, and Intents you’ve already set up for your live agent.- Same Evals, every run. Whatever you’ve configured to catch failures in production, hallucinations, missed Intents, wrong Call Outcomes, runs automatically on every simulation call. What fails in production will fail here first.
- See failures before your customers do. Every run surfaces exactly what passed and what failed. Fix the failures, adjust your agent, and run again.
- Regression built-in. Changed a prompt? Updated a workflow? Every time you run a simulation, you’re also checking that nothing you already fixed has broken again.
Getting Started
Pick the path that matches your agent’s call direction:Inbound Simulation
Your agent receives calls. Tuner dials your agent over SIP, configure a SIP URI and credentials in Agent Settings → SIP Settings. Guides for Vapi, Retell, Twilio, Telnyx, LiveKit Cloud SIP, and any SIP URI you already have.
Outbound Simulation
Your agent places calls. Tuner triggers your webhook with a SIP URI to dial, and your agent calls Tuner’s simulation agent. Configure it in Agent Settings → Simulation Setup, guides for LiveKit, Pipecat, and Custom API.
FAQ
How long does a simulation run take?
How long does a simulation run take?
Each simulated call takes 1–4 minutes depending on scenario complexity. A batch of 10 calls typically completes in 5–10 minutes.
Does simulation cost credits?
Does simulation cost credits?
Yes. Each simulated call consumes credits from your balance. The rate is shown in the Run Simulation dialog before you start.
Can I run simulation on an agent that's already in production?
Can I run simulation on an agent that's already in production?
Yes. Simulation calls are separate from production traffic and don’t affect your production metrics.
What providers are supported?
What providers are supported?
Inbound: managed platforms Vapi and Retell, plus any SIP-capable provider (Other platforms, including Twilio, Telnyx, LiveKit Cloud SIP, etc.) if you paste the correct URI and credentials into Agent Settings → SIP Settings. See Inbound Simulation, SIP setup.Outbound: any stack that can place a call to a SIP URI, with guides for LiveKit, Pipecat, and Custom API. See Outbound Simulation.Contact support if you are unsure which path fits.

