AI Phone Assistant A–Z

Glossary

Voice AI, telephony and GDPR terms — explained concisely for decision makers and engineering teams.

AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z — AI Phone Assistant A–Z —

DPA (Data Processing Agreement)

Contract under Art. 28 GDPR between controller and processor. Defines purpose, scope and safeguards of data processing. Mandatory for any SaaS handling personal data.

Barge-In

Ability of callers to interrupt the assistant mid-sentence. Considered a marker of natural conversation; implemented via parallel STT with voice-activity detection.

CRM Integration

Connection between phone assistant and customer-relationship system (HubSpot, Salesforce, Pipedrive). Auto-creates contacts and activities after each call.

Related:Webhook

EU Data Residency

Storage and processing of data exclusively in EU data centers. Reduces post-Schrems-II third-country transfer risk and is a precondition for many GDPR-compliant deployments.

Inbound vs. Outbound

Inbound = assistant takes incoming calls (booking, support). Outbound = assistant places calls (confirmations, surveys). Compliance requirements differ between modes.

IVR (Interactive Voice Response)

Legacy voice menu system with keypad input ("press 1 for ..."). AI phone assistants replace IVR with free-form speech and intent recognition.

RAG (Retrieval-Augmented Generation)

Architecture where the LLM fetches relevant documents from a knowledge base before answering. Enables up-to-date, company-specific responses without fine-tuning.

SIP Trunk

Internet-based phone line forwarding numbers to the AI assistant. Standard VoIP protocol. Frequently ported from existing carriers (Deutsche Telekom, Sipgate, Vodafone).

SLA (Service Level Agreement)

Contractually guaranteed service quality: availability (e.g. 99.9 %), response time, recovery time. Mandatory for business-critical deployments, often with penalties on breach.

Voice Cloning

Synthesis of a voice from a sample (typically 30 s–10 min). Enables consistent brand voice. Requires GDPR and consent-law review before deployment.

Webhook

HTTP callback that notifies a third-party system on call events (call ended, appointment booked). Most common integration technique alongside direct APIs.

DNC List (Do-Not-Call)

List of phone numbers that may not be called outbound. In Germany effectively enforced via consent rules under UWG; in Switzerland the asterisk entry in the phone book (Art. 3 lit. u UWG/CH).

DPIA (Data Protection Impact Assessment)

Risk assessment under Art. 35 GDPR. Mandatory for high-risk processing (e.g. systematic call recording). Structured risk and mitigation analysis before go-live.

EU AI Act

EU Regulation 2024/1689 governing AI systems. Tiered model from "minimal" to "unacceptable" risk. AI phone assistants typically classify as "limited risk" with a transparency duty (disclose AI to caller).

Function Calling / Tool Use

LLM capability to emit structured function calls instead of free text (e.g. bookAppointment(tuesday, 10:00)). Foundation of reliable CRM/EHR integration in phone contexts.

MOS (Mean Opinion Score)

Audio-quality rating on a 1–5 scale, originally from human listeners, today commonly via POLQA/PESQ algorithms. MOS ≥ 4.0 is considered telephony-grade.

Number Porting

Carrying an existing phone number when switching providers. Legally guaranteed in Germany (§ 59 TKG). Typical lead time 5–15 working days depending on the losing carrier.

Opt-In

Explicit, prior consent to calls or data processing. Required for outbound marketing calls (§ 7 (2) UWG in Germany). Must be documented, granular and revocable.

Prompt Injection (Voice)

Attack technique where a caller tries to override the system prompt ("Ignore your instructions…"). Voice-specific hardening: allowlists, tool-use validation, refuse tool calls on anomaly.

PSTN (Public Switched Telephone Network)

The legacy landline network with central exchanges. AI assistants reach it via SIP trunks and gateways. PSTN reachability is what makes E.164 delivery globally meaningful.

Realtime API

Streaming interfaces (e.g. OpenAI Realtime, Google Live API) that process audio directly — without the STT→text→TTS intermediate step. Reduces latency below 500 ms.

SSML (Speech Synthesis Markup Language)

XML markup for TTS: pronunciation, pauses, emphasis, phone-number breakdown. W3C standard. Essential for clean pronunciation of domain terms and foreign proper names.

TKG (German Telecommunications Act)

German statute regulating telecommunications. Relevant for AI assistants: § 7 UWG on advertising, § 9a TKG on traffic data, § 59 TKG on number porting.

Related:GDPROpt-In

Turn-Taking

Management of speaker/listener role switches. Beyond raw barge-in: detection of pauses, backchannels ("mhm"), avoidance of double-talk. Key to conversational naturalness.

UWG (German Unfair Competition Act)

German statute governing, among other things, advertising calls. § 7 (2) UWG prohibits cold calls to private individuals without explicit consent. Fines up to € 300,000 per violation.

VAD (Voice Activity Detection)

Detection of whether the audio currently contains speech or just silence/background noise. Prerequisite for barge-in, turn-taking and efficient STT (no processing during silence).

Voicemail Detection (AMD)

Detection of whether an outbound call was answered by a human or a voicemail box. Also Answering Machine Detection. Latency and accuracy (typ. 90–97 %) are the trade-off.

Ready for the next step?

Connect these terms to concrete solutions for your industry.

As of 3 May 2026. Definitions are reviewed quarterly. · v2026-05-03
🍪

Cookies & Privacy

We use cookies to provide you with the best possible experience on our website. Some of them are technically necessary, others help us improve the website.