What are real-time voice agents and why are they different from ordinary chatbots
Real-time voice agents in Microsoft Copilot Studio represent a significant shift from classic interactive voice responses (IVR) that we encounter in bank lines or call centers. Instead of selecting options from a menu using phone numbers, a customer can simply talk — and the AI responds immediately in its own voice.
While traditional chatbots often work with text or predefined phrases, real-time voice agents use large language models (LLM) and advanced speech processing. Thanks to this, they understand natural speech with accents, dialects, and incomplete sentences. They can maintain the context of the entire conversation, recall previously mentioned information, and smoothly transition between topics. This makes them a tool suitable for more complex customer scenarios where classic tree menus fail.
Key capabilities that Microsoft brings to the table
According to Microsoft's official documentation, real-time voice agents bring seven key functions:
- Natural language understanding — understanding the caller's intent without having to learn exact phrases or go through menus.
- Voice-first design — the entire experience is built primarily for spoken interaction with natural dialogue pacing.
- Real-time responsiveness — minimal delay between question and answer, including natural pauses and interruptions.
- Context awareness — the agent remembers the course of the conversation and can reference previous interactions.
- Multilingual support — support for multiple languages, including Czech. According to Microsoft's official list, Microsoft Copilot Studio supports Czech among more than twenty languages.
- Flexible integration — connection to CRM systems, knowledge bases, APIs, and Power Automate automation flows.
- Deterministic control — a combination of generative AI with structured topics ensures consistency and compliance with regulations.
What you need for deployment
Real-time voice agents are not a standalone application that you download from the internet. They require a set of enterprise licenses and roles from Microsoft. Specifically, these are:
- Dynamics 365 Contact Center with an active voice channel and call routing.
- The Omnichannel administrator role, which allows configuring the voice channel and work streams.
- Copilot Studio for creating and managing agents.
- The Copilot Studio maker role with permission to create and configure agents.
For smaller companies or individuals who want to experiment with voice AI, this means an initial investment in the Dynamics 365 ecosystem. The platform is therefore primarily intended for medium and large enterprises with existing Microsoft infrastructure.
Pricing: pay-as-you-go, but watch out for hidden costs
Microsoft Copilot Studio works on a Pay-As-You-Go basis. Customers do not pay a fixed monthly fee, but rather so-called Copilot Credits based on actual consumption. The amount of credits depends on the complexity of the agent, knowledge sources, and the complexity of the generated response or action. Microsoft does not publish a specific price list for credits directly on the website — for an accurate calculation, it is necessary to use the Azure Pricing Calculator or contact a sales representative.
In addition to the credits themselves, the costs for Dynamics 365 Contact Center must be taken into account, which vary depending on the size of the organization and call volume. Companies should expect that real-time voice interaction is among the more demanding tasks and will consume more credits than a text chatbot.
A fundamental problem for Czech companies: data ends up in the USA
Here comes the most important limitation for European users. According to Microsoft's current documentation, the following rules apply from April 2026:
- The real-time voice AI model is hosted exclusively in North America.
- Customers outside North America must enable cross-geo processing — their voice data is therefore transferred and processed on servers in the USA.
- Customers within the EU Data Boundary (i.e., companies that require their data to remain in the EU) cannot use real-time voice agents at all.
For Czech companies, this means a fundamental complication. While standard text agents in Copilot Studio can run in European data centers, the real-time voice variant requires an exception from European data transfer rules. This may be completely unacceptable for companies subject to strict compliance requirements — for example, in banking, healthcare, or public administration. Microsoft recommends monitoring future updates, when availability might expand to European regions.
Security and responsible AI use
Microsoft openly warns in its documentation that security and ethical evaluations have revealed limitations in the behavior of real-time voice agents that may lead to a higher risk of generating potentially harmful content. Customers are therefore urged to adopt appropriate mitigation measures, such as setting up structured topics and deterministic controls.
Every operator is also fully responsible for compliance with applicable laws and Microsoft terms, including the Microsoft Enterprise AI Services Code of Conduct. In the context of the Czech and European environment, this means respecting GDPR, the personal data protection act, and any sectoral regulations.
How do they stand against the competition?
The conversational AI market is evolving rapidly. Similar capabilities are offered, for example, by Google Dialogflow CX with integration of Google Cloud Speech-to-Text and Text-to-Speech, or Amazon Lex with real-time voice bots for AWS. Both competitors, however, face similar regional and pricing challenges. Microsoft's advantage is deep integration with Office 365, Dynamics 365, and Power Platform, which can be a decisive argument for companies already rooted in the Microsoft ecosystem.
At the same time, however, Google and Amazon offer their voice models in a wider range of regions, which may be a more practical choice for European companies at this moment. Microsoft will have to quickly catch up on geographic availability for real-time voice agents in Copilot Studio to be a real alternative for European customers.
Can I use real-time voice agents in Copilot Studio in Czech?
Yes, Copilot Studio officially supports Czech among more than twenty languages. While text agents can run in European data centers, real-time voice agents currently require processing in North America, which may be a regulatory obstacle for some Czech companies.
What licenses do I need to run a real-time voice agent?
You need Dynamics 365 Contact Center with a voice channel, Copilot Studio, and the appropriate administrative roles. Copilot Studio is billed via a Pay-As-You-Go model using so-called Copilot Credits, the price of which depends on the complexity of the interaction.
Why can't companies in the EU Data Boundary use real-time voice agents?
Since April 2026, the real-time voice AI model is hosted only in North America. Customers within the EU Data Boundary are prohibited from transferring personal data outside the EU, which makes the use of this function impossible because voice data would have to be processed in the USA.