
With the rapid adoption of generative AI in contact centres and customer service, companies are facing a crucial question: How can automation be scaled without losing control, compliance and quality? In this article, Paul Preute from NiCE Cognigy shows why the "prompt and pray" approach is reaching its limits in an enterprise context and how Agentic AI with clear guardrails is becoming a viable basis for productive, secure and responsible AI systems in customer service.
With the spread of ChatGPT, Gemini and comparable large language model-based systems, customer expectations have changed fundamentally. Natural dialogues, contextual understanding and an apparent "understanding" are now taken for granted and transferred to the enterprise context.
A pure response CX is no longer enough: Customers expect their concerns to be resolved and not just explained. Agentic AI addresses precisely this change by enabling natural, personalised and solution-oriented interactions instead of rigid and deterministic conversations, which independently lead processes to a result.
The rapid development of large language models (LLMs) has significantly simplified the entry into agentic AI and at first glance seems like a shortcut to automation: one model, one prompt - and suddenly answers, summaries, emails or even process suggestions are generated in seconds. This approach quickly became the standard in early projects: Formulate prompt, check result, iterate.
This often works impressively in demos because the context is controlled, the input is predictable and expectations are deliberately kept low. In productive operation, however, this logic breaks down. LLMs are probabilistic systems: they deliver fluent, convincing outputs, but not necessarily correct ones. Hallucinations, incomplete justifications or inconsistent answers are not a marginal phenomenon, but a systemic characteristic.
As long as a person controls every output, this is manageable. However, as soon as systems rely on results or process them automatically, "prompt and pray" becomes a risk strategy. The point is not that prompting is useless, but that it is not sufficient as the only operating model for enterprise automation.
Agentic AI is not a "better chatbot". The key difference is that agents think and act. They plan steps, select tools, initiate actions and make decisions - sometimes autonomously.
Instead of working through strictly predefined process chains, an agentic system plans its next steps in a context-dependent and goal-orientated manner. This allows it to react flexibly to different situations and make decisions without having to model all possible processes in advance.
With action comes responsibility. Any wrong action can cause costs, reputational damage or compliance violations. This is why different quality standards apply to agentic AI than to deterministic systems. An answer that is "only" imprecise may be unpleasant in service - an action that is carried out incorrectly is potentially critical.
Autonomy not only increases efficiency, but also multiplies the risk. This is precisely why agent-based systems must not only be clever, but above all controllable, reproducible and auditable.
As soon as prompting is scaled, the enterprise reality becomes apparent: reproducibility becomes the core problem. Small variations in the input or prompt lead to noticeably different results. What seemed stable in tests yesterday can deviate in production today - without a clear reason and without deterministic debugging paths.
Added to this are governance and security requirements: Who is allowed to see which data? What actions are permitted? How are decisions explained, logged and, in case of doubt, reversed?
In many prompt-based approaches, business logic is stored as free text in the prompt. There it is difficult to version, almost impossible to test and practically impossible to audit. This leads to a dangerous mix of reasoning and control.
Prompt-level guardrails are also fragile. They often work in controlled scenarios, but in reality they can be partially cancelled out by unexpected inputs, prompt injection or ambiguous formulations. The more users, channels and use cases are added, the larger the attack surface becomes - and the more difficult it is to consistently guarantee security, compliance and quality with prompting alone.
The result: companies invest a lot in prompt tuning, but do not achieve reliable operational maturity.
The way out of this impasse is a composite approach: Agentic AI is designed as a system, not as a single prompt. The core idea is the separation of responsibilities.
The LLM does what it is good at - understanding language, designing responses, recognising intentions. Control, policies and critical decisions, on the other hand, are anchored outside the model.
In practical terms, this means a multi-layered architecture with clear guardrails:
Eine Wissens- und Kontextschicht stellt sicher, dass der Agent mit verlässlichen, aktuellen Unternehmensinformationen arbeitet, statt zu raten.
Eine Orchestrierungs- und Workflow-Schicht zerlegt komplexe Aufgaben in überprüfbare Schritte und entscheidet, welche Tools und Systeme wann genutzt werden.
Eine Policy- und Validierungsschicht setzt Regeln durch: Berechtigungen, Datenzugriffe, erlaubte Aktionen, Prüfungen von Outputs, Risikoklassifizierung und Eskalation.
Optional wird Human-in-the-Loop dort integriert, wo Risiko oder Unsicherheit es erfordern.
Guardrails are not an add-on in this model, but the foundation. They turn probabilistic reasoning into a controllable operating system for autonomy.
This has three direct advantages:
wird Verhalten reproduzierbarer, weil nicht jede Entscheidung im Prompt versteckt ist.
wird Governance operationalisierbar, weil Policies zentral definiert und systematisch durchgesetzt werden.
wird Skalierung möglich, weil neue Use Cases nicht durch Prompt-Magie entstehen, sondern durch wiederverwendbare Bausteine wie Tools, Workflows, Validierer und Observability.
"Prompt and Pray" was a helpful start, but not a sustainable operating model for agentic AI in the enterprise. As soon as agents take action, reliable guardrails are needed: policies, validation, observability and clear orchestration.
Guardrails do not slow down innovation - they make it responsible. If you want to use agentic AI productively and on a large scale, you need to switch from prompts as artefacts to architecture as an operating concept.
Autonomy becomes scalable in the enterprise not through hope, but through structure.
No Comments