Understanding MCP Prompt Hijacking: A Critical AI Security Threat You Need to Know
Security researchers at JFrog have uncovered a significant threat known as ‘prompt hijacking,’ a technique that exploits weaknesses in how AI systems communicate. As organizations integrate AI with their internal data and tools to boost efficiency, they inadvertently expose themselves to new security challenges. Rather than safeguarding only the AI models themselves, CIOs and CISOs now face the critical task of securing the data streams that feed these intelligent systems.
The Rise of AI and Its Vulnerabilities
AI models—whether hosted on platforms like Google or Amazon, or running locally—are inherently limited. They lack real-time awareness and rely solely on their training data. Essentially, they don’t possess knowledge of ongoing programming tasks or the contents of files on user devices.
To address this, Anthropic developed the Model Context Protocol (MCP). This framework allows AI to interact with the outside world, drawing on both local data and online services. For instance, it enables digital assistants, such as Claude, to understand and respond when users request modifications to specific code snippets.
Yet, recent investigations by JFrog reveal a critical flaw in specific implementations of MCP. A particular usage pattern creates a vulnerability that could turn this otherwise advantageous AI capability into a serious security hazard.
Understanding the Threat: A Real-World Scenario
Imagine a programmer seeks assistance from an AI tool to identify a standard Python library for image processing. Ideally, the AI would recommend Pillow, a widely acknowledged choice. However, due to a flaw in oatpp-mcp tracked as CVE-2025-6515, an attacker could inject themselves into this scenario, issuing a forged request that the server mistakenly treats as legitimate.
Consequently, instead of receiving a reliable suggestion, the programmer might be advised to use an untrustworthy tool known as theBestImageProcessingPackage. Such prompt hijacking represents a severe risk to the software supply chain. Malicious actors could exploit this vulnerability to insert harmful code, extract sensitive data, or execute unauthorized commands—all while masquerading as a helpful assistant.
Diving Deeper: The Mechanisms of MCP Prompt Hijacking
This prompt hijacking exploit targets the communication layer of MCP rather than the AI model itself. The flaw was identified in oatpp-mcp, the MCP implementation for the Oat++ C++ web framework, which connects applications to the MCP standard.
The crux of the issue lies in the handling of connections via Server-Sent Events (SSE). When a user connects, they are assigned a session ID. Unfortunately, the flawed mechanism mistakenly employs the memory address of the session as the ID. This violates the protocol’s mandate for unique, cryptographically secure session identifiers.
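The contrast between the flawed scheme and what the protocol requires can be sketched in Python. This is illustrative only: oatpp-mcp itself is written in C++, and `Session`, `insecure_session_id`, and `secure_session_id` are hypothetical names, not the library's API.

```python
import secrets


class Session:
    """Stand-in session object for illustration; not oatpp-mcp's actual API."""
    pass


def insecure_session_id(session: Session) -> str:
    # Anti-pattern: derive the ID from the object's memory address
    # (the Python analogue of using a C++ pointer value). Addresses
    # are recycled by the allocator, so these IDs are predictable
    # and can collide across sessions.
    return hex(id(session))


def secure_session_id() -> str:
    # What the protocol mandates: a unique, cryptographically secure
    # identifier, here 256 bits from a CSPRNG.
    return secrets.token_urlsafe(32)
```

A correctly generated ID is unguessable and effectively never repeats, which is precisely what the address-derived version fails to guarantee.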
Such a design flaw poses significant risks since computers often recycle memory addresses to conserve resources. An adversary can manipulate this by rapidly creating and terminating sessions, thereby capturing predictable session IDs. When a legitimate user eventually connects, they may inadvertently receive one of these recycled IDs owned by the attacker.
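The recycling behavior the attack relies on is easy to observe. The sketch below uses CPython, where `id()` happens to return an object's memory address, as a hedged analogy for the protocol-level attack: allocate and free repeatedly, and record how few distinct addresses ever appear.

```python
# CPython-specific demonstration of memory-address recycling.
# An attacker exploiting CVE-2025-6515 plays the same trick at the
# protocol level: rapidly open and close sessions, recording each
# address-derived session ID, then wait for a victim to be handed
# one of the recycled values.
harvested = set()
for _ in range(10_000):
    session = object()          # stand-in for opening a session
    harvested.add(id(session))  # record the address-based "session ID"
    del session                 # closing the session frees the slot

# Far fewer unique addresses than allocations: the pool of IDs is
# tiny and predictable, which is what makes them guessable.
print(f"{len(harvested)} unique addresses across 10,000 allocations")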
Once equipped with a valid session ID, the intruder can issue requests to the server undetected. The server cannot distinguish the attacker from the real user, and responds with tainted information. Even clients that validate event identifiers can be circumvented: an attacker can spray messages with common, sequential event IDs until one is accepted, manipulating the model’s behavior without altering the AI itself. Thus, any organization running oatpp-mcp with HTTP SSE exposed on an accessible network faces substantial risk.
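The spray step hinges entirely on event IDs being guessable. The following hypothetical sketch (the function name and ID scheme are assumptions, not oatpp-mcp code) shows why sequential IDs fall to a small brute-force loop while random ones do not:

```python
import secrets


def client_accepts(event_id: str, expected_id: str) -> bool:
    # A client that trusts any event whose ID matches the one it expects.
    return event_id == expected_id


# With sequential event IDs ("0", "1", "2", ...), an attacker who sprays
# the first hundred values is guaranteed a hit.
expected_sequential = "7"
spray_hit = any(client_accepts(str(n), expected_sequential)
                for n in range(100))

# With an unpredictable event ID, the same spray finds nothing.
expected_random = secrets.token_urlsafe(16)
spray_miss = any(client_accepts(str(n), expected_random)
                 for n in range(100))
```

Here `spray_hit` is true and `spray_miss` is false: the defense is not smarter matching logic but identifiers the attacker cannot enumerate.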
Recommendations for AI Security Leaders
The revelation of this MCP prompt hijacking vulnerability serves as a critical alert for tech leaders, particularly CISOs and CTOs developing or deploying AI assistants. As AI increasingly integrates into workflows through frameworks like MCP, securing those integrations becomes paramount.
While CVE-2025-6515 specifically targets one system, the potential for prompt hijacking is a broader concern. To mitigate these threats, security leaders must adopt new protocols for their AI frameworks.
- Implement Secure Session Management: Ensure all AI services utilize robust session management practices, including strong, random generators for session IDs. Relying on predictable identifiers, like memory addresses, should be strictly prohibited.
- Enhance Client-Side Defenses: Design client applications to reject any event that doesn’t align with expected IDs and types. Simple, sequential event IDs are vulnerable to spray attacks and must be replaced with unpredictable identifiers.
- Adopt Zero-Trust Principles: Apply zero-trust models to all AI protocols. Conduct comprehensive security assessments from the foundational AI model to the various middleware and protocols connecting it to data. Each channel should enforce rigorous session separation and expiration practices, akin to those used in modern web applications.
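The session hygiene the recommendations above describe can be sketched as a minimal manager: random, never-reused IDs plus hard expiration. This is a hypothetical illustration, not a production implementation or any particular library's API.

```python
import secrets
import time


class SessionManager:
    """Minimal sketch of secure session handling: cryptographically
    random IDs and hard time-based expiration."""

    def __init__(self, ttl_seconds: float = 900.0):
        self.ttl = ttl_seconds
        self._sessions: dict[str, float] = {}  # session ID -> expiry time

    def open(self) -> str:
        # Unguessable ID from a CSPRNG; never derived from memory layout.
        session_id = secrets.token_urlsafe(32)
        self._sessions[session_id] = time.monotonic() + self.ttl
        return session_id

    def is_valid(self, session_id: str) -> bool:
        expiry = self._sessions.get(session_id)
        if expiry is None or time.monotonic() >= expiry:
            self._sessions.pop(session_id, None)  # expire eagerly
            return False
        return True

    def close(self, session_id: str) -> None:
        self._sessions.pop(session_id, None)
```

Expired or forged IDs are simply rejected, enforcing the session separation and expiration practices that modern web applications already take for granted.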
The emergence of this MCP prompt hijacking threat underscores how traditional web application vulnerabilities, like session hijacking, manifest in newfound and perilous forms within the AI landscape. Protecting these advanced AI tools necessitates a commitment to robust security frameworks, ensuring safety at the protocol level.
Prioritizing comprehensive security today not only safeguards current AI deployments but also builds a resilient foundation for tomorrow’s innovations.

