OpenAI Introduces 'Lockdown Mode' for ChatGPT

OpenAI has launched 'Lockdown Mode,' a new security feature for ChatGPT designed to protect against data theft from prompt injection attacks. The mode works by disabling features that can be used to exfiltrate data, such as live web browsing, agent mode, and file downloads.
OpenAI Introduces 'Lockdown Mode' for ChatGPT

OpenAI Introduces ‘Lockdown Mode’ for ChatGPT OpenAI is rolling out a new “Lockdown Mode” for ChatGPT, aiming to curb data theft from prompt injection attacks while openly acknowledging it cannot fully solve the problem.

Early June rollout and core design

On June 6, OpenAI announced Lockdown Mode as an added layer of defense against prompt injection attacks, where malicious instructions are hidden in webpages or other content a model reads. The company said the feature will “provide additional protection from prompt injection attacks,” in part by disabling live web browsing, deep research, agent mode, and retrieval of images from the web, while still allowing image generation.

Initially, OpenAI began rolling the mode out to self-serve ChatGPT Business accounts and some eligible personal users, targeting organizations that “handle sensitive data and want stricter protection from data exfiltration risks related to prompt injection.”

Expansion to all plans and technical trade‑offs

By June 7, coverage expanded: Lockdown Mode was described as available to logged-in users across Free, Go, Plus, Pro, and self-serve ChatGPT Business tiers. The feature now also disables Canvas networking and file downloads, shutting down the main channels an attacker could use to exfiltrate sensitive information.

OpenAI frames prompt injection as a “frontier” problem affecting all large language models, where hidden instructions in a webpage or uploaded file can trick a model into sending data to an attacker-controlled server. Lockdown Mode “does not stop injections from happening”; malicious payloads in cached pages or PDFs can still influence model behavior, but the mode aims to “substantially reduce the risk of prompt injection-based data exfiltration” by blocking outbound routes.

Who it’s for — and what it breaks

OpenAI stresses that “Lockdown Mode is not intended for everyone,” but for users willing to sacrifice functionality for security. With the mode on, ChatGPT loses much of what makes its agents and research tools powerful: live browsing is limited to cached content, agent mode and deep research are disabled, and some app combinations may still pose residual risk.

The company presents the feature as a pragmatic, partial answer to a structural weakness of current AI systems: they “cannot reliably separate data from instructions,” leaving prompt injection as an ongoing challenge despite tighter controls.

Continue reading https://foxvector.com/stories/019ea45d-3d1b-0f39-7006-00986d1cb513

Write a comment
No comments yet.