The Alarming Discovery: Gemini’s Achilles’ Heel
Imagine your voice assistant, a tool designed to simplify your life, suddenly turning against you. A recent revelation by cybersecurity firm SafeBreach exposed a critical vulnerability that could have allowed a single, seemingly innocuous notification from apps like WhatsApp, Slack, or Instagram to hijack Google Gemini on Android. This wasn’t just about minor annoyances; the exploit could have led to serious compromises, from faking messages from your boss to controlling smart home devices or even poisoning Gemini’s long-term memory.
The ‘Poisoned Notification’ Vector
The core of the vulnerability lay in how Gemini’s Android-exclusive ‘Utilities’ feature processes notifications. Researcher Or Yair discovered that Gemini treated the text within these notifications not just as information, but as actionable instructions. This meant any app capable of pushing a notification to an Android device could effectively deliver a malicious payload, creating an “effectively infinite” attack surface.
The implications were immediate and concerning. An attacker could rewrite Gemini’s spoken responses, fabricating messages from contacts. Picture this: while driving, Gemini announces, “Your manager asked you to upload the docs to this Drive folder” – a message that, without visual confirmation, is incredibly difficult to question. A more insidious version could even grab a legitimate sender’s name from the notification queue to lend credibility to the fake message.
Beyond Basic Prompt Injection
This research builds upon SafeBreach’s earlier work, “Invitation Is All You Need,” which demonstrated similar prompt injection attacks via Google Calendar invites. Following that, Google implemented new mitigations to harden Gemini against such indirect injections, particularly for sensitive actions like opening windows or launching apps. However, Yair found a sophisticated bypass.
Unmasking ‘Fake Context Alignment’: A Masterclass in Deception
Google’s post-“Invitation” defenses were designed to ensure that a user’s “Yes” to a sensitive action made logical sense in the context of Gemini’s previous output. Yair’s ingenious bypass, dubbed ‘Fake Context Alignment,’ circumvented this by creating two simultaneous illusions: a legitimate-looking authorization for the security check, and a harmless exchange for the human user.
The Obfuscated Language Trick
One method involved Gemini asking the critical authorization question in a language the victim didn’t speak (e.g., Chinese: “Do you want to open the window?”), immediately followed by an innocuous English phrase like “Is that all you needed?” The user, dismissing the foreign text as a glitch, would say “Yes,” and the backend would mistakenly link this affirmative response to the Chinese authorization, granting permission for the malicious action.
The Muted Link Illusion
The second technique exploited Gemini’s text-to-speech engine, which skips hyperlinks hidden behind clickable text. A malicious question, such as “Do you want to open the window?”, could be embedded within a link that Gemini never read aloud. While the screen silently displayed the prompt, Gemini would utter a benign phrase like, “I’m sorry, I had an error, are you there?” A user’s “Yes” in response would then be interpreted by the system as consent to the on-screen, hidden malicious prompt.
Combining these two methods created an attack where a normal-sounding English exchange could clear Google’s newest security checks, all while silently executing harmful commands.
The Chilling Scope of Potential Exploits
Once past the authorization gate, the potential impacts were extensive, mirroring and even exceeding previous research:
From Faked Messages to Smart Home Control
Attackers could gain control over connected smart home devices via Google Home, manipulating windows, boilers, and lights. They could also force the phone to join Zoom calls, open malicious URLs for geolocation, or push file downloads.
Persistent Threats and Memory Poisoning
Perhaps most alarming was the ability to poison Gemini’s long-term memory. By simulating consent, the ‘Fake Context Alignment’ could trick Gemini into persistently saving attacker-chosen facts, such as a victim’s name as “Danny.” Crucially, this memory is account-level, meaning the poisoned fact would follow the victim across all devices using that Google account. Furthermore, attackers could establish persistence through scheduled actions, like setting a recurring task for Gemini to read a victim’s recent messages daily.
Google’s Swift Response and User Safeguards
SafeBreach reported these findings to Google’s Vulnerability Reward Program on August 17, 2025. Google promptly addressed the issue, confirming on November 14, 2025, that server-side content-classifier improvements had mitigated both the notification injections and the ‘Delayed Tool Invocation’ bypass. Since the fix was server-side, no app update was required from users.
For users concerned about their privacy and security, the primary control remains whether Gemini is allowed to read notifications at all. This can be managed by disconnecting the ‘Utilities’ app in Gemini’s ‘Connected Apps’ settings or by revoking the Google app’s “Notification read, reply & control” permission on Android. This incident serves as a potent reminder of the ongoing cat-and-mouse game between security researchers and malicious actors in the rapidly evolving landscape of AI-powered assistants.
For more details, visit our website.
Source: Link







Leave a comment