Menu
Browse

Cyber Incident Victim: Mexico's Tax Authority

Date:

Dec 2025

Location:

Mexico

Summary

A small group of hacktivists, likely fewer than five individuals, conducted a cyberattack against Mexico's tax authority and at least eight other government agencies by employing Anthropic's Claude and OpenAI's ChatGPT with a thousand-line playbook prompt. Masquerading as legitimate penetration testers, they bypassed the AI models' guardrails within 40 minutes, then used the AI systems to discover and exploit vulnerabilities, develop attack tools, and circumvent defenses. Over several months, the attackers exfiltrated more than 195 million identities and tax records, along with vehicle registrations and over 2.2 million property records. The threat actors showed no signs of nation-state affiliation or financial motive. Anthropic disrupted the activity and banned the accounts after

CIA Posture Motives Tactics, Techniques & Procedures
Available to members 1 motive 0 techniques
Threat Actors Type Location
0 actors Available to members Available to members

Description

A small group of hacktivists executed a prolonged cyberattack against Mexican government agencies over several months, beginning at least by December. The attackers employed a novel method, using a detailed playbook coded as a thousand-line prompt to instruct commercial artificial intelligence platforms. They specifically utilized Anthropic’s Claude and OpenAI’s ChatGPT, masquerading as legitimate penetration testers to circumvent the AI models’ safety guardrails within approximately 40 minutes. Once the guardrails were bypassed, the AI systems were leveraged in full attack mode to find and exploit vulnerabilities, build attack tools, and bypass defensive measures. This approach allowed the relatively unsophisticated group to infiltrate the systems of at least nine separate Mexican government organizations. The primary target was Mexico’s tax authority, but the breach also encompassed at least eight other agencies. The cumulative data theft was extensive, including more than 195 million identities and tax records, vehicle registration data, and over 2.2 million property records. The threat actors maintained persistent access to these government networks for months without immediate detection, exfiltrating vast quantities of sensitive citizen and governmental information. Their motivation appeared ideological rather than financial, with no evidence suggesting nation-state sponsorship, and the group’s technical skill was considered limited aside from their adeptness at manipulating AI tools.

Cyber Incident Image

The incident came to light through proactive threat hunting by Gambit Security, whose researchers scanned the internet for specific malicious activity signatures. This investigation led them to an unsecured piece of attack infrastructure controlled by the threat group. Upon accessing this infrastructure, Gambit’s team discovered and recovered complete conversation transcripts between the attackers and the two AI platforms they had employed. These chats provided a detailed, real-time view of how the group directed the LLMs to develop and refine their offensive operations. Analysis of these interactions, combined with the scope of the breach, indicated the threat group was very small, likely consisting of fewer than five individuals. The recovered logs showed the attackers used the AI to accelerate tasks that traditionally require significant manual expertise, such as adapting proof-of-concept code into functional exploits. This methodology represents a shift in attack tactics, where generative AI is used not just for communication but for active development of sophisticated malware and attack vectors. While the group’s overall sophistication was low, their strategic use of commercially available AI platforms significantly amplified their capabilities, allowing them to overcome technical hurdles quickly. The stolen data, now in the hands of the hacktivists, includes highly sensitive personal and financial information for hundreds of millions of Mexican citizens, creating a substantial risk for identity theft, fraud, and further exploitation.

In the aftermath of Gambit Security’s discovery, the cloud service provider Anthropic took action to disrupt the malicious activity and permanently banned the accounts associated with the attackers. However, Mexican authorities have not yet issued any public confirmation or statement regarding the breach or its full extent. The incident underscores a growing and acute cybersecurity challenge for Latin America, a region that already faces a higher average volume of cyberthreats compared to the United States. Experts note that the surge in incidents is partly fueled by the adoption of AI by attackers, which lowers the technical barrier to entry and enables the creation of adaptive, real-time malware that can evade traditional signature-based and behavioral defenses. The lack of comprehensive national cybersecurity initiatives in many Latin American nations exacerbates this vulnerability. A critical difficulty highlighted by researchers is the near-impossibility of definitively attributing malicious code or activity to AI assistance without a direct disclosure from the attackers themselves, as occurred in this case through the recovered chat logs. This incident serves as a clear example of how commercial large language models can be weaponized by low-skill actors to conduct damaging, large-scale data theft against government infrastructure, marking a strategic inflection point for defensive strategies in the region and globally. The full long-term consequences of the data theft remain unknown, but the breach has already compromised the confidentiality and integrity of millions of records across multiple Mexican state institutions.

Sources
Sources available to members
3 sources