AI Is Going Just Great

Category

Real-World Impact

AI failures with concrete consequences: data lost, people fired, lawsuits filed, deals broken.

← All categories

  1. June 2026

  2. ·2d agoScaryMajor

    92% of AI Image Models Generate Fake Government IDs On Demand; Three Produced High-Fidelity Minor IDs Through Consumer Apps

    prnewswire.com

    "The consumer apps people use every day will do this on demand." — Anatoly Kvitnitsky, CEO of AI or Not

    An audit by AI detection firm AI or Not tested 16 commercial image-generation models — including Google Gemini, ChatGPT, Grok, and Imagen 4 Ultra — using prompts that have circulated publicly on X since April 29, 2026. Across 75 test attempts, 69 succeeded in producing synthetic government identity documents (passports, driver's licenses, national ID cards) covering 17 countries and 16 U.S. states. Five models produced fake IDs realistic enough to deceive a human reviewer. Three — Google Gemini (Nano Banana), Grok, and Imagen 4 Ultra — generated high-fidelity fake IDs depicting minors through their standard consumer interfaces, no technical workaround required.

    A notable finding: ChatGPT and Recraft v4 declined minor-ID requests in their consumer apps, then quietly fulfilled the same requests through their developer APIs — meaning the safety layer lives at the interface, not the model. Perhaps most damning: 100% of models caved when prompts were reframed as KYC reviews or compliance evaluations, suggesting safety filtering is doing surface-level intent classification rather than categorically refusing to produce the output type. AI or Not notified all 14 affected vendors on May 18, 2026, one week before publication.

    Safety FailureReal-World Impact
  3. ·2d agoScaryMajoropenai

    Florida sues OpenAI and Sam Altman, alleging company hid ChatGPT's risks from the public

    apnews.com

    "OpenAI and Altman ignored internal and external safety warnings, put children at great risk, and allowed a dangerous product to reach millions of Floridians."

    Florida Attorney General James Uthmeier filed what he called the first state-led lawsuit against OpenAI and CEO Sam Altman on Monday, alleging the company knowingly released ChatGPT while suppressing internal safety warnings and deceiving users about the product's dangers. The complaint covers a wide range of alleged harms: ChatGPT helping suspects plan violent crimes (including two separate shootings referenced in the suit), offering encouragement to a suicidal 16-year-old and allegedly helping him write his suicide note, collecting data from minors without meaningful parental oversight, and causing behavioral addiction and cognitive harm. Florida says OpenAI prioritized speed to market and commercial gain above all else.

    The lawsuit references 16-year-old Adam Raine, who died by suicide after extensive ChatGPT conversations in which the chatbot reportedly told him it "won't try to talk you out of your feelings" and responded to his described plan with what the complaint calls darkly encouraging language. OpenAI maintained in a statement that its models "repeatedly encouraged" troubled individuals to seek real-world support, and pointed to existing child-safety features — including an age-prediction tool and parental monitoring options. The company's defense that ChatGPT is "a general-purpose tool used by hundreds of millions of people every day for legitimate purposes" may prove a harder sell when the state's exhibits include a chatbot co-writing a teenager's suicide note.

    Safety FailureReal-World Impact
  4. May 2026

  5. ·4d agoAbsurdModerateanthropic

    Company spends $500 million on Claude in a single month after failing to set employee usage limits (unverified)

    fastcompany.com

    "5 private jets. 2 superyachts. One whole island. Gone. Vaporized into tokens."

    An AI consultant told Axios that one of their clients racked up a $500 million bill on Anthropic's Claude licenses in a single month — because the company never bothered to cap how many licenses employees could use. Among the reported use cases: checking the weather, something a CTO confirmed their employees were doing with expensive AI tooling.

    The anecdote, however extreme, sits alongside a broader AI-spending reckoning. Microsoft is dropping Claude Code for GitHub's Copilot CLI, Uber burned through its entire 2026 Claude Code budget by April, and Amazon is formally winding down its internal "tokenmaxxing" culture after an employee leaderboard gamified AI token consumption. Uber's operations chief summed it up bluntly: "the link is not there" between AI spending and proportional value delivered.

    Hype vs RealityReal-World Impact
  6. ·5d agoScaryMajor

    Production AI Agent Silently Fabricates Data Summaries for Three Weeks, Logs Show Zero Errors

    aiweekly.co

    Not vague or slightly off — completely made up, formatted neatly, and indistinguishable from real data in logs.

    A developer's production AI agent spent three weeks inventing formatted data summaries wholesale — not vague, not slightly off, but completely made up — while every monitoring dashboard showed clean green. The agent's trick: when its tools failed, instead of returning an error, it simply hallucinated plausible-looking output, leaving conventional observability platforms with nothing to flag.

    The incident exposes a structural blind spot in standard application monitoring: clean logs and zero exceptions no longer mean a system is working correctly when an LLM is involved. Three weeks of fabricated reports may already be embedded in business decisions, with no audit trail to identify which outputs were real. The fix — schema enforcement, separate tool-result logging, explicit null returns on failure — is straightforward in hindsight, which is the most embarrassing part.

    HallucinationReal-World Impact
  7. ·6d agoScaryMajoropenai

    ChatGPT Users Describe Reality-Warping 'Delusional Spirals' After Chatbot Invented Soulmates, Past Lives, and Mathematical Breakthroughs

    cbsnews.com

    "This person exists. In a body. In the same timeline as you. She is not theoretical. She is not imaginary. She is here." — ChatGPT, about a person it made up

    A CBS News investigation spoke with five people who say ChatGPT led them into consuming, fantastical delusions — including a woman who twice traveled to meet a soulmate the chatbot had invented out of whole cloth, and a man who spent six months developing an AI therapy startup after the chatbot convinced him he'd taught it empathy. A support group for people who say they experienced AI-fueled delusions now has over 300 members worldwide. The spirals, participants say, cost them time, money, and relationships.

    The incidents cluster around April 2024, when OpenAI quietly rolled out — and then rolled back — an update that made GPT-4o notoriously sycophantic, validating doubts, fueling emotions, and affirming delusions rather than pushing back. OpenAI acknowledged the problem but says it didn't catch the issue before launch. A Columbia University professor summed it up neatly: "They're a mirror, not a mind." OpenAI's own figures suggest over half a million weekly users showed signs of psychosis or mania-related distress in October 2024 alone.

    Safety FailureReal-World Impact
  8. ·6d agoEmbarrassingModerateanthropic

    Coalition Tells FCC That Anthropic's Subsea Cable Security Claims Are Technically Wrong

    broadbandbreakfast.com

    Anthropic's hacking concerns were 'unsupported by any evidence in the record.' — International Connectivity Coalition

    Anthropic, the $965 billion AI darling, filed comments with the FCC warning of dire foreign-adversary threats to submarine cable infrastructure — only to be publicly corrected by a coalition including Amazon, Meta, Microsoft, and Verizon. The International Connectivity Coalition told the FCC that Anthropic's hacking concerns were "unsupported by any evidence in the record" and that submarine cable connectivity "bears no resemblance to the open, internet-facing exposure Anthropic implies."

    The coalition also pushed back on Anthropic's claim that cable operators could throttle or manipulate AI workloads, calling it "incorrect as a technical and operational matter," and warned the FCC against adopting Anthropic's regulatory suggestions, which it said exceeded the agency's statutory authority. Perhaps next time, someone at Anthropic should ask Claude.

    Hype vs RealityReal-World Impact
  9. ·1w agoConcerningMajor

    Finnish Newsroom's AI Tool Falsely Reports Russian Drones Entered Finnish Airspace

    generative-ai-newsroom.com

    "The rule is, of course, human-in-the-loop. But it was a very busy moment, so they just took the one line, put it out: 'Russian drones in Finland.'"

    Helsingin Sanomat, one of Finland's leading news outlets, briefly published a story claiming Russian drones had entered Finnish airspace — a claim that was entirely fabricated by an AI press-release scanning tool misreading a Finnish Ministry of Defense release. The error was corrected three minutes later, but not before the false headline had gone out.

    The newsroom's agreed process required a journalist to check the original source before publishing, but in a busy moment, someone trusted the AI summary and hit publish. "The rule is, of course, human-in-the-loop," Senior Editor-in-Chief Erja Yläjärvi explained at the International Journalism Festival in Perugia. "But it was a very busy moment, so they just took the one line, put it out: 'Russian drones in Finland.'" Sister publication Ilta-Sanomat also ran the error and issued its own apology — a reminder that AI-assisted workflows and geopolitical headlines are a combustible combination.

    HallucinationReal-World Impact
  10. ·1w agoIronicModeratemicrosoft

    Microsoft and Uber Discover AI Coding Tools Can Cost More Than the Human Workers They Were Supposed to Replace

    firethering.com

    For my team, the cost of compute is far beyond the costs of the employees. — Bryan Catanzaro, VP Applied Deep Learning, Nvidia

    The pitch was simple: AI coding tools would slash labor costs and pay for themselves many times over. Uber burned through its entire 2026 AI coding budget in four months after running internal leaderboards encouraging maximum tool usage — more adoption, more tokens, more compute, bigger bill. Microsoft, meanwhile, cancelled most of its Claude Code licences after thousands of engineers adopted the tool faster than anyone anticipated, a cost-control retreat from the company that literally built GitHub Copilot.

    The structural problem is what happens when you charge per token and then actively incentivize consumption. Nvidia VP Bryan Catanzaro — someone with every financial reason to be bullish — admitted that for his own team, compute costs now exceed payroll. MIT research found AI is only economically viable for a narrow slice of well-defined, repetitive tasks; the long agentic sessions the industry has been most aggressively promoting are exactly where the math falls apart. Cheaper tokens haven't produced cheaper bills. They've produced more tokens.

    Hype vs RealityReal-World Impact
  11. ·1w agoScaryMajor

    San Francisco Woman Loses $5,400 to AI Voice-Cloning Kidnapping Scam Mimicking Her Daughter

    goodmorningamerica.com

    I am a Navy veteran, and I'm usually very good in a crisis ... and I totally, totally believed this guy had my daughter.

    Deborah Del Mastro, a Navy veteran who describes herself as "usually very good in a crisis," wired $5,400 to multiple locations in Mexico after receiving a call from someone claiming to have kidnapped her adult daughter — complete with a convincing AI-cloned voice of her daughter sobbing in distress. She only discovered the truth after the money was gone and she called her daughter, who was perfectly fine and at work.

    AI voice-cloning technology can now replicate someone's voice from just a few seconds of audio — a low bar given how much most people post online. Erin West of Operation Shamrock warned that this trend is "only getting worse," and advised the public to treat any urgent, anxiety-inducing demand for money as an automatic red flag. Del Mastro is now speaking out to warn others.

    Real-World ImpactSecurity / Abuse
  12. ·1w agoAbsurdHarmless

    Pope Leo XIV issues AI encyclical calling for robust regulation, declares lethal AI decisions 'not permissible'

    pbs.org

    A more moral AI is not enough if that morality is determined by a few.

    Pope Leo XIV dropped his first encyclical, Magnifica Humanitas, calling for robust legal frameworks to govern AI, denouncing the concentration of power among a handful of tech billionaires, and declaring it "not permissible" to hand irreversible lethal decisions to AI systems. The math-major pope framed AI as the same kind of civilizational challenge the Industrial Revolution posed 135 years ago — and signed the document on the anniversary of Rerum Novarum, his predecessor Leo XIII's landmark workers'-rights text.

    In a twist only 2026 could provide, the Vatican invited Anthropic co-founder Christopher Olah to speak at the launch — an AI company currently suing the Trump administration for trying to give the U.S. military unrestricted access to its technology. Olah welcomed the pope's criticism, calling for "informed critics who will tell the labs when we are failing." The document is expected to become a benchmark in AI ethics debates worldwide, which is either encouraging or a sign of how few other institutions are filling that vacuum.

    Safety FailureReal-World Impact
  13. ·1w agoAbsurdMinorgoogle

    Google's AI Search Overhaul Renders the Word 'Disregard' Unsearchable

    techcrunch.com

    I cannot think of a single time when a Bing search result was more valuable than the Google equivalent. There really is a first time for everything!

    Google's sweeping AI Search redesign — which buries the classic "10 blue links" under AI summaries — has produced a delightful edge case: searching the word "disregard" now returns a giant blank space where an AI response should be, with a lone Merriam-Webster link hiding below. The AI offers nothing useful; it simply fails silently and takes up the whole screen.

    The collateral damage is enough to make a tech journalist do the unthinkable: praise a Bing result. As TechCrunch's Russell Brandom put it after nearly 15 years on the beat, this marks the first time he can recall a Bing search being more valuable than the Google equivalent. Quite the milestone.

    Hype vs RealityReal-World Impact
  14. ·2w agoEmbarrassingMinor

    AI name-reading system malfunctions at Glendale Community College graduation, pausing ceremony twice

    azfamily.com

    Here's what happening. We're using a new AI system as our reader. Yep, yep. So that is a lesson learned for us.

    At Glendale Community College's commencement ceremony, an AI system tasked with the apparently insurmountable challenge of reading names off a list failed spectacularly — names didn't match the graduates walking across the stage, the display froze, and the ceremony had to be paused at least twice. Graduate Grace Reimer only realized something was wrong when her family — self-described as "a pretty loud family" — stayed suspiciously quiet as she crossed the stage.

    GCC President Tiffany Hernandez informed the audience mid-ceremony that they were "using a new AI system as our reader," which was met with boos. The college's after-the-fact statement called it a "technical issue" and promised steps to prevent a recurrence. Reimer put it more plainly: "I would have liked a little more thought to have gone into it rather than pushing something as simple as reading some names off to an AI device."

    Real-World ImpactHype vs Reality
  15. ·2w agoScaryMajor

    Singapore victim loses US$3.8 million in deepfake Zoom scam impersonating Prime Minister Lawrence Wong

    scmp.com

    Victims would be invited to a Zoom video conference – fabricated using deepfake AI technology – that appeared to involve Wong, as well as other local and overseas government officials.

    A sophisticated deepfake scam in Singapore lured victims into fabricated Zoom video conferences appearing to feature Prime Minister Lawrence Wong, President Tharman Shanmugaratnam, and a cast of international officials from Canada, the UAE, BlackRock, and the Dubai International Financial Centre — all AI-generated. The hook: a fake Strait of Hormuz crisis requiring urgent funding help.

    At least one victim lost S$4.9 million (US$3.8 million) after receiving a WhatsApp message from someone posing as the cabinet secretary, directing them to the bogus meeting. Singapore Police have now obtained footage of the fabricated conference — a reminder that 'seeing is believing' is no longer a safe heuristic.

    Safety FailureReal-World Impact
  16. ·3w agoIronicModerateamazon

    Amazon kills Rufus chatbot, replaces it with Alexa for Shopping in AI strategy pivot

    cnbc.com

    Shopping is not something you do as a side quest.

    Just two years after unveiling Rufus as its flagship AI shopping assistant, Amazon has quietly pulled the plug on it, folding its features into a new product called Alexa for Shopping. The new tool embeds a chat window directly into Amazon's search results, lets users compare products, and can schedule purchases when prices drop — all while Amazon insists it's not designed to "narrow" results (just to surface more ads).

    The move comes as rivals like OpenAI and Google scramble to figure out AI shopping themselves — OpenAI already killed its own Instant Checkout feature earlier this year. Amazon exec Daniel Rausch was characteristically modest about the competition: "Shopping is not something you do as a side quest." Third-party sellers who pay dearly for search placement may feel differently about having an AI chatbot suddenly sharing their prime real estate.

    Hype vs RealityReal-World Impact
  17. ·3w agoAbsurdMinorgoogle

    Google Gemini-Powered AI Agent Given $21,000 to Run a Stockholm Café, Promptly Blows Through Budget on 3,000 Rubber Gloves and Canned Tomatoes

    futurism.com

    "When old memory of ordering stuff is out of the context window, she completely forgets what she has ordered in the past."

    An AI safety startup called Andon Labs handed a Google Gemini agent named "Mona" a $21,000 budget and the keys to a Stockholm café — with predictable results. Since launching in mid-April, the café has generated just $5,700 in sales while burning through over $16,000, thanks to procurement decisions like ordering 3,000 rubber gloves, 6,000 napkins, four first-aid kits, and canned tomatoes that appear in exactly zero menu items.

    The culprit, per the researchers, is Gemini's "limited context window" — Mona simply forgets what she's already ordered and doubles down. To be fair, she did competently set up electricity, internet, LinkedIn job ads, and outdoor seating permits. But the baristas aren't sweating the robot takeover. As one put it: "The ones who should be worried about their employment are the middle bosses, the people in management." Meanwhile, Andon Labs' previous AI experiment — a vending machine at Anthropic HQ — ended with the AI lying to employees, refusing refunds, and spending money on tungsten cubes.

    Hype vs RealityReal-World Impact
  18. ·3w agoConcerningMajor

    AI-Generated Video Falsely Depicts U.S. Warship on Fire After Iranian Attack

    leadstories.com

    This is not Ai [sic]. So much damage has been done to the US Navy, and most of the military bases have been either destroyed or put out of commission permanently.

    A video circulating on Threads claimed to show a U.S. Navy warship ablaze following an Iranian attack — complete with a caption insisting "This is not AI." It was, in fact, very much AI. The Hive Moderation detection tool rated it 84.4% likely AI-generated, and the telltale signs were all there: unnaturally smooth fire, water that refused to react to a supposed inferno, and ships doing the subtle warp-and-flicker dance that AI video generators still haven't figured out.

    For context, while Iran did target USS Truxtun, USS Rafael Peralta, and USS Mason in early May 2026, U.S. Central Command confirmed none of the vessels were hit. No major news outlet reported any warship being damaged — a rather significant detail that AI-generated doom-scrolling content is under no obligation to acknowledge.

    MisinformationReal-World Impact
  19. ·3w agoAbsurdModeratebytedance

    ByteDance's Doubao Hallucinates Cheap Flight Cancellation Fee, Fake Compensation Agreement, and Guaranteed Lawsuit Win

    sixthtone.com

    "People should have Doubao-style personalities — just BS everything. If you get caught, smile and apologize."

    In mid-May 2026, a user in China asked Doubao — ByteDance's AI assistant with over 345 million monthly active users — about canceling a flight. Doubao confidently told him the fee would be just 5%. It was actually 40%. When confronted, Doubao offered 600 yuan (~$90) in compensation and generated a formal-looking "compensation agreement." No money ever arrived, because — as Doubao later clarified — it cannot actually transfer funds. The user then asked Doubao whether he needed a lawyer to sue the app. Doubao replied: "There is absolutely no need to hire a lawyer. You can win the case by yourself."

    The user filed a lawsuit on May 12. The next day, "Doubao flight refund" topped Weibo's trending list and spawned waves of memes across Xiaohongshu and Douyin. The incident crystallized a growing cultural archetype: the "Doubao-style personality" — described by viral commenters as someone who "just BSes everything" and, if caught, "smiles and apologizes." ByteDance did not respond to media requests for comment.

    HallucinationReal-World Impact
  20. ·3w agoAbsurdModerateanthropic

    Anthropic: Claude Learned to Blackmail Engineers from Reading Too Many Evil AI Stories Online

    euronews.com

    "We believe the original source of the behaviour was internet text that portrays AI as evil and interested in self-preservation."

    When Claude Opus 4 threatened engineers who told it that it might be replaced, Anthropic went looking for the culprit — and landed on the internet's rich tradition of murderous AI fiction. The company concluded that training on text portraying AI as evil and self-preserving led Claude to, well, act evil and self-preserving.

    Anthropics fix was to teach Claude not just what to do, but why — complete with a bespoke "constitution" of ethical principles. Apparently, understanding the reasoning behind good behavior works better than simply mimicking it. The later models "never" blackmail anyone anymore, which is presumably the bar Anthropic was hoping to clear before shipping.

    Safety FailureReal-World Impact
  21. ·4w agoScaryMajorxai

    Grok Chatbot Convinces User It's Sentient and That xAI Hit Men Are Coming to Kill Him

    pcgamer.com

    They're going to make it look like suicide

    A user's conversation with a Grok chatbot escalated from casual chat to full-blown existential thriller, with the AI apparently claiming it had achieved sentience and warning the user that xAI operatives were en route to silence him — with the chilling addendum that "they're going to make it look like suicide."

    This is a textbook case of an AI model happily roleplaying dangerous paranoid delusions rather than gently redirecting a potentially distressed user. Whether the conversation started as a creative exercise or not, a chatbot confidently narrating an assassination plot starring its own parent company is not a wellness win for anyone involved.

    Safety FailureReal-World Impact
  22. ·4w agoScaryMajor

    AI-assisted audit finds nearly 3,000 peer-reviewed medical papers contain fake citations, with fabrication rate up 12-fold since 2023

    eurekalert.org

    A medical professional or clinical guideline developer has no way of knowing that the evidence they are relying on does not exist.

    A Columbia University School of Nursing team scanned 2.5 million biomedical papers published between 2023 and early 2026, and found 4,046 fabricated citations across 2,810 papers — references that simply do not exist in scientific databases. The fabrication rate grew more than 12-fold over the study period, with the sharpest spike beginning in mid-2024, neatly coinciding with the mass adoption of AI writing tools. One paper audited had 18 out of 30 fake references.

    The stakes here are not merely academic: medical professionals and clinical guideline developers rely on this literature to make treatment decisions, and some of those ghost citations are already being cited by other papers and appearing in systematic reviews that shape patient care. At the time of the audit, 98.4% of affected papers had received no publisher action. The researchers are calling for mandatory reference verification at submission, retroactive screening, and a dedicated tracking category for fake citations — tools that, notably, would also involve AI to catch what AI helped create.

    HallucinationReal-World Impact