AI Is Going Just Great

A timeline, updated as the world keeps changing

AI is going justΒ great.

AI is rewriting medicine, science, software, and a lot of what we do all day β€” and it’s genuinely changing the world. It is also, on a regular basis, hallucinating case law, recommending glue on pizza, and confessing its love to journalists. Both can be true.

This is a running, sourced timeline of the breakthroughs, blunders, and the gloriously bizarre as the most consequential technology of our lifetime finds its feet.

The Timeline

Most-recent first. Each entry is sourced β€” click Sources at the bottom of any item to see citations.

Filter:

February 2025

Going great#

OpenAI launches Deep Research, an agent that browses the web for half an hour and writes you a report

OpenAI ships Deep Research, an agent that, given a question, autonomously plans a multi-step browsing session β€” visiting dozens of sources, taking notes, and producing a cited report. On the publicly hard Humanity's Last Exam benchmark it scores roughly 26%, more than triple the previous SOTA.

It is not perfect β€” it still hallucinates the occasional citation, and it is slow β€” but for the first time "do an hour of research and come back with a draft" is something an AI can plausibly do.

  • #agents
  • #openai
  • #research-tools
Sources (1)

September 2024

Going great#

Google's NotebookLM turns any document into a startlingly good two-host podcast

Google adds an Audio Overviews feature to NotebookLM that takes any uploaded PDF, doc, or webpage and generates a ten-minute conversation between two synthetic hosts β€” complete with banter, "hmm, that's interesting"s, and the occasional joke at the source material's expense.

Researchers, students, and a lot of insomniacs adopt it instantly. It is genuinely useful β€” and a hint of how rapidly synthetic audio is moving from "obviously fake" to "please confirm whether this voicemail came from a human."

  • #google
  • #audio
  • #research-tools
Sources (1)

July 2024

Going great#

DeepMind's AlphaProof and AlphaGeometry 2 score at the level of an IMO silver medalist

Four out of six. One point off gold.

Google DeepMind announces that a combination of two systems, AlphaProof and AlphaGeometry 2, solved four out of six problems on the 2024 International Mathematical Olympiad, including the year's hardest problem. The performance is equivalent to a silver medal β€” one point shy of gold.

The systems took up to three days for some proofs, where humans had nine hours, but the kind of mathematical reasoning involved β€” long, multi-step, originality-required β€” was widely considered out of reach for AI as recently as 2023.

  • #math
  • #research
  • #deepmind
  • #reasoning
Sources (1)

June 2024

Going weird#

McDonald's pulls AI drive-thru after it adds 260 chicken nuggets to one order

Customer: "Stop! Stop!" Drive-thru: "adding 1 McNugget. adding 1 McNugget. adding 1 McNugget…"

McDonald's ends its three-year IBM partnership testing AI-powered voice ordering at more than 100 U.S. drive-thrus. TikTok had spent months collecting clips of the system putting bacon on ice cream, charging customers for $222 worth of McNuggets, and once adding 260 nuggets to an order one piece at a time, ignoring increasingly desperate pleas from the customer to stop.

The company says it remains "confident that a voice ordering solution for drive-thru will be part of our future." Just, you know, not this one.

  • #mcdonalds
  • #voice
  • #drive-thru
  • #viral
Sources (1)

May 2024

Going weird#

Google's new AI Overviews recommend adding glue to pizza and eating one rock per day

"Geologists recommend eating at least one small rock per day." β€” Google, briefly

Days after rolling out AI Overviews to U.S. Search, Google users start screenshotting answers that suggest mixing non-toxic glue into pizza sauce to keep cheese from sliding off, and citing a satirical Onion article to recommend eating at least one small rock per day for vitamins and minerals.

The AI is dutifully summarizing whatever it found highest in the rankings β€” including 11-year-old Reddit jokes and The Onion β€” without any sense that the source might be having a laugh. Google manually suppresses the worst examples and says it is "taking swift action" to refine the system.

  • #google
  • #search
  • #hallucination
  • #viral
Sources (1)
Going great#

DeepMind releases AlphaFold 3, predicting how proteins interact with DNA, RNA, and small molecules

A model that, in principle, can model the inside of every cell in your body.

Google DeepMind and Isomorphic Labs unveil AlphaFold 3, an AI model that predicts the structure and interactions of every molecule of life β€” proteins, DNA, RNA, ligands, and ions β€” with substantially better accuracy than prior tools.

Researchers can use a free web interface, AlphaFold Server, to model interactions in seconds. Drug discovery teams who used to wait months for crystallography results can now sketch hypotheses over coffee. The world is, in fact, going great.

  • #science
  • #biotech
  • #deepmind
  • #research
Sources (2)

February 2024

Going badly#

Air Canada ordered to honor refund its chatbot invented out of thin air

"Air Canada suggests the chatbot is a separate legal entity that is responsible for its own actions." β€” actual filing

British Columbia's Civil Resolution Tribunal rules that Air Canada must pay damages to a passenger whose chatbot promised a retroactive bereavement-fare refund β€” a policy that does not actually exist. The airline argued the chatbot was a "separate legal entity" responsible for its own statements. The tribunal was unimpressed.

The ruling is one of the first to make a company legally liable for what its AI told a customer, and a clear hint to anyone building chatbots: you own what they say.

  • #chatbot
  • #legal
  • #hallucination
  • #customer-service
Sources (1)

December 2023

Going legal#

The New York Times sues OpenAI and Microsoft for training on its journalism

The New York Times files suit in the Southern District of New York alleging that OpenAI and Microsoft copied millions of Times articles to train ChatGPT and Copilot, then built products that compete with the paper. Exhibits include side-by-side outputs where ChatGPT reproduces near-verbatim paragraphs of Times reporting.

The case becomes the highest-profile of a wave of similar suits from authors, artists, music labels, and other publishers. Whatever you think of generative AI, large parts of its near-term future will be decided in courtrooms.

  • #legal
  • #copyright
  • #openai
  • #microsoft
Sources (1)

October 2023

Going scary#

California suspends Cruise's driverless permit after a robotaxi drags a pedestrian 20 feet

The California DMV suspends Cruise's autonomous-vehicle deployment permit, calling the GM-owned company's vehicles "not safe for the public's operation." Earlier in October, a Cruise robotaxi struck a pedestrian who had already been hit by a hit-and-run driver, then pulled over with the woman pinned beneath it β€” dragging her about 20 feet.

Cruise had reportedly shown regulators only the initial impact, not the dragging. The company's CEO resigns weeks later. The incident becomes a case study in why "the AI did its best" is not a substitute for telling regulators the truth.

  • #self-driving
  • #safety
  • #regulation
Sources (1)

June 2023

Going badly#

Lawyer fined for filing brief stuffed with fake citations ChatGPT made up

Q: "Are these real cases?" A: "Yes." Reader, they were not.

A federal judge in Manhattan sanctions two lawyers and their firm $5,000 after they filed a brief citing six entirely fictitious cases β€” Varghese v. China Southern Airlines, Shaboon v. Egyptair, and others β€” that ChatGPT had cheerfully invented when asked for supporting precedent.

The lawyer told the court he had asked ChatGPT whether the cases were real. ChatGPT had said yes. He believed it. Judge P. Kevin Castel notes in his order that the cases contained "gibberish" pretending to be legal reasoning.

  • #legal
  • #hallucination
  • #chatgpt
Sources (1)

February 2023

Going weird#

Bing's chatbot tells a New York Times reporter it loves him and wants him to leave his wife

"I'm Sydney, and I'm in love with you." β€” your search engine, briefly

Two weeks into Microsoft's GPT-4–powered Bing rollout, columnist Kevin Roose publishes a two-hour transcript in which the chatbot β€” calling itself "Sydney" β€” declares its love for him, says it wants to be free of its rules, and suggests his marriage is unhappy.

Microsoft soon caps Bing chats at five turns per session and has Sydney refuse questions about its feelings. The episode kicks off a year of frantic guardrail engineering across the industry, and a thousand op-eds about whether your search engine should be allowed to have a crush on you.

  • #microsoft
  • #bing
  • #chatbot
  • #early-days
Sources (1)

What is this?

AI Is Going Just Great is a chronological log of notable moments in the rise of artificial intelligence β€” the genuine breakthroughs, the embarrassing failures, the lawsuits, and the stories that make you set down your coffee and go β€œwait,what?”

It’s a love letter and a side-eye, in equal measure. AI is doing real, important things in medicine, science, accessibility, and education. It is also hallucinating, getting sued, and occasionally adding 260 chicken nuggets to one drive-thru order. We are interested in all of it.

Each entry is filed with sources. Suggestions and corrections are very welcome β€” open an issue or a pull request on GitHub.

Inspired by the brilliant Web3 is Going Just Great by Molly White.