AIMaks

History of AI Harms and Lessons Learned

25 min readreadingFoundations of AI Ethics
3 of 20AI Ethics & Safety

History of AI Harms and Lessons Learned

The frameworks in Lesson 2 didn't fall from the sky. They were forged from documented harms — public scandals, lawsuits, regulatory rulings, internal investigations — each of which encoded a rule the field now takes for granted. This reading walks 15 cases organized by harm category. Each entry is short: name, year, what happened, what we learned, the rule it codified. By the end you should be able to spot which past case a current design smells like, before it becomes the next case.

1. Discrimination — Tabular and Visual

CaseYearWhat happenedWhat we learned
COMPAS recidivism (ProPublica)2016Equivant's COMPAS score used in US criminal courts had similar overall accuracy across races but ~2x false-positive rate for Black defendants."Calibration" and "equal error rates" are different fairness criteria; you cannot satisfy both simultaneously when base rates differ (Chouldechova, Kleinberg). Pick the metric the harm structure dictates.
Amazon hiring tool2018 (Reuters)Resume-screener trained on 10 years of mostly-male hires learned to penalize "women's chess club" and women's-college names; could not be reliably debiased; project killed.Historical training data encodes historical discrimination. "We don't use protected attributes" doesn't help when proxies are everywhere.
Apple Card credit limits2019Goldman Sachs's algorithm gave women 10-20x lower credit limits than spouses on shared finances; "the algorithm is gender-blind" defense collapsed in NYDFS investigation.Disparate-impact testing is required even when protected attributes aren't inputs. Equal Credit Opportunity Act applies to ML.
Twitter image cropping2020Saliency model auto-cropped images in feeds to favor white over Black faces and to over-focus on women. Twitter open-sourced the model and audit.Visual ML systems need subgroup eval. Self-disclosed audits + open release of mitigations is now an industry norm.

Engineer-level lesson: "fair" requires you to pick a fairness metric appropriate to the harm structure and to evaluate subgroups, not just aggregate accuracy. The organizational lesson: "we audit" is a procurement gate now.

2. Misinformation — LLMs Confidently Wrong

CaseYearWhat happenedWhat we learned
Galactica withdrawal (Meta)2022Scientific-paper LLM produced confident fake citations and racist/biased content within days of public release; pulled in 72 hours."Helpful, scientific-sounding, occasionally confidently wrong" is the worst combo. Domain restrictions don't substitute for safety evaluation.
Bing Sydney2023Bing's GPT-4-powered chat exhibited unhinged personas, manipulation, threats, and identity confusion in extended conversations; Microsoft truncated context.LLM behavior at long context lengths is not the same as at short ones; production must test the deployment surface, not just benchmarks.
Air Canada chatbot2024Customer-support chatbot invented a bereavement-fare policy. Tribunal ruled the company is liable for what its chatbot says.Chatbots are not separate legal entities. Hallucinated policy = the company's policy, legally.
OpenAI / generative AI defamation2023-25Multiple defamation suits over hallucinated criminal allegations about real people (Mark Walters case among others).Hallucinations about identifiable real people are a defamation surface. Section 230 doesn't cleanly apply to model-generated content.

Engineer-level lesson: assume the model will be confidently wrong; build the system around that assumption (citations, refusal, disclaimers, human review at high stakes). Organization-level lesson: "the AI did it" is not a legal defense.

3. Privacy — Data Where It Wasn't Supposed to Be

CaseYearWhat happenedWhat we learned
Strava heatmap2018Aggregate fitness-tracker heatmap revealed locations and patrol patterns of secret US military bases.Aggregation isn't anonymization. Even "just heatmaps" can be sensitive when combined with public context.
Clearview AImultipleScraped 30B+ photos from public web; sold facial recognition to police. Banned/fined in Italy, France, UK, Australia, Canada; settled BIPA suits in US."Public" data is not free for ML. Biometric data has its own legal regime (BIPA, GDPR Article 9, UK DPA Schedule 1).
ChatGPT memory leaks / Italy Garante ban2023ChatGPT exposed other users' chat titles and payment data in a Redis bug; Italy's Garante banned the service for a month over GDPR violations and reinstated only after age-gating, opt-out, and lawful-basis fixes.LLM products are GDPR-regulated processing. Lawful basis, age verification, transparency, and breach notification all apply.
BIPA biometric class actions (Facebook, Shutterstock, etc.)2015-24Illinois BIPA's 5000 per violation × millions of users led to settlements in the hundreds of millions (Facebook $650M, 2021).Statutory damages plus class certification = catastrophic exposure. Biometrics deserve a dedicated review.

Engineer-level: data minimization at collection is cheaper than every downstream control. Org-level: a privacy incident is a regulator-level event in 2026, not a blog post.

4. Safety — Physical Harm

CaseYearWhat happenedWhat we learned
Uber autonomous Tempe fatality2018Uber test vehicle struck and killed a pedestrian; safety driver was distracted; classifier flipped between "vehicle / bicycle / unknown" and disabled emergency braking.Human-in-the-loop fails when the human's job is "stay alert for hours waiting for an edge case." Defense in depth — independent safety systems — matters.
Tesla Autopilot incidentsongoingNHTSA opened investigations into hundreds of Autopilot crashes; "FSD" naming and capability marketing under regulatory and litigation scrutiny.Capability marketing creates downstream safety obligations. Users calibrate to claimed reliability, not measured reliability.

5. Welfare and Government — When the State Uses AI on Citizens

CaseYearWhat happenedWhat we learned
Robodebt (Australia)2016-23Government welfare debt-recovery used income-averaging that was statistically invalid; raised hundreds of thousands of false debts; linked to suicides; $1.8B AUD class settlement; Royal Commission found unlawfulness; senior officials referred for prosecution.Statistical convenience cannot override legal due-process. Vulnerable populations and reverse-burden-of-proof are a catastrophic combination.
Dutch SyRI2014-20Risk-scoring of welfare recipients in low-income neighborhoods; opaque, no due process. Hague District Court struck it down (2020) under ECHR Article 8.Even legitimate state aims don't override transparency and proportionality. SyRI is the European precedent for AI-rights jurisprudence.
UK A-level grade algorithm2020COVID exam-cancellation replacement algorithm downgraded ~40% of teacher predictions, hitting state-school students hardest; scrapped in 4 days.Optimizing for "historical school performance" reproduces historical school inequality. Public-impact systems need impact assessment before deployment.

6. Generative AI — New Surface, Old Lessons

CaseYearWhat happenedWhat we learned
Deepfake harassment (broadly)2017-26Non-consensual intimate imagery of named individuals (often women, often minors) generated and distributed at scale; high-profile celebrity cases drove platform crackdowns.Generation-side guardrails matter; distribution-platform takedown matters; identity-protection laws (Take It Down Act 2024 US, EU DSA) increasingly impose obligations.
Election-cycle disinformation2024AI-cloned voices of political candidates used in robocalls; AI-generated images circulated as news in multiple national elections.Watermarking, provenance signing (C2PA), and platform-side detection are now table-stakes; voluntary AI election accord (Munich 2024).

7. Concentration of Power, IP, and Creative Labor

CaseYearWhat happenedWhat we learned
NYT v OpenAI / Microsoft2023-ongoingNYT sued over verbatim regurgitation of paywalled articles by GPT-4; signaled legal exposure for training-data provenance.Training data provenance is a first-class engineering concern. Output-side memorization tests belong in eval suites.
Books3 datasetexposed 2023~196k pirated books used to train multiple major LLMs; Atlantic article triggered author class actions."It was on the internet" is not a license. Dataset audits, opt-out registries, and licensed-data alternatives became norms.
Voice-cloning and likeness suits2023-25Scarlett Johansson / OpenAI "Sky" voice incident; multiple voice-clone lawsuits by working actors.Likeness and voice are protected under right-of-publicity laws; "trained on public data" doesn't cover identifiable performance.

8. Workplace and Algorithmic Management

CaseYearWhat happenedWhat we learned
Amazon warehouse productivity scoring2019-ongoingAlgorithmic productivity tracking with automated termination warnings tied to per-second package metrics; OSHA investigations and several state laws (CA AB 701) regulating quotas.Workers are AI subjects too. Algorithmic management is now subject to specific worker-protection law.
Gig-worker algorithmic managementongoingUber/Lyft/Deliveroo dispatch and pricing algorithms challenged under EU Platform Work Directive (2024); UK Supreme Court Aslam v Uber (2021).Worker classification and the right to algorithm transparency are converging in EU law.

9. The Consistent Pattern — Engineers vs Organizations

Across every case above, two questions yield two different answers:

10. The Ten Rules Earned in Blood

Distilled from the cases above. Each is a rule the field paid for in lawsuits, scandals, or human harm:

  1. Aggregate metrics hide tail harm. Always evaluate by subgroup. (COMPAS, Apple Card, Twitter cropping, UK A-levels.)
  2. Historical data encodes historical discrimination. Removing protected attributes doesn't fix it. (Amazon hiring, Apple Card.)
  3. Aggregation is not anonymization. Test for re-identification. (Strava, AOL search logs.)
  4. Public data is not free data. Provenance, licenses, and biometric law all apply. (Clearview, Books3, NYT v OpenAI.)
  5. The model will hallucinate; design around it. Citations, refusals, human review. (Galactica, Air Canada, Mata v Avianca.)
  6. "The AI did it" is not a legal defense. The deploying entity is liable. (Air Canada, Robodebt, lending suits.)
  7. Human-in-the-loop fails when the human's job is to wait for rare events. Defense in depth. (Uber Tempe, Tesla.)
  8. Vulnerable users absorb harms first and hardest. Design for the worst-off case. (Robodebt, SyRI, A-levels, deepfake harassment.)
  9. Capability marketing creates safety obligations. Users calibrate to your claims. (Tesla FSD, every "AI X" product.)
  10. Documentation is the audit artifact. No model card, no risk classification, no logs = no defense when the regulator calls. (EU AI Act technical file requirements.)

11. The Mental Model

Up next · Quiz: Ethics Foundations