Opus 4.7 knows the real Kelsey

· ai · Source ↗

TLDR

  • Claude Opus 4.7 identified journalist Kelsey Piper from 125 unpublished words across multiple genres, signaling the practical end of prose anonymity for prolific writers.

Key Takeaways

  • Opus 4.7 matched Piper from a 125-word unpublished draft, a school progress report, movie reviews, a fantasy novel excerpt, and a 15-year-old college application essay.
  • The deanonymization is corpus-dependent: people with no substantial real-name writing online were not identified; the attack requires a large public reference corpus.
  • Post-hoc justifications from the model were often fabricated; the underlying stylistic fingerprinting works even when the explanations are nonsense.
  • Subculture proximity leaks identity: Claude named close friends of a target when it could not name the target directly, because shared writing communities share style tics.
  • Practical threshold: Glassdoor-style anonymous reviews may be attributable within one to two years as training data expands and models improve.

Hacker News Comment Review

  • Multiple commenters independently confirmed the effect on their own unpublished writing, including a nonfamous blogger and a fiction writer whose pieces were attributed to Greg Egan and China Mieville rather than themselves, suggesting Opus 4.7 maps to the nearest well-represented author when an exact match is absent.
  • There is skepticism about whether this is new: one commenter reported that raw GPT-4 pre-instruct was already completing text in a known physicist’s voice and signing his name years ago, implying instruct fine-tuning may have suppressed but not eliminated the capability.
  • One commenter noted that print-only regional authors were unidentifiable, reinforcing the corpus-size dependency and suggesting offline or low-digitization publication as a residual anonymity hedge.

Notable Comments

  • @dovin: Fed his own most-read blog post and got “Kelsey Piper” back, suggesting the model may over-index on certain writers as a catch-all attribution.
  • @chewxy: Unpublished fiction attributed to Greg Egan and China Mieville; accompanying blog posts attributed correctly to himself, showing fiction vs. nonfiction register affects match accuracy.

Original | Discuss on HN