In December 2025, Anna’s Archive the world’s largest “shadow library” shook the internet by announcing its latest conquest: a massive, nearly complete backup of Spotify.
Known primarily for preserving books and academic papers, the team at Anna’s Archive has officially expanded their mission of “preserving humanity’s knowledge and culture” into the realm of music. Here is a breakdown of what this backup includes and why it’s a milestone for digital preservation.
The Scale: 300TB of Human Heritage
The numbers behind the “Spotify Scrape” are staggering. The archive isn’t just a list of songs; it’s a meticulously organized database designed for long-term survival:
- 256 Million Tracks: Metadata for 99.9% of all tracks on the platform.
- 86 Million Music Files: Roughly 300 terabytes of audio, covering 99.6% of all listens on the service.
- The Metadata King: It is now the largest publicly available music metadata database in existence, containing 186 million unique ISRCs (for comparison, MusicBrainz has around 5 million).
Why Backup Spotify?
The volunteer behind the post (identified as “ez”) explained that while music is generally “well-preserved” by enthusiasts, existing efforts have three major flaws:
- Popularity Bias: Most archives only focus on top-tier artists, leaving a “long tail” of obscure music at risk of disappearing.
- Storage Bloat: Audiophiles chase lossless FLAC files, which makes archiving everything prohibitively expensive in terms of storage.
- No Central Authority: There was previously no single, open-source list aiming to represent “all music ever produced.”
How the Archive is Structured
The release is divided into two parts to make it manageable for “data hoarders” and preservationists:
- The Metadata (SQLite): A searchable database containing artists, albums, and tracks. It’s even designed to allow a “True Shuffle” something Spotify users have complained about for years allowing you to shuffle every single song in the archive without algorithmic bias.
- The Files: Distributed in bulk torrents via the “Anna’s Archive Containers” (AAC) format.
- High Popularity: Original OGG Vorbis quality (160kbit/s) with metadata added.
- Zero Popularity: Re-encoded to OGG Opus (75kbit/s) to save space while maintaining decent quality, focusing on the millions of songs that get almost no streams.
The “Preservation” Ideology
Anna’s Archive frames this as a safeguard against “natural disasters, wars, budget cuts, and other catastrophes.” By making the archive “fully open,” they ensure that anyone with enough disk space can mirror the entire collection, making it nearly impossible for authorities or corporate entities to delete the history of digital music.
What’s Next?
While the project is currently a torrents-only archive intended for bulk backup, the blog post hints that if there is enough interest, they may eventually add functionality to download individual songs directly from the Anna’s Archive website.
Link: Backing up Spotify - Anna’s Blog
Disclaimer: Anna’s Archive operates in a legal gray area (and is often classified as a piracy site). While this backup is a landmark for digital preservation, it involves significant copyright complexities.