home›LLMs›

Inside Talkie: The 13B LM Trained Only on Pre-1931 Text

Exploring the motivations, training data, capabilities, and community reactions to a language model that only knows the world before 1931

May 21, 2026

#Agents #Content Generation #LLM #Open Source #Training

Talkie is a 13B-parameter language model trained exclusively on 260 billion tokens of text published before 1931. Built by Nick Levine, Alec Radford, and David Duvenaud to study AI generalization, it sparks discussion on historical perspective and anachronistic outputs. This deep dive covers data sources, processing, limitations, and public release plans.

A Language Model Trapped in Time

On April 27, 2026, a small team of researchers released a language model unlike any other.

Named Talkie, the 13-billion-parameter transformer was trained exclusively on text published before 1931.

The announcement, posted to X (formerly Twitter) at 11:34 PM by co-author Nick Levine (@status_effects), immediately drew attention.

Within 24 hours, the tweet had accumulated over 1.1 million views, along with 3,000 likes, 543 replies, and 1,600 bookmarks.

The model was developed by Nick Levine, Alec Radford (@AlecRad), and David Duvenaud (@DavidDuvenaud).

Their stated goal was not to build a practical assistant, but to answer a fundamental research question: how do language models generalize beyond their training distribution?

If a model has never seen a line of code, can it be taught to write Python? If it only knows the world as it was described in 1930, can it reason about the present?

Talkie was built to find out.

The Archive: Building a Pre-1931 Corpus

Training a 13-billion-parameter model requires enormous quantities of text.

For Talkie, the team assembled approximately 260 billion tokens of pre-1931 material.

This figure was raised by user will depue (@willdepue) in the announcement thread and was not disputed by the authors.

The primary data sources were the Institutional Data Initiative and the Internet Archive, supplemented by a "bunch of other sources."

Crucially, the corpus was not a random sample of historical text.

It included dozens of historical reference books with regular, structured formats:

Encyclopedias capturing the sum of early-20th-century knowledge
Etiquette manuals codifying social norms of the era
Letter writing manuals preserving formal communication styles
Cookbooks documenting domestic life and available ingredients

To extract and structure this data, the team wrote custom parsers using Claude Code.

These parsers transformed scanned and digitized pages into clean, machine-readable text suitable for language model training.

The authors have committed to publishing more details about the corpus in the future.

Where possible, they plan to release the data itself, or at minimum the scripts required to reconstruct the dataset from public sources.

A surreal, dimly lit library where towering shelves of antique books dissolve into pixelated data streams, their pages fluttering like autumn leaves. In the center, a translucent human silhouette composed of overlapping text fragments—encyclopedias, etiquette manuals, cookbooks—holds a quill that drips ink into a glowing, modern computer terminal. The ink swirls into a mirror reflecting a foggy, anachronistic cityscape: horse-drawn carriages next to neon signs, gambling dens overgrown with vines. Soft sepia tones clash with cold blue digital light, evoking a statistical pastiche—a window that is also a mirror, blending eras without resolution.

Capabilities, Limitations, and Anachronistic Behavior

Talkie's responses are shaped entirely by the temporal scope of its training data.

Early users quickly identified both charming and unsettling behaviors.

One user reported that Talkie "is claiming that gambling won't exist by 2020."

This is a classic example of anachronistic inference: the model extrapolated from pre-1931 trends — perhaps a period of anti-gambling sentiment or legal prohibition — and projected that trajectory forward, unaware of the massive expansion of legal gambling in the late 20th and early 21st centuries.

Other users noted more fundamental limitations.

Yasi Khan (@yasmeena_khan) observed that Talkie "seems to think it only covers 1700-1799," suggesting the model lacks the metadata or self-awareness to accurately describe its own training period.

The authors did not address this observation in the thread.

Christiano Boria (@christianoboria) offered a harsher critique: "the answers do not seem like someone from pre 1931."

This raises a subtle but important point. A model trained on text from a period does not necessarily replicate the cognitive style, beliefs, or conversational norms of a person living in that period.

It may produce a kind of statistical pastiche — a blend of voices and perspectives that never existed in any single human mind.

Andrew Gordon Wilson (@andrewgwils) extended this critique to the broader enterprise of LLM interaction:

"I don't feel that talking with LLM trained on current data is like talking with a modern person."

If a model trained on contemporary data does not replicate a contemporary human, what does it mean to talk with a model trained on historical data?

Is it a window into the past, or merely a mirror reflecting the statistical patterns of archived text?

Community Reactions and Practical Applications

Despite the philosophical questions, the community response was overwhelmingly positive.

Users described the project as "cool," "amazing," "insanely good," "wholesome," and "the coolest thing I've seen this month."

The checkpoint was made available for download, and multiple users expressed interest in running it locally.

Phil Trubey indicated his intention to download and run the model. Other users asked for terminal access to experiment with it directly.

The practical applications users envisioned were creative and varied.

One user wanted to ask Talkie: "How's the whole woman voting thing going?" — a question that would be answered very differently depending on whether the model was aware of the 19th Amendment (ratified in 1920) or not.

Ben Nash (@bennash) tagged author Orson Scott Card, noting that the project "sounds like one step closer to Pastwatch" — a reference to Card's science fiction novel Pastwatch: The Redemption of Christopher Columbus, in which researchers use a device to view and potentially alter the past.

The thread also referenced an earlier speculative post from September 30, 2025, by user Deep Thrill (@DeeperThrill).

That post had wondered about training an AI on all knowledge up to 1899 and testing whether it could make creative leaps in physics — specifically regarding the aether and the perihelion of Mercury.

Talkie, trained on data extending 32 years further, represents a partial realization of that thought experiment.

Open Questions and Future Directions

The Talkie project leaves several important questions unanswered.

Jiaxin Wen (@jiaxinwen22) asked a critical technical question: was the instruction data used for fine-tuning generated by prompting LLMs with historical texts?

If so, the model's behavior might reflect not just the historical corpus, but also the biases and framing of the modern LLM used to generate the instruction examples.

The authors did not respond to this question in the thread.