Reddit r/LocalLLaMA·27 May 2026

260K-param LLM running on an emulated 90s CPU inside an 18-year-old RTOS

Signal

Hype

In three linesDeveloper ran a 260K-param LLM (llama2.c/stories260K) on a JavaScript emulator of a 1990s Motorola 68K CPU, itself running inside a 2008 RTOS. INT8 quantization + lookup tables for RoPE and inverse square root (Quake) to bypass missing FPU. Generation: 2-4 seconds/token.

Read source

Your take?

Llama Code generation Fine-tuning Open source

Summary generated by Claude — human-verified

260K-param LLM running on an emulated 90s CPU inside an 18-year-old RTOS

Other angles on this story