Back to feed
Reddit r/LocalLLaMA·

260K-param LLM running on an emulated 90s CPU inside an 18-year-old RTOS

Signal
75
Hype
45
In three linesDeveloper ran a 260K-param LLM (llama2.c/stories260K) on a JavaScript emulator of a 1990s Motorola 68K CPU, itself running inside a 2008 RTOS. INT8 quantization + lookup tables for RoPE and inverse square root (Quake) to bypass missing FPU. Generation: 2-4 seconds/token.
Read source
Your take?
LlamaCode generationFine-tuningOpen source

Summary generated by Claude — human-verified