Back to feed
Reddit r/LocalLLaMA·

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable)

Signal
78
Hype
25
In three linesNumind releases NuExtract3, a 4B open-weight VLM based on Qwen3.5-4B (Apache-2.0 license). The model extracts structured data and converts documents/images to Markdown. Trained for 3 days on 8xH100, it handles PDFs, forms, tables with multiple quantizations (GPTQ, W8A8, FP8, Q4, Q6) for self-hosting from 4GB VRAM.
Read source
Your take?
QwenVisionOpen sourceCode generationRAG

Summary generated by Claude — human-verified