Back to feed
Reddit r/MachineLearning·

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

Signal
82
Hype
25
In three linesNumind releases NuExtract3, a 4B open-weight VLM based on Qwen3.5-4B under Apache-2.0 license. The model extracts structured data from complex documents (PDFs, forms, tables, invoices) to Markdown or JSON. Trained for 3 days on 8xH100, it supports multiple quantizations (GPTQ, W8A8, FP8, Q4, Q6) and runs on 4GB VRAM minimum.
Read source
Your take?
VisionOpen sourceCode generationToolsQwen

Summary generated by Claude — human-verified