Simon Willison

Microsoft's new MAI models

Signal: 78
Hype: 25
In three lines: Microsoft announces MAI-Thinking-1 (35B, reasoning) and MAI-Code-1-Flash (5B, code). The former outperforms Claude Sonnet 4.6 in blind human evaluation. Both trained on commercially licensed data without third-party distillation.
Code generation, Reasoning, Benchmarks

Summary generated by Claude — human-verified