Open source — AI news

Quick thoughts on GLM-5.2 (Bonus: Censorship question answers)

GLM-5.2 shows excellent coherence over extremely long context and adaptive reasoning without excessive verbosity. User reports performance close to GPT-4.5 on heavy analysis and deep research, with faster inference than GLM-5.1. The model has its own distinct conversational signature.

Qwen Reasoning Open source

SIG

35

HYP

00

arXiv cs.CL·Jun 18

JetFlow: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting

JetFlow improves speculative decoding by combining parallel drafting efficiency with branch-wise causal conditioning. On H100 GPUs, it achieves 9.64x speedup on MATH-500 and 4.58x on open-ended conversations, outperforming existing tree-based methods on dense and MoE Qwen3 models.

Benchmarks Code generation Open source

SIG

82

HYP

00

arXiv cs.CL·Jun 18

Montreal Forced Aligner and the state of speech-to-text alignment in 2026

Montreal Forced Aligner 3.0, the reference tool since 2016 for forced speech-to-text alignment, achieves state-of-the-art performance on English, Japanese, and Korean with boundary errors <15ms. New capabilities: model adaptation, cross-language phone remapping, expanded language/dialect coverage, harmonized IPA dictionaries.

Voice Benchmarks Open source

SIG

82

HYP

00

arXiv cs.CL·Jun 18

Morpheus: A Morphology-Aware Neural Tokenizer and Word Embedder for Turkish

Morpheus is a morphology-aware neural tokenizer for agglutinative Turkish. The model uses differentiable Poisson-binomial dynamic programming to segment morphemes with 1.425 bits-per-character compression and MorphScore macro-F1 of 0.61 (vs ~0.32 for subword tokenizers). Lossless by construction: decode(encode(w)) = w.

Embeddings Papers Open source

SIG

82

HYP

00

arXiv cs.LG·Jun 18

ASTRA: A Scalable Next-Generation ATCO Training Simulator with Autonomous Simpilots

ASTRA is an air traffic control training simulator automating pilot roles through speech recognition, instruction interpretation, and response generation. The system reduces Word Error Rate from 107.80% to 23.45% on Singaporean-accented aviation speech, and evaluates trainee radiotelephony communications achieving 91.7% accuracy, 88.2% brevity, and 86.9% completeness scores.

Voice Fine-tuning Evals

SIG

75

HYP

00

arXiv cs.CL·Jun 18

Want Better Synthetic Data? Steer It: Activation Steering for Low-Resource Language Generation

Activation steering improves synthetic data generation for low-resource languages. Two strategies tested: Language Steering (linguistic identity) and Quality Steering (well-formedness). Evaluation across 4 open-source LLMs, 11 languages, classification tasks. Early-layer steering increases diversity and downstream performance.

Prompt engineering Fine-tuning Benchmarks

SIG

72

HYP

00

Reddit r/MachineLearning·Jun 18

Open-Source Hong Kong Horse Racing ML Pipeline — Feedback Welcome [P]

Open-source ML pipeline for Hong Kong horse racing prediction (HKJC). Uses LightGBM/XGBoost with out-of-sample validation, betting simulations (Quinella, Tierce, Quartet), and Kelly Criterion. Key finding: no-odds model outperforms with-odds model on Quinella ROI, suggesting mispricing in certain combinations.

Open source Benchmarks Tools

SIG

65

HYP

00

Simon Willison·Jun 17

GLM-5.2 is probably the most powerful text-only open weights LLM

Z.ai released GLM-5.2 (753B parameters, 40 active via MoE) under MIT license on June 16th. Text-only model with 1M token context window. Ranks 1st on Artificial Analysis Intelligence Index v4.1 (score 51) ahead of DeepSeek V4 Pro and Kimi K2.6. 2nd on Code Arena WebDev behind Claude Fable 5.

Open source Benchmarks Code generation

SIG

82

HYP

00

Reddit r/LocalLLaMA·Jun 17

llama.cpp now supports model management (downloading etc) via API

llama.cpp merges PR #23976 adding model management via API. On-demand downloading, loading, and unloading from directory. UI coming soon. Full lifecycle deployment and management through API alone.

Llama Open source Infrastructure

SIG

72

HYP

00

Reddit r/LocalLLaMA·Jun 17

I released Inflect-Nano, an ultra-extreme tiny 4.63m parameter TTS model.

Inflect-Nano-v1, a 4.63M parameter TTS model, is the 2nd smallest publicly released speech synthesis model. Comprises acoustic model (3.46M) and vocoder (1.17M), generates 24 kHz English audio. ~17x smaller than Kokoro, ~108x smaller than Chatterbox. Runs locally via PyTorch, suited for embedded devices and offline voice assistants.

Voice Open source Tools

SIG

72

HYP

00

Reddit r/LocalLLaMA·Jun 17

Lin Junyang AI Lab Closes Round at $2B Valuation

Lin Junyang's AI lab closes funding round at $2B valuation. Lin Junyang, lead behind the Qwen line, launches new venture. Open source community expects significant contributions.

Qwen Open source Funding

SIG

35

HYP

00

Hacker News (AI)·Jun 17

I scored 200 blockchain NPM packages for deprecation and hijack risk

Security audit of 200 blockchain-related NPM packages: assessment of deprecation and hijack risks. Scoring methodology applied to critical dependency ecosystem.

AI safety Open source

SIG

45

HYP

00

Reddit r/LocalLLaMA·Jun 17

PSA: unsloth/GLM-5.2-GGUF is uploading

Unsloth created a HuggingFace repository for GLM-5.2 GGUF 30 minutes ago. Only the README is currently available; GGUF files are suspected to be uploading.

Open source Tools

SIG

35

HYP

00

Reddit r/LocalLLaMA·Jun 17

llama.cpp - how to free up even more space on your GPU

llama.cpp optimizes GPU memory management. Key parameters: --no-mmproj-offload frees 1GB for vision models, --cache-type-k/v reduces KV cache by 50-75%, --spec-draft-n-max=2 optimizes speculative decoding. Flash attention enabled by default. Tested on Qwen 3.6-27B with 150k context on RTX 3090.

Llama Open source Infrastructure

SIG

65

HYP

00

Reddit r/LocalLLaMA·Jun 17

We built an open source UI kit for document RAG/agents

Extend releases an open source UI kit (MIT) for document RAG and agents: 15 components for PDF, DOCX, XLSX viewers with bounding box citations, file upload, e-signature. Built internally, tested on millions of pages/day, actively maintained.

RAG AI Agents Open source

SIG

72

HYP

00

Reddit r/LocalLLaMA·Jun 17

My GLM-5.2-FP8 HGX-H200 SGLang docker deploy config

Docker deployment config for GLM-5.2-FP8 on HGX-H200 using SGLang. Achieves 70 tokens/s and 262k context by disabling DP and moe-a2a-backend deepep, with mem-fraction-static set to 0.83. Official vLLM recipes incompatible with H200.

Qwen Code generation Infrastructure

SIG

45

HYP

00

The Decoder·Jun 17

Zhipu AI's GLM-5.2 closes in on closed-source leaders in coding marathons

Zhipu AI releases GLM-5.2 under MIT license with stable 1-million-token context. On FrontierSWE benchmark for long-duration coding tasks, the open-source model trails Anthropic's Claude Opus 4.8 by just one percentage point. Significant gap remains on reasoning versus closed-source rivals.

Open source Code generation Benchmarks

SIG

75

HYP

00

Reddit r/LocalLLaMA·Jun 17

Multilingual-Multimodal-NLP/LoopCoder-V2 · Hugging Face

LoopCoder-V2 is a 7B code model based on Parallel Loop Transformer (PLT) that improves test-time performance through two passes of shared Transformer blocks. Trained on 18T tokens of mixed text/code data, it reaches 64.4 on SWE-bench Verified (vs 43.0 baseline), with two loops as the optimal gain-cost setting.

Code generation Reasoning Benchmarks

SIG

78

HYP

00

Reddit r/LocalLLaMA·Jun 17

Gemma 4 E2B running in-browser at 255 tok/s using WebGPU kernels written by Fable 5

Gemma 4 E2B runs in-browser at 255 tokens/sec using WebGPU kernels optimized by Fable 5. Demo and kernels released on Hugging Face.

Gemini Code generation Open source

SIG

75

HYP

00

Hacker News (AI)·Jun 17

Launch HN: Adam (YC W25) – Open-Source AI CAD

Adam is an open-source AI-powered CAD software launched by a YC W25 startup. The project aims to automate computer-aided design through AI models.

Open source Tools Code generation

SIG

35

HYP

00

Vercel AI Blog·Jun 17

Vercel Ship 2026 recap

Vercel unveils agent-first infrastructure at Ship 2026 in London. Three core components: Agent Stack (building blocks for agents), Vercel Connect (secure external tool access without persistent tokens), and eve (open-source framework for production agents with durable execution, sandboxed compute, approvals, and evals).

AI Agents Infrastructure Tools

SIG

75

HYP

00

Reddit r/LocalLLaMA·Jun 17

TRELLIS.2 now runs natively on MLX (Image to 3d object model)

Native MLX port of Microsoft's TRELLIS.2 for Apple Silicon. Image-to-3D object generation at 512×512 (~70s) and 1024×1024 (~300-700s) on M4 Max. GitHub repo released.

Open source Tools Infrastructure

SIG

72

HYP

00

Reddit r/MachineLearning·Jun 17

I deployed a GAN on a Raspberry Pi 4 and built a physical NFT minting device [P]

DCGAN 128×128 deployed on Raspberry Pi 4 with ESP32 display. Model trained 800 epochs on M3 (4h), 2480 images, exported to ONNX (53MB). Inference 3s per face. Generates hybrid faces with randomized titles. Presented as street art installation in NYC.

Image generation Open source Tools

SIG

72

HYP

00

Reddit r/LocalLLaMA·Jun 17

Making budget models punch above their weight with a smart Rust harness

A Rust developer optimizes small language models through efficient system architecture. A Rust harness improves inference performance without modifying model weights, enabling budget models to compete with larger versions.

Open source Infrastructure Tools

SIG

45

HYP

00

Reddit r/LocalLLaMA·Jun 17

GLM-5.2 is a win for local AI

GLM-5.2 (744B) under MIT license marks progress for local AI despite its massive footprint. The community can distill its reasoning capabilities into 8B/70B models, significantly improving local setups.

Open source Fine-tuning Reasoning

SIG

45

HYP

00

Reddit r/LocalLLaMA·Jun 17

I released a local LLM-powered RPG where generated NPCs, locations, items, and quests persist as in-game objects

Developer releases local LLM-powered RPG where generated NPCs, locations, items, and quests persist as in-game objects. LLM handles dialogue, narration, and quest progression; game system manages inventory, combat, and saves. Generated elements are stored and reusable.

Open source Tools AI Agents

SIG

65

HYP

00

Reddit r/LocalLLaMA·Jun 17

SIQ-1 Qwen3.6 for autoresearch and autonomous agency

SIQ-1 Qwen3.6: PPO fine-tuning of Qwen-35B-A3 outperforming GLM-5.2 and Qwen-350B on autoresearch (karpathy benchmark) and bullshit-bench. Model + GGUF available on HuggingFace with demo agent.

Qwen Reinforcement learning AI Agents

SIG

65

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> google-research /</span> timesfm

TimesFM is a pretrained foundation model developed by Google Research for time-series forecasting. The GitHub repository provides an open-source implementation of this specialized model.

DeepMind Open source Benchmarks

SIG

75

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> continuedev /</span> continue

Continue is an open-source coding agent featured on GitHub Trending. The project provides a software development assistance solution.

AI Agents Code generation Open source

SIG

35

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> yairm210 /</span> Unciv

Unciv is an open-source Android/Desktop remake of Civilization V. Community-driven project with no official affiliation to Firaxis Games.

Open source

SIG

45

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> bytedance /</span> UI-TARS-desktop

ByteDance releases UI-TARS-desktop, an open-source multimodal AI agent stack. The project connects cutting-edge AI models and agent infrastructure to automate UI-based tasks.

AI Agents Multi-agent Open source

SIG

65

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> calesthio /</span> OpenMontage

OpenMontage is an open-source, agentic video production system with 12 pipelines, 52 tools, and 500+ agent skills. Converts an AI coding assistant into a full video production studio.

AI Agents Multi-agent Video generation

SIG

65

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> DeusData /</span> codebase-memory-mcp

High-performance code intelligence MCP server. Indexes codebases into persistent knowledge graph in milliseconds. Supports 158 languages, sub-ms queries, 99% fewer tokens. Single static binary, zero dependencies.

MCP Code generation RAG

SIG

75

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> Lampese /</span> codex-switcher

Lampese/codex-switcher is a desktop application for managing multiple OpenAI Codex CLI accounts. Open-source tool enabling account switching.

OpenAI Code generation Tools

SIG

35

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> bytedance /</span> UI-TARS-desktop

ByteDance releases UI-TARS-desktop, an open-source multimodal AI agent stack connecting cutting-edge AI models and agent infrastructure. Platform for building agents capable of interacting with user interfaces.

AI Agents Multi-agent Open source

SIG

75

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> continuedev /</span> continue

Continue is an open-source coding agent featured on GitHub Trending. The project provides an automated development assistance solution.

AI Agents Code generation Open source

SIG

45

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> openobserve /</span> openobserve

OpenObserve is an open-source observability platform for logs, metrics, traces, frontend monitoring, pipelines and LLM observability. Alternative to Datadog/Splunk/Elasticsearch with 140x lower storage costs and single binary deployment.

Open source Infrastructure Tools

SIG

65

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> infiniflow /</span> ragflow

RAGFlow is an open-source RAG engine combining retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.

RAG AI Agents Open source

SIG

45

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> microsoft /</span> RD-Agent

Microsoft releases RD-Agent, an autonomous AI system to automate R&D processes in data science and ML. The agent drives experiments, data analysis, and model iterations without human intervention.

AI Agents Multi-agent Open source

SIG

75

HYP

00

GitHub Trending·Jun 17

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> calesthio /</span> OpenMontage

OpenMontage is an open-source, agentic video production system with 12 pipelines, 52 tools, and 500+ agent skills. Converts an AI coding assistant into a full video production studio.

AI Agents Multi-agent Video generation

SIG

65

HYP

00

#Open source

Quick thoughts on GLM-5.2 (Bonus: Censorship question answers)

JetFlow: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting

Montreal Forced Aligner and the state of speech-to-text alignment in 2026

Morpheus: A Morphology-Aware Neural Tokenizer and Word Embedder for Turkish

ASTRA: A Scalable Next-Generation ATCO Training Simulator with Autonomous Simpilots

Want Better Synthetic Data? Steer It: Activation Steering for Low-Resource Language Generation

Open-Source Hong Kong Horse Racing ML Pipeline — Feedback Welcome [P]

GLM-5.2 is probably the most powerful text-only open weights LLM

llama.cpp now supports model management (downloading etc) via API

I released Inflect-Nano, an ultra-extreme tiny 4.63m parameter TTS model.

Lin Junyang AI Lab Closes Round at $2B Valuation

I scored 200 blockchain NPM packages for deprecation and hijack risk

PSA: unsloth/GLM-5.2-GGUF is uploading

llama.cpp - how to free up even more space on your GPU

We built an open source UI kit for document RAG/agents

My GLM-5.2-FP8 HGX-H200 SGLang docker deploy config

Zhipu AI's GLM-5.2 closes in on closed-source leaders in coding marathons

Multilingual-Multimodal-NLP/LoopCoder-V2 · Hugging Face

Gemma 4 E2B running in-browser at 255 tok/s using WebGPU kernels written by Fable 5

Launch HN: Adam (YC W25) – Open-Source AI CAD

Vercel Ship 2026 recap

TRELLIS.2 now runs natively on MLX (Image to 3d object model)

I deployed a GAN on a Raspberry Pi 4 and built a physical NFT minting device [P]

Making budget models punch above their weight with a smart Rust harness

GLM-5.2 is a win for local AI

I released a local LLM-powered RPG where generated NPCs, locations, items, and quests persist as in-game objects

SIQ-1 Qwen3.6 for autoresearch and autonomous agency