Page 50 of 144

AllHigh signalRecent

5740 articles

Mellum 2 12B A2.5B

JetBrains releases Mellum 2, a coding-focused 12B/2.5B MoE. Reasoning performance matches Qwen 3.5 9B, underperforms Qwen 3.5 4B on general tasks. Technical report published.

Code generation Open source Benchmarks

SIG

HYP

Reddit r/LocalLLaMA·4d ago

unsloth vs bartowski MTP ggufs

Comparative benchmark of MTP (Multi-Token Prediction) quantizations between unsloth and bartowski on Qwen 3.5-4B, 3.5-9B, and 3.6-27B. Bartowski uses Q8_0 for MTP head (larger files). Tests for Snapdragon with Q4_0, IQ4_NL, Q4_1, MXFP4_MOE, Q8_0 limited to 24GB VRAM RTX 3090. Unsloth generally faster in decoding throughput and VRAM efficient.

Qwen Benchmarks Code generation

SIG

HYP

Reddit r/LocalLLaMA·4d ago

I was a Data Scientist for 10 years before becoming a quadriplegic. For the past 3 months, I built VibeETL from scratch: A lightning-fast, visual Alteryx alternative powered by Polars & React Flow.

VibeETL: open-source visual ETL platform built in 3 months by former data scientist. Polars + Rust backend, React Flow frontend with native BFS layout algorithm. Zero external dependencies, sandboxed Python execution (30s timeout). Lightweight Alteryx alternative.

Open source Tools Infrastructure

SIG

HYP

arXiv cs.LG·4d ago

Universal Multiclass Transductive Online Learning

Theoretical paper on universal multiclass transductive online learning with unbounded label space. Characterizes learnability: only two possible optimal rates (bounded or logarithmic). Introduces LCLL tree combinatorial structure and extends results to agnostic and stochastic settings.

Papers Evals

SIG

HYP

arXiv cs.LG·4d ago

Calibrated Preference Learning: The Case of Label Ranking

Formal study of calibration for probabilistic label ranking. Authors define a hierarchy of notions (full rankings, sub-rankings, top-k) and show popular models are poorly calibrated. Application to RLHF reward models reveals calibration and accuracy are not perfectly correlated.

Reinforcement learning Evals Benchmarks

SIG

HYP

arXiv cs.LG·4d ago

Functional MRI Time Series Generation via Wavelet-Based Image Transform and Spectral Flow Matching for Brain Disorder Identification

DSFM (Dual-Spectral Flow Matching) generates synthetic fMRI time series by combining discrete wavelet transform (DWT) and discrete cosine transform (DCT) with spectral flow matching. The model captures non-stationarity and spatiotemporal dynamics of BOLD signals to improve brain network classification.

Papers Benchmarks Vision

SIG

HYP

arXiv cs.LG·4d ago

Unicorn: Scaling High-Dimensional Time Series Forecasting via Universal Correlation Modeling

Unicorn, a multi-dataset pretraining framework, bridges the trade-off between channel-independent models (scalable but ignoring dependencies) and channel-dependent models (expressive but dimension-bounded). Using a latent prototype codebook, it projects heterogeneous channels into a shared space to learn identity-agnostic, reusable correlation patterns transferable across domains.

Papers Benchmarks Fine-tuning

SIG

HYP

arXiv cs.LG·4d ago

A Novel Evaluation Metric for Unsupervised Learning in AIS-Based Maritime Anomaly Detection: MADQI

Novel MADQI evaluation metric for unsupervised anomaly detection in maritime AIS datasets. Combines four indices (ARC, PPS, SDS, ECE) through automatic normalization. Achieves MADQI score of 80.37% on AIS data, with ECE=0.907 and ARC=1.000 for detecting abnormal vessel behavior.

Evals Benchmarks

SIG

HYP

Page 50 of 144

Mellum 2 12B A2.5B

unsloth vs bartowski MTP ggufs

I was a Data Scientist for 10 years before becoming a quadriplegic. For the past 3 months, I built VibeETL from scratch: A lightning-fast, visual Alteryx alternative powered by Polars & React Flow.

Universal Multiclass Transductive Online Learning

Calibrated Preference Learning: The Case of Label Ranking

Functional MRI Time Series Generation via Wavelet-Based Image Transform and Spectral Flow Matching for Brain Disorder Identification

Unicorn: Scaling High-Dimensional Time Series Forecasting via Universal Correlation Modeling

A Novel Evaluation Metric for Unsupervised Learning in AIS-Based Maritime Anomaly Detection: MADQI

Formalizing and falsifying causal pathways of rare events

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

XOResNet: Exclusive-OR Meta-Residuals Facilitate Deep Spiking Neural Networks Learning

Uncertainty-Aware and Temporally Regulated Expert Advice in Reinforcement Learning for Autonomous Driving

Structure-Induced Information for Rerooting Levin Tree Search

Healthcare Mechanisms from Policy-as-Code Search under Strategic Provider Response

Learning Agent-Compatible Context Management for Long-Horizon Tasks

Gait2Hip-60: A Unified Deep Learning Benchmark for Predicting Hip Muscle Forces and Joint Moments from Multi-Cadence Gait Kinematics

Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration

Gradient-Free Training of Spiking Neural Networks via Low-Rank Evolution Strategies

Semantic Motion Anchors: Bridging Motion and Meaning in Co-Speech Gestures

Efficient Diffusion LLMs via Temporal-Spatial Parallel Decoding and Confidence Extrapolation

Bounded Behavioral Indistinguishability for Black-Box LLM Distillation

A Unified Framework for Gradient Aggregation in Multi-Objective Optimization

When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL

Early Prediction of Future Behavioral Strategy from Process Traces

Benchmarking Machine Learning Uncertainty Quantification Methodologies for Predicting Turbine Gas Temperature Degradation

Diagnosing Failure Modes of Shared-State Collaboration in Resource-Constrained Visual Agents

XLGoBench: Detecting cross-lingual skill gaps with algorithmic tasks

Scientific Machine Learning for Engine Health Management and Remaining Useful Life Prediction

Pairwise Reference Alignment as a Model-Level Ordinal Observable

Human-Alignment, Calibration, and Activation Patterns in Large Language Model Uncertainty

CobSeg: Coherence Boundary Modeling for Dialogue Topic Segmentation

Vector Linking via Cross-Model Local Isometric Consistency

HADT: A Heterogeneous Multi-Agent Differential Transformer for Autonomous Earth Observation Satellite Cluster

A Persona-Based Evaluation Framework for Pluralistic Alignment in Generative AI

COMPASS: Cognitive MCTS-Guided Process Alignment for Safe Search Agents

Counterfactual Graph for Multi-Agent LLM Calibration

AI for Monitoring and Classifying Data Used in Research Literature

Speculative Decoding Across Languages

Knowledge Graph-Enhanced Zero-Shot Topic Classification: A Multi-Strategy Comparative Study

Procedural Generation of First Person Shooter Maps using Map-Elites