Page 3 of 136

AllHigh signalRecent
5412 articles
arXiv cs.CL·

Hubness, Not Anisotropy, Drives Cross-Lingual Retrieval Asymmetry in Multilingual Embedding Models

Study on cross-lingual retrieval asymmetry in 5 multilingual models (Gemini, Mistral, OpenAI, Qwen). Analysis of 6,518 idiomatic expressions in English, Bengali, Hindi, Arabic. Finding: hubness (vector concentration) is the dominant causal driver (49.5% dominance share), far exceeding anisotropy. CSLS correction closes 63.5% of reciprocity gap.

EmbeddingsBenchmarksMulti-agent
SIG
82
HYP
15
arXiv cs.LG·

LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs

LLM-AutoSciLab proposes a closed-loop scientific discovery framework coupling hypothesis generation, hypothesis-conditioned experiment selection, and mechanism refinement. Evaluated on ActiveSciBench (57 enzyme-kinetics tasks, 45 gene-regulatory-network tasks), the system achieves 67.6% symbolic accuracy and 2-5x better sample efficiency than competing baselines.

ReasoningAI AgentsBenchmarks
SIG
82
HYP
25
arXiv cs.LG·

FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning

FuRA introduces full-rank parameter-efficient fine-tuning via spectral preconditioning through SVD decomposition. By freezing pretrained singular bases and optimizing only compact cores via block tensor-train factorization, FuRA outperforms full fine-tuning and LoRA on LLaMA-3-8B (+1.37 commonsense reasoning) and VLMs while maintaining LoRA-comparable efficiency.

Fine-tuningLlamaReinforcement learning
SIG
82
HYP
18
Reddit r/LocalLLaMA·

BeeLlama v0.2.0 – major DFlash update. Single RTX 3090: Qwen 3.6 27B up to 164 tps (4.40x), Gemma 4 31B up to 177.8 tps (4.93x). Prompt processing speed near baseline.

BeeLlama v0.2.0 delivers major performance gains with DFlash optimization. On RTX 3090: Qwen 3.6 27B reaches 164 tps (4.40x speedup), Gemma 4 31B 177.8 tps (4.93x). Full Gemma 4 31B support, reduced DFlash overhead, improved prefill handling, stricter draft/target validation.

QwenOpen sourceCode generation
SIG
82
HYP
25
Reddit r/MachineLearning·

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

Numind releases NuExtract3, a 4B open-weight VLM based on Qwen3.5-4B under Apache-2.0 license. The model extracts structured data from complex documents (PDFs, forms, tables, invoices) to Markdown or JSON. Trained for 3 days on 8xH100, it supports multiple quantizations (GPTQ, W8A8, FP8, Q4, Q6) and runs on 4GB VRAM minimum.

VisionOpen sourceCode generation
SIG
82
HYP
25
arXiv cs.LG·

Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction

AI text detectors amplify a pretrained typicality axis rather than construct an AI-vs-human boundary. On RoBERTa-base, raw projection onto centroid(AI)-centroid(HC3) achieves AUROC 0.806-0.944, matching or exceeding fine-tuning. A closed-form Jacobian predictor transfers to 16/16 third-party detectors with oracle-equivalence, reducing FPR by 57% on the OpenAI detector.

EvalsBenchmarksAI safety
SIG
82
HYP
15
arXiv cs.LG·

Chronicle: A Multimodal Foundation Model for Joint Language and Time Series Understanding

Chronicle is a 324M-parameter multimodal foundation model trained from scratch on natural language and time series in a unified architecture. Both modalities share the same transformer blocks and attention mechanisms. It matches Gemma-3-270M on 19 NLU tasks, sets new benchmarks on 24 UCR/UEA datasets, and outperforms supervised fusion baselines on Time-MMD.

BenchmarksPapersReasoning
SIG
82
HYP
25
arXiv cs.CL·

Beyond Semantic Similarity: A Two-Phase Non-Parametric Retrieval Workflow for Corporate Credit Underwriting

Two-phase RAG system for corporate credit analysis: phase 1 combines lexical and dense multilingual retrieval; phase 2 applies adaptive controller and LLM-as-Judge scoring based on analytical utility rather than semantic similarity. On-premise deployment on proprietary multilingual corpus. Production: document review time reduced from hours to 3 minutes across 800+ analysts.

RAGVector searchEmbeddings
SIG
82
HYP
15