Page 64 of 147

AllHigh signalRecent
5873 articles
arXiv cs.LG·

Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity

Theoretical work on piecewise-stationary low-rank linear contextual bandits with drifting subspaces. Introduces SPSC algorithm combining isotropic probes with windowed projected ridge-UCB, achieving dynamic regret Õ(r√T) instead of Õ(d√T). Characterizes identification boundary for moving subspace recovery and validates on 11 benchmarks (synthetic, MovieLens, clinical, ZOZOTOWN production logs).

Reinforcement learningPapersBenchmarks
SIG
72
HYP
15
arXiv cs.LG·

FBOS-RL: Feedback-Driven Bi-Objective Synergistic Reinforcement Learning

FBOS-RL introduces a feedback-driven bi-objective reinforcement learning framework to improve large-scale model training. The framework combines two mutually reinforcing objectives: Exploitation-oriented Policy Alignment (EPA) and Exploration-oriented Capability Cultivation (ECC). Experiments show FBOS-RL converges faster than GRPO with higher performance ceilings.

Reinforcement learningReasoningPapers
SIG
72
HYP
25
arXiv cs.LG·

GraphDiffMed: Knowledge-Constrained Differential Attention with Pharmacological Graph Priors for Medication Recommendation

GraphDiffMed presents a medication recommendation framework using dual-scale Differential Attention v2 with pharmacological constraints. Tested on MIMIC-III, the model filters noise at intra-visit and inter-visit levels while incorporating drug-drug interactions, outperforming baselines on recommendation quality and safety metrics.

BenchmarksPapersAI safety
SIG
72
HYP
18
GitHub Trending·

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> lance-format /</span> lance

Lance is an open lakehouse format for multimodal AI. Converts from Parquet in 2 lines of code with 100x faster random access, vector indexing, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, PyTorch.

Vector searchEmbeddingsOpen source
SIG
72
HYP
35