Page 57 of 146

AllHigh signalRecent
5835 articles
Reddit r/LocalLLaMA·

Turning every "no thats not what i meant" in chat into actual LoRA training data

A developer built TideForge, a desktop app that converts chat corrections into LoRA training data. Each model reply has a "Teach" button; corrections accumulate as JSONL and trigger PEFT fine-tuning on your base model. Initial test: 110 hand-written corrections on Qwen 0.6B, loss dropped 4.25→0.73, adapter maintained identity across ~30 jailbreak prompts. Free, Windows, GGUF-compatible.

Fine-tuningOpen sourceTools
SIG
72
HYP
35
arXiv cs.LG·

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

PathoFM, an encoder-centric transformer pretrained on clinical time series (pathological gait analysis for spinal cord injury), combines three objectives: Local Completion, Temporal Continuity, and Unsupervised In-Context Dynamics. The study shows that dynamics-centric objectives produce the most balanced transferable representations across classification and regression tasks.

PapersReasoningFine-tuning
SIG
72
HYP
18
arXiv cs.CL·

Evidence Absence Is Not Evidence Insufficiency: Diagnosing NEI Construction Artifacts in Fact Verification

NEI-CAP, a diagnostic protocol to audit the construction of "Not Enough Information" labels in fact verification benchmarks. Researchers show NEI competence does not transfer reliably across constructions: models trained on shortcut-prone evidence conditions fail to recognize semantically related insufficient evidence. Tested on SciFact, FEVER, and HoVer.

BenchmarksEvalsPapers
SIG
72
HYP
15
arXiv cs.CL·

The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models

Theoretical paper on sequence models' insufficiency when facing unobserved latent states. Authors formalize a mixed-regime process where a perfect predictor becomes overconfident if observed context matches the wrong latent regime. They show the sufficiency gap can only be closed by perfect revelation of latent state or equivalent verification mechanism.

ReasoningAlignmentAI safety
SIG
72
HYP
15
arXiv cs.AI·

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

Theoretical and empirical work on training LLM-based dialogue agents. Identifies context distribution shift as fundamental limitation of Static Context RL and Interactive RL. Proposes Calibrated Interactive RL combining interactive RL with simulator alignment to reduce sim-to-real gap and improve multi-turn dialogue quality.

Reinforcement learningAI AgentsReasoning
SIG
72
HYP
18
arXiv cs.LG·

Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection

CoAD, a novel framework for time series anomaly detection, unifies classification (Outlier Exposure) and reconstruction (Masked Autoencoder) paradigms. The classification module generates probability-informed soft masks for the reconstruction module, addressing generalization and masking misalignment issues. Experiments on standard benchmarks demonstrate significant improvements with faster inference.

BenchmarksPapers
SIG
72
HYP
28