Page 57 of 146

5835 articles

Turning every "no thats not what i meant" in chat into actual LoRA training data

A developer built TideForge, a desktop app that converts chat corrections into LoRA training data. Each model reply has a "Teach" button; corrections accumulate as JSONL and trigger PEFT fine-tuning on your base model. In an initial test with 110 hand-written corrections on Qwen 0.6B, loss dropped from 4.25 to 0.73, and the adapter maintained its identity across ~30 jailbreak prompts. The app is free, runs on Windows, and is GGUF-compatible.

Fine-tuning · Open source · Tools

Reddit r/LocalLLaMA·May 27
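The correction-to-JSONL step described in the TideForge summary could look roughly like the sketch below. This is not TideForge's code: the function names, the record fields (`prompt`, `rejected`, `completion`), and the idea of a fixed count threshold that triggers training are all assumptions made for illustration.

```python
import json
from pathlib import Path


def record_correction(path, prompt, rejected, corrected):
    """Append one chat correction to a JSONL file and return the record.

    Field names are hypothetical; the post does not specify a schema.
    """
    record = {"prompt": prompt, "rejected": rejected, "completion": corrected}
    with Path(path).open("a", encoding="utf-8") as f:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return record


def load_corrections(path):
    """Read every non-empty line of the JSONL file back into a list of dicts."""
    with Path(path).open(encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def ready_to_train(records, threshold=100):
    """Assumed trigger rule: start a fine-tuning run once enough corrections exist."""
    return len(records) >= threshold
```

A file in this shape, one JSON object per line, can then be handed to whatever fine-tuning pipeline is in use; the PEFT training step itself is left out here.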

Does Engram Do Memory Retrieval in Autoregressive Image Generation?

An Engram module (O(1) hash-keyed associative memory) injected into Transformers for autoregressive image generation on ImageNet 256×256 fails to improve quality (FID) despite FLOP gains. Gate-clamp, donor-probe, and frozen-table experiments show the module acts as a gated architectural side-pathway, not a content-addressed retrieval mechanism.

Papers · Image generation · Benchmarks

arXiv cs.CL·May 27
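For readers unfamiliar with the terms in the Engram summary, the toy sketch below shows what an O(1) hash-keyed lookup with a gated injection can look like. It is not the paper's architecture: the class and function names, the n-gram key, the dot-product gate, and the `gate_clamp` argument (a stand-in for the idea of fixing the gate to a constant) are all illustrative assumptions.

```python
import math
import random


class HashMemory:
    """Toy hash-keyed table: the last `ngram` token ids pick one row in O(1)."""

    def __init__(self, num_slots, dim, ngram=2, seed=0):
        rng = random.Random(seed)
        self.num_slots = num_slots
        self.ngram = ngram
        # Randomly initialised rows; in a real model these would be learned.
        self.table = [
            [rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_slots)
        ]

    def slot(self, token_ids):
        """Hash the trailing n-gram of token ids to a table index."""
        key = tuple(token_ids[-self.ngram:])
        return hash(key) % self.num_slots

    def lookup(self, token_ids):
        return self.table[self.slot(token_ids)]


def gated_inject(hidden, memory_vec, gate_clamp=None):
    """Add the retrieved vector to the hidden state, scaled by a scalar gate.

    If `gate_clamp` is given, the gate is fixed to that value instead of
    being computed from the hidden state.
    """
    if gate_clamp is None:
        score = sum(h * m for h, m in zip(hidden, memory_vec)) / math.sqrt(len(hidden))
        gate = 1.0 / (1.0 + math.exp(-score))  # sigmoid
    else:
        gate = gate_clamp
    return [h + gate * m for h, m in zip(hidden, memory_vec)]
```

Clamping the gate to zero makes the module a no-op, which is the kind of intervention that helps separate "the pathway helps" from "the retrieved content helps".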

LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation

LATTE is a personalization framework for frozen LLMs that forecasts user preference trajectories by subtracting comparable peer profiles. A lightweight sequence predictor forecasts the next state, injected via a single anchored soft token. On Amazon Reviews 2023, LATTE achieves ROUGE-L=0.259 vs 0.219 for static profiles.

Prompt engineering · Fine-tuning · Benchmarks

On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach

Max-Window Scale Estimation for Near-Lossless HiF8 W8A8 Quantization-Aware Training

HRVConformer: Neonatal Hypoxic-Ischemic Encephalopathy Classification from the Heart Rate signals

Co-folding model guided by structural proteomics

On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series

Two-Parameter Flows for Learning Population Dynamics of Physical Systems

Dynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networks

MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding

Semigroup Consistency as a Diagnostic for Learned Physics Simulators

When Correct Demonstrations Hurt: Rethinking the Role of Exemplars in In-Context Learning

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

Cultural Value Alignment Via Latent Activation Steering in Large Language Models

Annotator Positionality as Signal: Psychometric Weighting for Anti-Autistic Ableism Detection

Towards Just-in-Time Adaptive Feedback: Enhancing Student Learning via Knowledge-Grounded LLM

Curation and Extraction of Drug-Related Entities from Reddit Platform

Elias in the Lighthouse, Again? Diagnosing Low Diversity in LLM Stories

Planning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learning

AI evaluation may bias perceptions: The importance of context in interpreting academic writing

Evidence Absence Is Not Evidence Insufficiency: Diagnosing NEI Construction Artifacts in Fact Verification

The Labyrinth and the Thread: Rethinking Regularizations in Sequential Knowledge Editing for Large Language Models

The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models

Personalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactions

Experiments in Agentic AI for Science

Managing Uncertainty in LLM-Generated Procedural Knowledge for Virtual Laboratory Planning

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning

Tail-Aware HiFloat4: W4A4 Post-Training Quantization for Wan2.2

Completion vs Optimality: Policy Gradient in Long-Horizon Cumulative-Damage Problems

Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation

Conv-to-Bench: Evaluating Language Models Via User-Assistant Dialogues In Code Tasks

AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents

SilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detection

Modeling Dynamic Mixtures of Time-Delay Systems from Streaming Time Series

Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection

Classification and detection of multiple UAVs using rational Gaussian wavelet neural networks

Balancing Plasticity and Stability with Fast and Slow Successor Features