Reddit r/MachineLearning

Repo for implementations of various Transformer Attn mechanisms [P]

Signal: 65
Hype: 15
In three lines: A GitHub repository with implementations of various Transformer attention mechanisms. Originally developed for Small Language Model experiments and benchmarking, it is also applicable to computer vision, vision encoders, RL, and other domains. Open to community contributions.
Open source, Tools, Vision, Reinforcement learning

Summary generated by Claude — human-verified