Agent Skills: mamba-architecture
State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV cache. Selective SSM with hardware-aware design. Mamba-1 (d_state=16) and Mamba-2 (d_state=128, multi-head). Models 130M-2.8B on HuggingFace.
UncategorizedID: davila7/claude-code-templates/mamba-architecture
19,6461,834
Install this agent skill to your local
Skill Files
Browse the full folder contents for mamba-architecture.
Loading file tree…
Select a file to preview its contents.