stereoplegic's Collections
StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
Paper • 2311.14495 • Published • 1
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Paper • 2401.09417 • Published • 59
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
Paper • 2401.13560 • Published • 1
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces
Paper • 2402.00789 • Published • 2
Convolutional State Space Models for Long-Range Spatiotemporal Modeling
Paper • 2310.19694 • Published • 2
Vivim: a Video Vision Mamba for Medical Video Object Segmentation
Paper • 2401.14168 • Published • 2
2-D SSM: A General Spatial Layer for Visual Transformers
Paper • 2306.06635 • Published • 1
BlackMamba: Mixture of Experts for State-Space Models
Paper • 2402.01771 • Published • 23
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Paper • 2402.04248 • Published • 30
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Paper • 2302.06646 • Published • 2
A Quantitative Review on Language Model Efficiency Research
Paper • 2306.01768 • Published • 1
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
Paper • 2302.06218 • Published • 1
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity
Paper • 2311.08756 • Published • 1
Graph Mamba: Towards Learning on Graphs with State Space Models
Paper • 2402.08678 • Published • 13
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models
Paper • 2403.00818 • Published • 15
Improving Token-Based World Models with Parallel Observation Prediction
Paper • 2402.05643 • Published • 1
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling
Paper • 2402.10211 • Published • 11
LOCOST: State-Space Models for Long Document Abstractive Summarization
Paper • 2401.17919 • Published
Diffusion Models Without Attention
Paper • 2311.18257 • Published • 2
ZigMa: Zigzag Mamba Diffusion Model
Paper • 2403.13802 • Published • 17
MambaIR: A Simple Baseline for Image Restoration with State-Space Model
Paper • 2402.15648 • Published
SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Paper • 2403.07711 • Published
Scalable Diffusion Models with State Space Backbone
Paper • 2402.05608 • Published
LocalMamba: Visual State Space Model with Windowed Selective Scan
Paper • 2403.09338 • Published • 7
VMamba: Visual State Space Model
Paper • 2401.10166 • Published • 38
VideoMamba: State Space Model for Efficient Video Understanding
Paper • 2403.06977 • Published • 27
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Paper • 2403.19888 • Published • 10
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 52
Zamba: A Compact 7B SSM Hybrid Model
Paper • 2405.16712 • Published • 22
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Paper • 2406.07522 • Published • 37
Longhorn: State Space Models are Amortized Online Learners
Paper • 2407.14207 • Published • 17
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Paper • 2405.14174 • Published
ReMamba: Equip Mamba with Effective Long-Sequence Modeling
Paper • 2408.15496 • Published • 10
Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval
Paper • 2408.08066 • Published
GrootVL: Tree Topology is All You Need in State Space Model
Paper • 2406.02395 • Published