
ahl kal

ahlkal

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago
google/gemma-7b-it
liked a Space 7 days ago
hf-accelerate/model-memory-usage

Organizations

None yet

ahlkal's activity

reacted to DmitryRyumin's post with 👍 about 2 months ago
🔥🚀🌟 New Research Alert - xLSTM! 🌟🚀🔥
📄 Title: xLSTM: Extended Long Short-Term Memory 🔍

πŸ“ Description: xLSTM is a scaled-up LSTM architecture with exponential gating and modified memory structures to mitigate known limitations. xLSTM blocks outperform SOTA transformers and state-space models in performance and scaling.

👥 Authors: Maximilian Beck et al.

📄 Paper: xLSTM: Extended Long Short-Term Memory (2405.04517)

📁 Repository: https://github.com/NX-AI/xlstm

📚 More Papers: more cutting-edge research presented at other conferences is available in DmitryRyumin/NewEraAI-Papers, curated by @DmitryRyumin

πŸ” Keywords: #xLSTM #DeepLearning #Innovation #AI