stefan-it/xlstm-german-wikipedia
Tags: Text Generation, Transformers, Safetensors, German, xlstm, custom_code
License: cc-by-sa-3.0
1 contributor
History: 27 commits
Latest commit: dbe6e99 (verified) by stefan-it, 5 months ago: xlstm-config: temporarily introduce new hidden_size parameter
File                     Scan  Size          Updated       Last commit message
.gitattributes           Safe  1.52 kB       7 months ago  initial commit
README.md                Safe  3.75 kB       5 months ago  readme: include some new logo :-)
brat-logo.png            Safe  57.8 kB       5 months ago  figure: add some new logo :p
config.json              Safe  639 Bytes     5 months ago  config: fix it
configuration_xlstm.py   Safe  3.08 kB       5 months ago  xlstm-config: temporarily introduce new hidden_size parameter
generation_config.json   Safe  69 Bytes      5 months ago  model: add generation confgi
model.safetensors        Safe  445 MB (LFS)  5 months ago  model: add newly trained xLSTM model (with grad clipping)
modeling_xlstm.py        Safe  6.58 kB       5 months ago  xlstm: add configuration and modeling (own one)
special_tokens_map.json  Safe  551 Bytes     5 months ago  tokenizer: add config and vocab
tokenizer.json           Safe  1.84 MB       5 months ago  tokenizer: add config and vocab
tokenizer_config.json    Safe  957 Bytes     5 months ago  tokenizer: add config and vocab
training-loss.png        Safe  201 kB        5 months ago  figure: add updated loss curve for training