Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library, NER & PoS Tagging, LM Pretraining (mostly encoder-only), Historical Language Models
Recent Activity
updated
a model
2 days ago
stefan-it/bert5urk
liked
a dataset
3 days ago
batubayk/TR-News
upvoted
an
article
9 days ago
FineWeb2-C: Help Build Better Language Models in Your Language
Articles
Organizations
Posts
1
Post
1163
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.
👉 Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
👉 Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ❤️ and 🥨.
👉 Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
👉 Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ❤️ and 🥨.
Collections
14
My pretrained LMs on FineWeb datasets - part of my TensorFlow Model Garden LMs project
A Collection of Historical Multilingual Language Models
-
dbmdz/bert-base-historic-multilingual-cased
Fill-Mask • Updated • 62 • 6 -
dbmdz/bert-base-historic-multilingual-64k-td-cased
Fill-Mask • Updated • 44 • 1 -
hmbyt5-preliminary/byt5-small-historic-multilingual-span20-flax
Text2Text Generation • Updated • 14 -
hmteams/teams-base-historic-multilingual-discriminator
Updated • 7
models
1334
stefan-it/bert5urk
Updated
•
4
stefan-it/span-marker-gelectra-large-germeval14
Token Classification
•
Updated
•
1.3k
•
2
stefan-it/zeitungs-lm-v1
Updated
•
48
•
3
stefan-it/wav2vec2-large-xlsr-53-basque
Automatic Speech Recognition
•
Updated
•
35
stefan-it/german-gpt2-larger
Text Generation
•
Updated
•
477
•
8
stefan-it/xlstm-german-wikipedia
Text Generation
•
Updated
•
188
•
7
stefan-it/flair-barner-wiki-coarse-gbert-large
Token Classification
•
Updated
•
7
•
1
stefan-it/flair-clean-conll-5
Token Classification
•
Updated
•
9
stefan-it/flair-clean-conll-4
Token Classification
•
Updated
stefan-it/flair-clean-conll-3
Token Classification
•
Updated
•
2
datasets
12
stefan-it/senti-anno
Viewer
•
Updated
•
929
•
89
stefan-it/offenseval2020_tr
Viewer
•
Updated
•
35.3k
•
135
stefan-it/dewiki-20230701-nltk-corpus
Viewer
•
Updated
•
39.4M
•
48
•
2
stefan-it/germeval14_no_wikipedia
Preview
•
Updated
•
55
stefan-it/histnero
Viewer
•
Updated
•
217k
•
40
stefan-it/HisGermaNER
Preview
•
Updated
•
224
•
2
stefan-it/co-funer
Preview
•
Updated
•
55
stefan-it/german-dbmdz-bert-corpus
Viewer
•
Updated
•
52.8M
•
54
•
2
stefan-it/span-marker-base-model-detection
Viewer
•
Updated
•
28
•
44
stefan-it/flair-base-model-detection
Viewer
•
Updated
•
52
•
32
•
1