Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library, NER & PoS Tagging, LM Pretraining (mostly encoder-only), Historical Language Models
Recent Activity
reacted
to
nroggendorff's
post
with 😔
1 day ago
im so tired
reacted
to
nroggendorff's
post
with ➕
1 day ago
hey nvidia, can you send me a gpu?
comment or react if you want ~~me~~ to get one too. 👉👈
updated
a model
5 days ago
stefan-it/bert5urk
Articles
Organizations
stefan-it's activity
upvoted
an
article
11 days ago
Article
FineWeb2-C: Help Build Better Language Models in Your Language
By
•
•
10jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Paper
•
2412.08802
•
Published
•
4
Evaluating Pixel Language Models on Non-Standardized Languages
Paper
•
2412.09084
•
Published
•
1
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain
Paper
•
2412.09341
•
Published
•
1
OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages
Paper
•
2412.09587
•
Published
•
3
The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective
Paper
•
2412.09460
•
Published
•
5
upvoted
an
article
29 days ago
Article
They Said It Couldn’t Be Done
By
•
•
76upvoted
a
paper
about 2 months ago
upvoted
a
collection
about 2 months ago
Representation Deficiency in Masked Language Modeling
Paper
•
2302.02060
•
Published
•
1
GPT or BERT: why not both?
Paper
•
2410.24159
•
Published
•
14
Zipfian Whitening
Paper
•
2411.00680
•
Published
•
9
WikiNER-fr-gold: A Gold-Standard NER Corpus
Paper
•
2411.00030
•
Published
•
4
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Paper
•
2410.20771
•
Published
•
3
upvoted
a
paper
3 months ago