Defa Zhu
mathfinder
AI & ML interests
None yet
Recent Activity
authored
a paper
4 days ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
upvoted
a
paper
4 days ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
authored
a paper
2 months ago
Ultra-Sparse Memory Network
Organizations
None yet
mathfinder's activity
No public activity