CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Paper • 2406.05967 • Published Jun 10, 2024 • 5
MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding Paper • 2406.09297 • Published Jun 13, 2024 • 4 • 2
MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding Paper • 2406.09297 • Published Jun 13, 2024 • 4
zaydzuhri/the_pile_tokenized_5percent_truncated_packed_v2 Viewer • Updated Mar 5, 2024 • 2.46M • 35
zaydzuhri/the_pile_tokenized_5percent_truncated_packed Viewer • Updated Feb 9, 2024 • 2.11M • 39