AI & ML interests

None defined yet.

Recent Activity

RWKV's activity

BlinkDL 
posted an update 19 days ago
view post
Post
1908
RWKV-7 "Goose" 0.4B trained w/ ctx4k automatically extrapolates to ctx32k+, and perfectly solves NIAH ctx16k 🤯 100% RNN and attention-free. Only trained on the Pile. No finetuning. Replicable training runs. tested by our community: https://github.com/Jellyfish042/LongMamba
SmerkyG 
in RWKV/v6-Finch-7B-HF about 2 months ago

Update README.md

#1 opened 4 months ago by
SmerkyG
BlinkDL 
posted an update about 2 months ago
view post
Post
4435
RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU 54.2% (previous world-v2.1 = 47.9%. note: without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)
BlinkDL 
posted an update 3 months ago
xianbao 
posted an update 4 months ago
view post
Post
1740
With the open-weight release of CogVideoX-5B from THUDM, i.e. GLM team, the Video Generation Model (how about calling it VGM) field has officially became the next booming "LLM"

What does the landscape look like? What are other video generation models? This collection below is all your need.

xianbao/video-generation-models-66c350163c74f60f5c412af6

The above video is generated by @a-r-r-o-w with CogVideoX-5B, taken from a nice lookout for the field!
ybelkada 
posted an update 5 months ago
ybelkada 
posted an update 5 months ago