ShelterW's picture

7 95

ShelterW

ShelterW

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

deepseek-ai/DeepSeek-R1

new activity 5 days ago

Qwen/Qwen2.5-Math-PRM-7B:If the response length exceeds 4096, is a sliding window used, or is it simply truncated?

new activity 6 days ago

Qwen/Qwen2.5-Math-PRM-7B:"<extra_0>" is not special token ? I got 5 token_ids ，is it right？

View all activity

Organizations

None yet

ShelterW's activity

liked a model 2 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated about 2 hours ago • 5.57k • 1.43k

New activity in Qwen/Qwen2.5-Math-PRM-7B 5 days ago

If the response length exceeds 4096, is a sliding window used, or is it simply truncated?

#6 opened 5 days ago by

New activity in Qwen/Qwen2.5-Math-PRM-7B 6 days ago

"<extra_0>" is not special token ? I got 5 token_ids ，is it right？

#4 opened 7 days ago by

New activity in OpenLeecher/lmsys_chat_1m_clean 9 days ago

What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?

#5 opened 9 days ago by

New activity in OpenLeecher/lmsys_chat_1m_clean 14 days ago

reward is None

#3 opened 14 days ago by

liked a Space 20 days ago

Running on CPU Upgrade

Open ASR Leaderboard

liked a model 20 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated about 2 hours ago • 29.4k • 2.18k

liked a model about 1 month ago

unsloth/Llama-3.3-70B-Instruct-bnb-4bit

Text Generation • Updated 15 days ago • 153k • 29

updated a model about 1 month ago

ShelterW/Qwen2.5-Math-72B-Instruct-AWQ

Updated Dec 10, 2024

liked a model about 1 month ago

Qwen/QwQ-32B-Preview

Text Generation • Updated 10 days ago • 166k • • 1.58k

updated 2 datasets about 2 months ago

ShelterW/chinese_common_ner

Viewer • Updated Dec 6, 2024 • 110k • 54

ShelterW/chinese_medical_ner

Viewer • Updated Dec 6, 2024 • 251k • 76

liked a Space about 2 months ago

QwQ-32B-Preview

QwQ-32B-Preview

liked a model 2 months ago

2Noise/ChatTTS

Text-to-Audio • Updated Oct 22, 2024 • 34.1k • 1.44k

liked a dataset 5 months ago

BAAI/Infinity-Instruct

Viewer • Updated 6 days ago • 20.4M • 5.38k • 584

liked a dataset 6 months ago

lmsys/lmsys-chat-1m

Viewer • Updated Jul 27, 2024 • 1M • 2.04k • 626

New activity in unsloth/gemma-2-27b-it-bnb-4bit 6 months ago

hidden state is nan

#2 opened 6 months ago by

liked 3 models 6 months ago

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 2.08M • • 1.4k

unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit

Text Generation • Updated Sep 11, 2024 • 13.1k • 25

unsloth/gemma-2-27b-it-bnb-4bit

Text Generation • Updated Sep 3, 2024 • 5.02k • 11