Hugging Face
ShelterW
0 followers · 2 following
AI & ML interests
None yet
Organizations
None yet
ShelterW's activity
liked
a model
2 days ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
about 2 hours ago
•
5.57k
•
1.43k
New activity in
Qwen/Qwen2.5-Math-PRM-7B
5 days ago
If the response length exceeds 4096, is a sliding window used, or is it simply truncated?
#6 opened 5 days ago by
ShelterW
New activity in
Qwen/Qwen2.5-Math-PRM-7B
6 days ago
Is "<extra_0>" not a special token? I got 5 token_ids, is that right?
5
#4 opened 7 days ago by
ShelterW
New activity in
OpenLeecher/lmsys_chat_1m_clean
9 days ago
What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?
#5 opened 9 days ago by
ShelterW
New activity in
OpenLeecher/lmsys_chat_1m_clean
14 days ago
reward is None
1
#3 opened 14 days ago by
ShelterW
liked
a Space
20 days ago
Running on CPU Upgrade
598
🏆
Open ASR Leaderboard
liked
a model
20 days ago
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
about 2 hours ago
•
29.4k
•
2.18k
liked
a model
about 1 month ago
unsloth/Llama-3.3-70B-Instruct-bnb-4bit
Text Generation
•
Updated
15 days ago
•
153k
•
29
updated
a model
about 1 month ago
ShelterW/Qwen2.5-Math-72B-Instruct-AWQ
Updated
Dec 10, 2024
liked
a model
about 1 month ago
Qwen/QwQ-32B-Preview
Text Generation
•
Updated
10 days ago
•
166k
•
1.58k
updated
2 datasets
about 2 months ago
ShelterW/chinese_common_ner
Viewer
•
Updated
Dec 6, 2024
•
110k
•
54
ShelterW/chinese_medical_ner
Viewer
•
Updated
Dec 6, 2024
•
251k
•
76
liked
a Space
about 2 months ago
Running
870
🔍
QwQ-32B-Preview
liked
a model
2 months ago
2Noise/ChatTTS
Text-to-Audio
•
Updated
Oct 22, 2024
•
34.1k
•
1.44k
liked
a dataset
5 months ago
BAAI/Infinity-Instruct
Viewer
•
Updated
6 days ago
•
20.4M
•
5.38k
•
584
liked
a dataset
6 months ago
lmsys/lmsys-chat-1m
Viewer
•
Updated
Jul 27, 2024
•
1M
•
2.04k
•
626
New activity in
unsloth/gemma-2-27b-it-bnb-4bit
6 months ago
hidden state is nan
1
#2 opened 6 months ago by
ShelterW
liked
3 models
6 months ago
mistralai/Mistral-Nemo-Instruct-2407
Text Generation
•
Updated
Nov 6, 2024
•
2.08M
•
1.4k
unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit
Text Generation
•
Updated
Sep 11, 2024
•
13.1k
•
25
unsloth/gemma-2-27b-it-bnb-4bit
Text Generation
•
Updated
Sep 3, 2024
•
5.02k
•
11