tdh111
AI & ML interests
None yet
Recent Activity
new activity
5 days ago
Downtown-Case/Star-Command-R-Lite-32B-v1:good
new activity
13 days ago
mradermacher/DeepSeek-V3-i1-GGUF:imatrix.dat missing output.weight and token_embd.weight
Organizations
None yet
tdh111's activity
imatrix.dat missing output.weight and token_embd.weight
3
#1 opened 13 days ago
by
tdh111
Quantum Entanglement and the Sentient Toaster: Revolutionizing LLM Training
265
#3 opened about 2 months ago
by
mradermacher
https://huggingface.co/deepseek-ai/DeepSeek-V3-Base
21
#515 opened about 1 month ago
by
nicoboss
Can Llama-3.1-Nemotron-40B-Instruct be released as well?
#24 opened 24 days ago
by
tdh111
What is the context size this model was trained on?
2
#23 opened 28 days ago
by
treehugg3
2 abc or not 2 abc
386
#2 opened 5 months ago
by
mradermacher
Nemotron 51B
15
#436 opened 2 months ago
by
AIGUYCONTENT
Disable all languages except English
1
#3 opened 4 months ago
by
codewalker7
https://huggingface.co/Downtown-Case/Star-Command-R-Lite-32B-v1
8
#417 opened 3 months ago
by
DazzlingXeno