Cao
JiaxinTsao
AI & ML interests
None yet
JiaxinTsao's activity
Anyone else encountering bad quantized(?) performance with Llama3-70B?
1
#37 opened 9 months ago
by
philjd
SiLU or GLU activation?
1
#21 opened 10 months ago
by
JiaxinTsao
No `lm_head.weight` in checkpoint ?
#31 opened 11 months ago
by
JiaxinTsao
can not run sft full finetuning.
9
#74 opened about 1 year ago
by
hegang126
How can I make the model's output strictly follow a defined JSON structure?
1
#12 opened about 1 year ago
by
tang0430