PrunaAI/nvidia-Llama3-ChatQA-1.5-8B-QUANTO-int4bit-smashed


1 contributor

History: 3 commits

Latest commit by sharpenb: 2683fb76bc4ee758e23d959366a4ecefa0b31bc3e077682224dbf20565d35909 (f93b8ba, verified), 7 months ago

File                     Size           Last commit message                                               Updated
.gitattributes           1.52 kB        initial commit                                                    7 months ago
README.md                5.34 kB        2683fb76bc4ee758e23d959366a4ecefa0b31bc3e077682224dbf20565d35909  7 months ago
model.pt                 16.2 GB (LFS)  a5da407d86d3dff93b293393803827fd573be80a5f2556e3bf6cdf598083f395  7 months ago
smash_config.json        1.03 kB        2683fb76bc4ee758e23d959366a4ecefa0b31bc3e077682224dbf20565d35909  7 months ago
special_tokens_map.json  301 Bytes      2683fb76bc4ee758e23d959366a4ecefa0b31bc3e077682224dbf20565d35909  7 months ago
tokenizer.json           9.08 MB        2683fb76bc4ee758e23d959366a4ecefa0b31bc3e077682224dbf20565d35909  7 months ago
tokenizer_config.json    51.3 kB        2683fb76bc4ee758e23d959366a4ecefa0b31bc3e077682224dbf20565d35909  7 months ago