-
-
-
-
-
-
Inference status
Active filters:
ONNX
EmbeddedLLM/Phi-3-vision-128k-instruct-onnx
Text Generation
•
Updated
EmbeddedLLM/01-ai_Yi-1.5-6B-Chat-onnx
Text Generation
•
Updated
EmbeddedLLM/gemma-7b-it-onnx
Text Generation
•
Updated
EmbeddedLLM/Phi-3-mini-4k-instruct-062024-onnx
Text Generation
•
Updated
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml
Text Generation
•
Updated
•
7
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml
Text Generation
•
Updated
•
8
EmbeddedLLM/Phi-3-mini-4k-instruct-062024-int4-onnx-directml
Text Generation
•
Updated
•
9
EmbeddedLLM/Phi-3-medium-4k-instruct-onnx-directml
Text Generation
•
Updated
•
6
EmbeddedLLM/Phi-3-medium-128k-instruct-onnx-directml
Text Generation
•
Updated
•
10
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32
Text Generation
•
Updated
•
7
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
Text Generation
•
Updated
•
8
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32
Text Generation
•
Updated
•
9
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
Text Generation
•
Updated
•
7
Maximum2000/Phi-3.5-mini-instruct-cuda-fp16-onnx
Text Generation
•
Updated
Maximum2000/Phi-3.5-mini-instruct-cuda-fp32-onnx
Text Generation
•
Updated
philipp-zettl/bart-large-cnn
Summarization
•
Updated
•
9
onnx-community/Llama-3.2-1B-Instruct-ONNX
Text Generation
•
Updated
•
45
sheldonrobinson/Phi-3.5-mini-instruct-onnx
Text Generation
•
Updated
•
7
juampahc/gliner_multi-v2.1-onnx
Token Classification
•
Updated
•
3
FusionQuill/Phi-3.5-vision-instruct-onnx-cpu-dml