The model moot20/Llama-3.1-Tulu-3-8B-MLX-8bits was converted to MLX format from allenai/Llama-3.1-Tulu-3-8B using mlx-lm version 0.21.1.
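For reference, conversions like this one are typically produced with the `mlx_lm.convert` entry point shipped with mlx-lm. The exact flags used for this repository are not documented here, so the following is only a sketch of what an 8-bit quantized conversion of the upstream checkpoint would look like:

```bash
# Sketch of the conversion step (assumed flags; the actual invocation used
# for this repo is not recorded in the card).
# -q enables quantization and --q-bits 8 selects 8-bit weights.
mlx_lm.convert --hf-path allenai/Llama-3.1-Tulu-3-8B -q --q-bits 8
```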
To use the model, install mlx-lm:

```bash
pip install mlx-lm
```
Then load the model and generate a response:

```python
from mlx_lm import load, generate

# Load the 8-bit quantized model and its tokenizer from the Hugging Face Hub
model, tokenizer = load("moot20/Llama-3.1-Tulu-3-8B-MLX-8bits")

prompt = "hello"

# Apply the model's chat template when one is defined
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```
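The model can also be run without writing any Python, using the `mlx_lm.generate` command-line entry point that mlx-lm installs. A quick smoke test might look like:

```bash
# One-off generation from the command line; "hello" is just a placeholder prompt
mlx_lm.generate --model moot20/Llama-3.1-Tulu-3-8B-MLX-8bits --prompt "hello"
```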
Base model: meta-llama/Llama-3.1-8B