Model Details

Model Description

Finetune of LLaMa 3.2 1B model to include flashnormalization (https://arxiv.org/abs/2407.09577)

  • Developed by: OpenMachine Labs
  • License: MIT
  • Finetuned from model Meta LLaMa 3.2 1B

Model Sources [optional]

Uses

How to Get Started with the Model

Use the code below to get started with the model.

Speeds, Sizes, Times

[More Information Needed]

Evaluation

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

Model Card Authors

Nils Graef ([email protected])

Drew Wasielewski ([email protected])

Downloads last month
6
Safetensors
Model size
1.24B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for drewwas/OpenMachine_FlashNorm

Finetuned
(241)
this model