Model Details

Model and Training Details

Preprocessing

  • preprocessed and packed the sft dataset with trl.trainer.ConstantLengthDataset

Results

image/png

Compute Infrastructure

The model is trained using 4 * RTX 3090 - 24GB

Model Card Authors

Yiyu (Michael) Ren

Model Card Contact

Email: [email protected]

Framework versions

  • PEFT 0.8.2
Downloads last month
2
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model’s pipeline type.

Model tree for renyiyu/llama-2-7b-sft-lora

Adapter
(1818)
this model