VAGOsolutions
/

SauerkrautLM-1.5b

Text Generation

continuous pretraining

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

DavidGF commited on Jun 12, 2024

Commit

e106a17

·

verified ·

1 Parent(s): 902942e

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -66,6 +66,8 @@ Despite the large volume of German CPT data, the model competes well against the
 **Post-CPT Training**:
 The model underwent 3 epochs of Supervised Fine-Tuning (SFT) with 700K samples.
 **Further Steps**:
 The model was aligned with Direct Preference Optimization (DPO) using 70K samples.

 **Post-CPT Training**:
 The model underwent 3 epochs of Supervised Fine-Tuning (SFT) with 700K samples.
 **Further Steps**:
 The model was aligned with Direct Preference Optimization (DPO) using 70K samples.