Commit History
FIX: TRL trainer preprocessing step was running in one process (#1583)
b9bb169
unverified
committed by Ali Mosavian
ADD: warning hub model (#1301)
601c08b
unverified
PoSE context length ext (#1567)
5294653
unverified
make sure everything stays in the same dtype when using dpo + FSDP (#1559)
68601ec
unverified
Add support for Gemma chat template (#1530)
60f5ce0
unverified
wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548)
7477a53
unverified
ORPO Trainer replacement (#1551)
7d1d22f
unverified
Unsloth gradient checkpointing offload (#1528)
6319da1
unverified
DBRX Model Support (#1462)
132eb74
unverified
use locale agnostic seperator to make large nums easier to read (#1503)
da9b1a3
unverified
WIP: Support table logging for mlflow, too (#1506)
057fa44
unverified
Correctly handle splits for datasets.arrow_dataset.Dataset objects (#1504)
8fa0785
unverified
Print versions (#1496)
4313b1a
unverified
add field to sft dataset pydantic for completion support (#1497)
ff01c45
unverified
ignore issues with calculating # params when printing (#1493)
2fa65b9
unverified
Remove `validate_quantized_dora` (#1485)
9430b6e
unverified
committed by xzuyn