Spaces:
Runtime error
Runtime error
Commit History
feat: update model config + save optim
a30dbd3
fix: model config
5aaf9df
feat: use bart-large-cnn
19d68bb
fix: log metadata
99a1ff5
fix: define function before it is used
d449092
fix: correct arg
283adc6
feat: save model frequently
754f876
feat: use bart large
bb3bfa6
fix: use correct key
b20769d
fix: log correct metrics
3fef9c1
feat: hardcode eval_steps
4c5e5a7
fix: eval_steps belongs to training_args
900136f
feat: eval_steps already exists in TrainingArguments
0a0080b
Merge branch 'main'
3ddf1c5
feat: set default x-axis
97a008e
feat: log everything through wandb
19070ab
Merge pull request #21 from borisdayma/feat-no_decay
b29bab7
unverified
feat: eval less often for faster training
f0a53ac
Merge pull request #20 from borisdayma/eval-interval
635402d
unverified
feat: no decay option
5a3211f
feat: use common wandb shared folder
7aa2f4b
feat: hardcoded datasets
e8709a6
Add eval_interval to evaluate and log every so often.
566d5f2
Pedro Cuenca
commited on
Shift tokens in numpy because the built in shift function stalls.
835ea55
Pedro Cuenca
commited on
fix: should be converted to array
945d86c
fix: labels array
6c1f112
fix: typo
678a62f
fix: model config
0be4942
fix: correct decoder_input_ids and labels
19946be
feat: don't log model by default
5b79afd
feat: fix typo
ec8d66b
feat: log model
1c44a7d
feat: update default parameters
dbe8c41
feat: output_length considers bos and eos
8bb2236
feat: shared cache folder
42ce7dd
fix: accumulation vs lr
4d55db6
fix: wandb logging with sync_tensorboard
8ba598c
feat: lower default source length
48c07ca
fix: typos
5960e87
feat: gradient accumulation
c9e9575
feat: simplify loss function
9db361a
feat: padding mask not required
d61405b
fix: typo
833a2d5
feat: add adafactor
600ad79
feat: log metrics more frequently
498559f
* Make padding mask optional.
ecafe5e
Pedro Cuenca
commited on
* Only perform validation if requested
32dc2d8
Pedro Cuenca
commited on
Preprocessing: return "labels", "decoder_input_ids" and
df3c7bd
Pedro Cuenca
commited on
Decoder: set eos to an unreachable value, set min_length=max_length to
a841a4c
Pedro Cuenca
commited on