vilm
/

Mixsmol-4x400M-v0.1-epoch1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mixsmol-4x400M-v0.1-epoch1

1 contributor

History: 16 commits

leaderboard-pr-bot's picture

leaderboard-pr-bot

Adding Evaluation Results

195bc34 verified 11 months ago

.gitattributes

1.52 kB

initial commit 12 months ago
README.md

6.64 kB

Adding Evaluation Results 11 months ago
config.json

768 Bytes

Update config.json 12 months ago
model.safetensors

3.55 GB
LFS

Training in progress, epoch 0 12 months ago
special_tokens_map.json

552 Bytes

Training in progress, epoch 0 12 months ago
tokenizer.json

2.15 MB

Upload tokenizer 12 months ago
tokenizer.model

597 kB
LFS

Training in progress, epoch 0 12 months ago
tokenizer_config.json

1.02 kB

Training in progress, epoch 0 12 months ago
trainer_log.jsonl

3.36 MB

Training in progress, epoch 0 12 months ago
training_args.bin
Detected Pickle imports (12)
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DistributedType",
- "torch.bfloat16",
- "transformers.training_args_seq2seq.Seq2SeqTrainingArguments",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.training_args.OptimizerNames",
- "torch.device",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.SchedulerType"
How to fix it?
6.71 kB
LFS

Training in progress, epoch 0 12 months ago