Downstream Issues Due to Inference-time Optimization in Model Forward

#2
by theyorubayesian - opened

There's an optimization in the model forward that limits the logits returned to the last timestep (https://huggingface.co/BeardedMonster/SabiYarn-125M/blob/main/pretrained_model.py#L196). This causes issues during likelihood-based evaluations, where logits for every time step are needed to score the target tokens.
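For context, this is a common pattern in GPT-style implementations: when no targets are passed, the output projection is applied only to the last position to speed up generation. The sketch below is illustrative rather than the repository's actual code; `TinyLM`, `embed`, and `lm_head` are assumed names:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyLM(nn.Module):
    """Hypothetical stand-in for the model forward; names are illustrative."""
    def __init__(self, vocab_size=256, d_model=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.lm_head = nn.Linear(d_model, vocab_size, bias=False)

    def forward(self, idx, targets=None):
        x = self.embed(idx)  # (batch, seq_len, d_model)
        if targets is not None:
            # Training path: logits for every position.
            logits = self.lm_head(x)
            loss = F.cross_entropy(
                logits.view(-1, logits.size(-1)), targets.view(-1)
            )
        else:
            # Inference-time optimization: project only the last position.
            # Shape is (batch, 1, vocab_size) -- fine for greedy/sampled
            # generation, but it breaks likelihood-based evaluation,
            # which needs logits at every time step.
            logits = self.lm_head(x[:, [-1], :])
            loss = None
        return logits, loss
```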

I will sort it out as soon as I can. Thank you.

BeardedMonster changed discussion status to closed

It has been fixed. Logits for all time steps should be returned now.
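For anyone running likelihood-based evaluations against this, the full logit tensor lets each target token be scored against the logits at the preceding position. A minimal sketch, assuming the forward now returns logits of shape (batch, seq_len, vocab_size) when no targets are passed, and reusing the hypothetical model from the sketch above:

```python
import torch.nn.functional as F

# idx: (batch, seq_len) token ids; model: the fixed forward sketched above.
logits, _ = model(idx)                        # (batch, seq_len, vocab_size)
# Logits at position t predict the token at position t+1.
log_probs = F.log_softmax(logits[:, :-1, :], dim=-1)
token_ll = log_probs.gather(-1, idx[:, 1:].unsqueeze(-1)).squeeze(-1)
sequence_ll = token_ll.sum(dim=-1)            # per-sequence log-likelihood
```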

BeardedMonster changed discussion status to open
BeardedMonster changed discussion status to closed
