All the GPT2 variants I have trained for my Masters Thesis
Tanay Mehta
tanaymehta
AI & ML interests
Large Language Models
Recent Activity
updated
a model
3 days ago
Aleph-Alpha/Pharia-1-LLM-7B-control-hf
new activity
3 days ago
Aleph-Alpha/Pharia-1-LLM-7B-control-hf:Make loss calculation possible during eval mode
Organizations
Collections
1
models
40
tanaymehta/gpt2_900K_100eps
Text Generation
•
Updated
•
26
tanaymehta/gpt2_800K_100eps
Text Generation
•
Updated
•
28
tanaymehta/gpt2_700K_100eps
Text Generation
•
Updated
•
26
tanaymehta/gpt2_600K_100eps
Text Generation
•
Updated
•
25
tanaymehta/gpt2_500K_100eps
Text Generation
•
Updated
•
26
tanaymehta/gpt2_400K_100eps
Text Generation
•
Updated
•
33
tanaymehta/gpt2_300K_100eps
Text Generation
•
Updated
•
31
tanaymehta/gpt2_200K_100eps
Text Generation
•
Updated
•
24
tanaymehta/gpt2_100K_100eps
Text Generation
•
Updated
•
28
tanaymehta/gpt2_1M_100eps
Text Generation
•
Updated
•
36