56 10 118

Fizz 🏳️‍⚧️ PRO

Fizzarolli

https://discord.gg/PPBMhF2vgC

AI & ML interests

None yet

Recent Activity

updated a dataset about 4 hours ago

allura-org/ethical-alignment-seeds

liked a model about 7 hours ago

PleIAs/Pleias-350m-Preview

updated a model 1 day ago

estrogen/L3.1-Tulu3-RP-Ink-adpt

View all activity

Organizations

Posts 2

Post

1956

hi everyone!

i wanted to share an experiment i did with upcycling phi-3 mini into an moe recently.
while benchmarks are definitely within a margin of error and they performed similarly, i think it's an interesting base to try and see if you can improve phi's performance! (maybe looking into HuggingFaceFW/fineweb-edu could be interesting, i also left some other notes if anyone with more compute access wants to try it themselves)

check it out! Fizzarolli/phi3-4x4b-v1

Post

2384

Is anyone looking into some sort of decentralized/federated dataset generation or classification by humans instead of synthetically?

From my experience with trying models, a *lot* of modern finetunes are trained on what amounts to, in essence, GPT-4 generated slop that makes everything sound like a rip-off GPT-4 (refer to i.e. the Dolphin finetunes). I have a feeling that this is a lot of the reason people haven't been quite as successful as Meta's instruct tunes of Llama 3.

Collections 3

spaces 3

Sleeping

🏢

models 37

datasets 26

Fizzarolli/FallingThroughTheSkies-592k-Filtered-Filtered

Viewer • Updated Oct 8, 2024 • 139k • 44 • 5

Fizzarolli/filtered-wit-recaptioned

Viewer • Updated Sep 2, 2024 • 10 • 34

Fizzarolli/goofed_up_logs

Viewer • Updated Aug 28, 2024 • 82.9k • 30

Fizzarolli/stheno-filtered-v1.1-filtereded

Viewer • Updated Aug 26, 2024 • 8.81k • 37

Fizzarolli/goofed_up_logs_orig

Viewer • Updated Aug 26, 2024 • 13.5k • 29

Fizzarolli/hh-rlhf-helpful-only

Viewer • Updated Aug 10, 2024 • 118k • 44

Fizzarolli/hh-rlhf-h4-test-revised

Viewer • Updated Aug 8, 2024 • 10 • 35 • 1

Fizzarolli/dclm-baseline-1.0-2.5k

Viewer • Updated Aug 8, 2024 • 2.5k • 32

Fizzarolli/rosier-dataset

Viewer • Updated Jul 16, 2024 • 9.28k • 38 • 2

Fizzarolli/fse-raw-dump

Viewer • Updated Jul 16, 2024 • 186k • 39 • 1

Fizz 🏳️‍⚧️ PRO

AI & ML interests

Recent Activity

Organizations

Posts 2

Collections 3

Fizzarolli/L3.1-70b-glitz-v0.2

Fizzarolli/MN-12b-Rosier-v1

Fizzarolli/L3-8b-Rosier-v1

Fizzarolli/duloxetine-4b-v1

Fizzarolli/L3.1-70b-glitz-v0.2

Fizzarolli/clite-500m

Fizzarolli/sappha-2b-v3

spaces 3

Paligemma2 3b E621 Tagger

Shuttle 3 Diffusion

Molmo 7B O 0924

models 37

Fizzarolli/t5-v1_1-base-improved

Fizzarolli/modded-nanogpt-logs

Fizzarolli/OLMoE-1B-7B-0924-extended-pos-emb

Fizzarolli/Nemo_Pony_2-merged

Fizzarolli/cornsnake-6.9b

Fizzarolli/MN-12b-Rosier-v1

Fizzarolli/MN-12b-Sunrose

Fizzarolli/nemo-sunfall-v0.6.1-adapter-on-base

Fizzarolli/roformer-250m-initialized

Fizzarolli/L3.1-70b-glitz-v0.2-GGUF

datasets 26

Fizzarolli/FallingThroughTheSkies-592k-Filtered-Filtered

Fizzarolli/filtered-wit-recaptioned

Fizzarolli/goofed_up_logs

Fizzarolli/stheno-filtered-v1.1-filtereded

Fizzarolli/goofed_up_logs_orig

Fizzarolli/hh-rlhf-helpful-only

Fizzarolli/hh-rlhf-h4-test-revised

Fizzarolli/dclm-baseline-1.0-2.5k

Fizzarolli/rosier-dataset

Fizzarolli/fse-raw-dump

Fizz 🏳️‍⚧️ PRO

AI & ML interests

Recent Activity

Organizations

Posts 2

Collections 3

spaces 3 Sort: Recently updated

Paligemma2 3b E621 Tagger

Shuttle 3 Diffusion

Molmo 7B O 0924

models 37 Sort: Recently updated

datasets 26 Sort: Recently updated

spaces 3

models 37

datasets 26