Mining Tasky Data

non-profit

Activity Feed

AI & ML interests

Mining Tasky Data

Recent Activity

manandey new activity about 2 months ago

taskydata/deberta-v3-base_10xp3nirstbbflanse_5xc4:Adding `safetensors` variant of this model

Muennighoff authored a paper 3 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Muennighoff authored a paper 4 months ago

OLMoE: Open Mixture-of-Experts Language Models

View all activity

taskydata's activity

manandey

in taskydata/deberta-v3-base_10xp3nirstbbflanse_5xc4 about 2 months ago

Adding `safetensors` variant of this model

#1 opened about 2 months ago by

SFconvertbot

Muennighoff

authored a paper 3 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 104

Muennighoff

authored a paper 4 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 77

manandey

updated a dataset 5 months ago

taskydata/Pile-T5-Instruction_updated

Viewer • Updated Jul 25, 2024 • 23.5k • 64

Muennighoff

authored a paper 5 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 68

manandey

updated a model 5 months ago

taskydata/pile-t5-xl-instruction

Text2Text Generation • Updated Jul 23, 2024 • 10

jordiclive

updated a model 5 months ago

taskydata/pile-t5-base-instruction

Text2Text Generation • Updated Jul 23, 2024 • 23

jordiclive

updated a dataset 5 months ago

taskydata/C4-Pile-T5-xl-Instructions

Viewer • Updated Jul 23, 2024 • 100 • 9

Muennighoff

authored a paper 6 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 35

craffel

authored a paper 6 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 87

hails

authored a paper 6 months ago

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

Paper • 2406.16838 • Published Jun 24, 2024 • 2

Muennighoff

authored 2 papers 7 months ago

C-Pack: Packaged Resources To Advance General Chinese Embedding

Paper • 2309.07597 • Published Sep 14, 2023 • 1

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 50

hails

authored a paper 7 months ago

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Paper • 2406.04391 • Published Jun 6, 2024 • 7

Muennighoff

authored a paper 7 months ago

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding

Paper • 2406.02396 • Published Jun 4, 2024

manandey

updated a dataset 7 months ago

taskydata/C4-Pile-T5-base-Inst-no_robots

Viewer • Updated May 30, 2024 • 100 • 7

manandey

updated a model 7 months ago

taskydata/pile-t5-base-ins-no_robots

Text2Text Generation • Updated May 30, 2024 • 26

manandey

updated 2 datasets 7 months ago

taskydata/Pile-T5-Instruction

Viewer • Updated May 29, 2024 • 24.3k • 20

taskydata/C4-Pile-T5-base-Instructions

Viewer • Updated May 28, 2024 • 100 • 8

manandey

updated a dataset 8 months ago

taskydata/GPT4Tools

Viewer • Updated May 17, 2024 • 71.4k • 40 • 1

AI & ML interests

Recent Activity

Team members 7

taskydata's activity

Adding `safetensors` variant of this model