Tristan/dclm-perplexity-correlations-spearmanr-no-samp-410m • Text Generation • Updated Nov 22, 2024 • 13
Tristan/dclm-perplexity-correlations-spearmanr-no-samp-160m • Text Generation • Updated Nov 22, 2024 • 10
Tristan/RedPajama-Data-V2-sample-100B-filtered-shuffled-tokenized-with-token-counts • Viewer • Updated May 31, 2024 • 4.16M • 73
Tristan/RedPajama-Data-V2-sample-100B-filtered-for-regression-domains-with-domains • Viewer • Updated May 24, 2024 • 4.16M • 71