awacke1 commited on
Commit
f87791a
ยท
1 Parent(s): d18c16f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -9,6 +9,7 @@ app_file: app.py
9
  pinned: true
10
  license: mit
11
  ---
 
12
  ## ChatGPT Datasets ๐Ÿ“š
13
  - WebText
14
  - Common Crawl
@@ -16,6 +17,7 @@ license: mit
16
  - English Wikipedia
17
  - Toronto Books Corpus
18
  - OpenWebText
 
19
  ## ChatGPT Datasets - Details ๐Ÿ“š
20
  - **WebText:** A dataset of web pages crawled from domains on the Alexa top 5,000 list. This dataset was used to pretrain GPT-2.
21
  - [WebText: A Large-Scale Unsupervised Text Corpus by Radford et al.](https://paperswithcode.com/dataset/webtext)
 
9
  pinned: true
10
  license: mit
11
  ---
12
+
13
  ## ChatGPT Datasets ๐Ÿ“š
14
  - WebText
15
  - Common Crawl
 
17
  - English Wikipedia
18
  - Toronto Books Corpus
19
  - OpenWebText
20
+
21
  ## ChatGPT Datasets - Details ๐Ÿ“š
22
  - **WebText:** A dataset of web pages crawled from domains on the Alexa top 5,000 list. This dataset was used to pretrain GPT-2.
23
  - [WebText: A Large-Scale Unsupervised Text Corpus by Radford et al.](https://paperswithcode.com/dataset/webtext)