Text Generation
ELM
English
dev-slx commited on
Commit
c1e796c
·
verified ·
1 Parent(s): 7e3c5b1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -28,9 +28,10 @@ _Fast Inference with Customization:_ Once trained, the ELM model architecture pe
28
  ## ELM-v0.1 Model Release
29
  This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
30
 
31
- Models are located in the `models` folder. ELM models in this repository comes in three sizes (elm-1.0, elm-0.75 and elm-0.25) and supports the following use-case.
32
  - news_content_generation (tldr_news dataset)
33
 
 
34
 
35
 
36
  ## Setup ELM
 
28
  ## ELM-v0.1 Model Release
29
  This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
30
 
31
+ ELM models (in the `models` folder) in this repository come in three sizes (`elm-1.0`, `elm-0.75` and `elm-0.25`). **All these different slices are extracted from the same ELM finetuned checkpoint for inference** and supports the following use-case.
32
  - news_content_generation (tldr_news dataset)
33
 
34
+ **NOTE: ELM-v0.1 release is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.**
35
 
36
 
37
  ## Setup ELM