qualcomm
/

Baichuan-7B

@@ -30,7 +30,7 @@ accross various devices, can be found [here](https://aihub.qualcomm.com/models/b
   - Model-1 (Prompt Processor): Baichuan-PromptProcessor-Quantized
   - Max context length: 1024
   - Prompt processor input: 1024 tokens
-  - Prompt processor output: 1 output token + KVCache for token generator
   - Model-2 (Token Generator): Baichuan-TokenGenerator-KVCache-Quantized
   - Token generator input: 1 input token + past KVCache
   - Token generator output: 1 output token + KVCache for next iteration

   - Model-1 (Prompt Processor): Baichuan-PromptProcessor-Quantized
   - Max context length: 1024
   - Prompt processor input: 1024 tokens
+  - Prompt processor output: 1024 output tokens + KVCache for token generator
   - Model-2 (Token Generator): Baichuan-TokenGenerator-KVCache-Quantized
   - Token generator input: 1 input token + past KVCache
   - Token generator output: 1 output token + KVCache for next iteration