zamroni111 commited on
Commit
65ba526
·
verified ·
1 Parent(s): b01f270

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -67,14 +67,13 @@ c. Rename the original model.onnx to other file name and put and rename the opti
67
  d. Rerun step 4.
68
 
69
  #### Speeds, Sizes, Times [optional]
70
- 15 token/s in Radeon 780M with 8GB dedicated RAM.<br>
71
  Increase to 16 token/s with device specific optimized model.onnx.<br>
72
  As comparison, LM Studio using GGUF INT4 model and VulkanML GPU acceleration runs at 13 token/s.
73
 
74
  #### Hardware
75
  AMD Ryzen Zen4 7840U with integrated Radeon 780M GPU<br>
76
  RAM 32GB<br>
77
- 8GB pre-allocated iGPU VRAM
78
 
79
  #### Software
80
  Microsoft DirectML on Windows 10
 
67
  d. Rerun step 4.
68
 
69
  #### Speeds, Sizes, Times [optional]
70
+ 15 token/s in Radeon 780M with 8GB pre-allocated RAM.<br>
71
  Increase to 16 token/s with device specific optimized model.onnx.<br>
72
  As comparison, LM Studio using GGUF INT4 model and VulkanML GPU acceleration runs at 13 token/s.
73
 
74
  #### Hardware
75
  AMD Ryzen Zen4 7840U with integrated Radeon 780M GPU<br>
76
  RAM 32GB<br>
 
77
 
78
  #### Software
79
  Microsoft DirectML on Windows 10