Pelochus
/

llama2-chat-70b-hf-rk3588

Model card Files Files and versions Community

Pelochus commited on Apr 16, 2024

Commit

1cf89a2

·

verified ·

1 Parent(s): 0b41c82

Create README.md

Files changed (1) hide show

README.md +32 -0

README.md ADDED Viewed

	@@ -0,0 +1,32 @@

+---
+tags:
+- llama2
+- llama2-70b
+- rkllm
+- rockchip
+- rk3588
+---
+# Llama 2 Chat 70B for RK3588
+This is a conversion from https://huggingface.co/meta-llama/Llama-2-70b-chat-hf to the RKLLM format for Rockchip devices.
+This runs on the NPU from the RK3588.
+# But wait... will this run on my RK3588?
+No. But I found interesting to see what happens if I converted it.
+Let's hope Microsoft never knows that I was using their SSDs as swap because they don't allow more than 32 GB RAM for student subscription :P
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/660da5d45d68779a53384179/lWqXKM3R_0_3Vlv-6yHji.png)
+And this is before finishing, it will probably go to 600 GBs of RAM.
+But hey! You can always try yourself getting a 512 GB SSD, a 32 GB of RAM SBC, have some patience and see if it loads. Good luck with that!
+# Main repo
+See this for my full collection of converted LLMs for the RK3588's NPU:
+https://huggingface.co/Pelochus/ezrkllm-collection
+# License
+Same as the original LLM:
+https://huggingface.co/meta-llama/Llama-2-70b-chat-hf/blob/main/LICENSE.txt