From 01df4a6b3ff2b1c6604b71babfc238a64c841606 Mon Sep 17 00:00:00 2001
From: Mingxing Zhang
Date: Sat, 27 Jul 2024 23:07:31 +0800
Subject: [PATCH 1/2] Update README.md

update videos
---
 README.md | 16 +++++++++++++---
 1 file changed, 13 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 95a8c46..a4bc878 100644
--- a/README.md
+++ b/README.md
@@ -3,7 +3,8 @@

- DeepSeek-Coder-V2 Score
+ KTransformers
+

@@ -26,7 +27,11 @@ Our vision for KTransformers is to serve as a flexible platform for experimentin

GPT-4-level Local VSCode Copilot on a Desktop with only 24GB VRAM

- https://github.com/user-attachments/assets/3f85780e-aa53-4d2f-91b2-5585c8dade85
+
+Uploading ktransformers_vs_llamacpp.mp4…
+
+
+

@@ -44,7 +49,12 @@ Our vision for KTransformers is to serve as a flexible platform for experimentin

- https://github.com/user-attachments/assets/e6e27cb3-8372-44e6-8f1f-34402eae56c1
+
+
+https://github.com/user-attachments/assets/4cf59f33-8f67-4806-a5c0-540e49bb8305
+
+
+

From 935c28c27788a374027e7a8a780fab690c2c316b Mon Sep 17 00:00:00 2001
From: Mingxing Zhang
Date: Sat, 27 Jul 2024 23:12:00 +0800
Subject: [PATCH 2/2] Update README.md

---
 README.md | 16 ++--------------
 1 file changed, 2 insertions(+), 14 deletions(-)

diff --git a/README.md b/README.md
index a4bc878..db39365 100644
--- a/README.md
+++ b/README.md
@@ -26,20 +26,14 @@ Our vision for KTransformers is to serve as a flexible platform for experimentin

🔥 Show Cases

GPT-4-level Local VSCode Copilot on a Desktop with only 24GB VRAM

-
-
-Uploading ktransformers_vs_llamacpp.mp4…
-
-
-
-
+https://github.com/user-attachments/assets/38ee32c7-4280-4563-b068-78181f9c3694

- **Local 236B DeepSeek-Coder-V2:** Running its Q4_K_M version using only 21GB VRAM and 136GB DRAM, attainable on a local desktop machine, which scores even better than GPT4-0613 in [BigCodeBench](https://huggingface.co/blog/leaderboard-bigcodebench).

- DeepSeek-Coder-V2 Score
+ DeepSeek-Coder-V2 Score

@@ -47,14 +41,8 @@ Uploading ktransformers_vs_llamacpp.mp4…
- **VSCode Integration:** Wrapped into an OpenAI and Ollama compatible API for seamless integration as a backend for [Tabby](https://github.com/TabbyML/tabby) and various other frontends.

-
-
- https://github.com/user-attachments/assets/4cf59f33-8f67-4806-a5c0-540e49bb8305
-
-
-