Logo
Explore Help
Register Sign In
RYDE-WORK/ktransformers
1
0
Fork 0
You've already forked ktransformers
mirror of https://github.com/RYDE-WORK/ktransformers.git synced 2026-01-19 21:03:18 +08:00
Code Issues Actions Packages Projects Releases Wiki Activity
ktransformers/ktransformers
History
TangJingqi de3faaf55d Update readme; add pipeline tutorial; add detailed inject tutorial
2024-08-15 20:42:54 +08:00
..
configs
Initial commit
2024-07-27 16:06:58 +08:00
ktransformers_ext
[feature] support q2_k & q3_k dequantize on gpu
2024-08-12 12:53:12 +00:00
models
[ADD] support multi-gpu qlen>1 q5_k
2024-08-12 11:41:26 +00:00
operators
[fix] format classes and files name
2024-08-15 10:44:59 +08:00
optimize
Update readme; add pipeline tutorial; add detailed inject tutorial
2024-08-15 20:42:54 +08:00
server
[feature] experts can be injected using CPUInfer
2024-08-14 16:10:54 +08:00
tests
[fix] format classes and files name
2024-08-15 10:44:59 +08:00
util
[feature] experts can be injected using CPUInfer
2024-08-14 16:10:54 +08:00
website
Initial commit
2024-07-27 16:06:58 +08:00
__init__.py
[feature] add github action for pre compile
2024-08-14 16:54:50 +00:00
local_chat.py
[ADD] support multi-gpu qlen>1 q5_k
2024-08-12 11:41:26 +00:00
Powered by Gitea Version: 1.23.8 Page: 16ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API