Logo
Explore Help
Register Sign In
RYDE-WORK/ktransformers
1
0
Fork 0
You've already forked ktransformers
mirror of https://github.com/RYDE-WORK/ktransformers.git synced 2026-01-22 22:46:20 +08:00
Code Issues Actions Packages Projects Releases Wiki Activity
ktransformers/ktransformers
History
Atream c189d55bd1 toy support for experts on GPU, no CUDA Graph
2025-02-15 15:16:00 +00:00
..
configs
update rope calculation; update modeling.py; update gate for moe
2025-02-01 07:32:21 +00:00
ktransformers_ext
toy support for experts on GPU, no CUDA Graph
2025-02-15 15:16:00 +00:00
models
Merge pull request #294 from kvcache-ai/feat-fast-MLA
2025-02-14 19:40:36 +08:00
operators
toy support for experts on GPU, no CUDA Graph
2025-02-15 15:16:00 +00:00
optimize
toy support for experts on GPU, no CUDA Graph
2025-02-15 15:16:00 +00:00
server
Add a lock to server inference()
2025-02-13 10:05:22 +00:00
tests
[fix] format classes and files name
2024-08-15 10:44:59 +08:00
util
toy support for experts on GPU, no CUDA Graph
2025-02-15 15:16:00 +00:00
website
✨: refactor local_chat and fix message slice bug in server
2024-11-04 14:02:19 +08:00
__init__.py
[feature] update version and github action jobs for package
2025-02-10 01:00:57 +00:00
local_chat.py
⚡ support force thinking
2025-02-12 12:43:53 +08:00
Powered by Gitea Version: 1.23.8 Page: 28ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API