Releases: JamePeng/llama-cpp-python
v0.3.37-cu130-Basic-win-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu130-Basic-linux-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu128-Basic-win-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu128-Basic-linux-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu126-Basic-win-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu126-Basic-linux-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu124-Basic-win-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-cu124-Basic-linux-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.37-Metal-macos-20260502
Bump version to 0.3.37
Release Note: [0.3.37] Release Note: MoE CPU Offloading, N-Gram Speculative Decoding, Thread-Safe Abort & New LLM Wiki
Changlog see here: 0.3.37 Changelog
Signed-off-by: JamePeng jame_peng@sina.com
v0.3.36-cu130-Basic-win-20260417
Bump version to 0.3.36
Changlog see here: 0.3.36 Changelog
Signed-off-by: JamePeng jame_peng@sina.com