
Upgrade llama.cpp dependency from b8913 to b8953 #100

Merged
bernardladenthin merged 1 commit into master from claude/update-b8953-compatibility-mvQh7 on Apr 28, 2026
@bernardladenthin
Owner

Summary

This PR updates the pinned llama.cpp version from b8913 to b8953 across the project configuration and documentation.

Changes Made

  • Updated CMakeLists.txt to fetch llama.cpp at tag b8953 (previously b8913)
  • Updated version references in CLAUDE.md and README.md to reflect the new pinned version
  • Added documentation of breaking changes introduced in the b8913–b8953 version range:
    • common/debug.h: Struct and template refactoring (no project impact)
    • tools/server/server-http.h: File upload struct changes (upstream only)
    • src/llama-quant.cpp: Default quantization ftype change (upstream only)
    • src/models/llama.cpp, qwen3.cpp, qwen3moe.cpp: Removed duplicate ggml_mul operations (upstream only)
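
For readers unfamiliar with how the pin is expressed, a bump like this usually comes down to changing one `GIT_TAG` value in a FetchContent declaration. The fragment below is a minimal sketch under that assumption, not a copy of this project's CMakeLists.txt: the declaration name `llama_cpp` is hypothetical, and the repository URL is assumed to be the upstream llama.cpp repository.

```cmake
include(FetchContent)

# Sketch only: the declaration name (llama_cpp) and the repository URL are
# assumptions for illustration, not taken from this project's CMakeLists.txt.
FetchContent_Declare(
    llama_cpp
    GIT_REPOSITORY https://github.com/ggml-org/llama.cpp.git
    GIT_TAG        b8953  # previously b8913
)

# Downloads the pinned sources at configure time and adds them to the build,
# so upstream-internal changes are compiled in without local code edits.
FetchContent_MakeAvailable(llama_cpp)
```

Because the upstream sources are rebuilt from the pinned tag, only changes that alter an API the bindings actually call would require edits on the project side.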

Notes

All identified breaking changes in this version range are either internal to llama.cpp or affect upstream-only code paths, requiring no changes to this project's Java bindings.

https://claude.ai/code/session_019o8EyxotTPXryKkxHjvB6W

No project code changes required — all b8913→b8953 upstream changes
(debug.h API rename, server-http.h uploaded_file struct, quant default
Q5_1→Q8_0, duplicate wo_s removal in model files, GPU backend updates)
are either upstream-internal or auto-compiled via FetchContent.

@bernardladenthin bernardladenthin merged commit 84e0577 into master Apr 28, 2026
16 checks passed
@bernardladenthin bernardladenthin deleted the claude/update-b8953-compatibility-mvQh7 branch April 28, 2026 10:33