apg update by wbruna · Pull Request #9 · stduhpf/stable-diffusion.cpp

wbruna · 2026-04-23T22:02:51Z

@stduhpf , here is my last attempt to update the APG branch.

The original branch became very hard to merge: it would have to cross the src/ rename, the Tensor changes, and the result would be "merge"in name only.... So I ported it by LLM-assisted hand to the new interfaces, reimplemented the command line, and added server API support. I've also replaced the env var with a debug log line, to avoid that specific barrier to merging.

I chose a somewhat random base branch to send it to you just because it was less distant from upstream than the others on your repo.

At least plain text-to-image with an LCM model seems to work. I noticed something that looks like a bug on this chunk (original diff):

Details

@@ -1322,18 +1414,18 @@ public:
                 float latent_result = positive_data[i];
                 if (has_unconditioned) {
                     // out_uncond + cfg_scale * (out_cond - out_uncond)
-                    if (has_img_cond) {
-                        // out_uncond + text_cfg_scale * (out_cond - out_img_cond) + image_cfg_scale * (out_img_cond - out_uncond)
-                        latent_result = negative_data[i] + img_cfg_scale * (img_cond_data[i] - negative_data[i]) + cfg_scale * (positive_data[i] - img_cond_data[i]);
-                    } else {
-                        // img_cfg_scale == cfg_scale
-                        latent_result = negative_data[i] + cfg_scale * (positive_data[i] - negative_data[i]);
+                    float delta = deltas[i];
+
+                    if (cfg_scale != 1) {
+                        latent_result = positive_data[i] + (cfg_scale - 1) * delta;
+                    } else if (has_img_cond) {
+                        latent_result = positive_data[i] + (img_cfg_scale - 1) * delta;
                     }
                 } else if (has_img_cond) {
                     // img_cfg_scale == 1
                     latent_result = img_cond_data[i] + cfg_scale * (positive_data[i] - img_cond_data[i]);
                 }
-                if (is_skiplayer_step) {
+                if (is_skiplayer_step && slg_scale != 0.0) {
                     latent_result = latent_result + (positive_data[i] - skip_layer_data[i]) * slg_scale;
                 }
                 // v = latent_result, eps = latent_result

The line latent_result = img_cond_data[i] + cfg_scale * (positive_data[i] - img_cond_data[i]); should be latent_result = img_cond_data[i] + cfg_scale * delta;?

And a few LLM reviewers suggested other changes that could be reasonable; I've kept them in a separate commit.

If you still want to move this forward (and I'd understand if you didn't...), I'd suggest just starting over the branch like this. And feel free to take ownership if you wish; I did add you as co-author, but you did most of the hard work.

…#1428)

…leejet#1437)

Ported from leejet#593 . Co-authored-by: Stéphane du Hamel <stephduh@live.fr>

leejet and others added 19 commits April 17, 2026 00:51

feat: add ernie image support (leejet#1427)

5c243db

feat: SDXS-09 support and update doc (leejet#1356)

d73b419

feat: add er_sde sampler (leejet#1403)

1b4e9be

fix: skip empty prompt segments around attention range (leejet#1429)

84fc544

refactor: remove is_xl guard wrapper in get_sd_version (leejet#1430)

a564fdf

fix: correct dpm++2s_a second model call (leejet#1435)

2bcff67

fix: tune ernie-image default flow shift (leejet#1433)

6a9cb31

feat: add DPM++ (2S) Ancestral implementation for flow models (leejet…

f3f69e2

…#1428)

feat(server): implement vid_gen async API and mode-aware capabilities (…

4d626d2

…leejet#1437)

ci: skip docker image build job on pull requests (leejet#1439)

3c99f70

chore: enable MSVC parallel compilation with /MP (leejet#1438)

7d33d4b

feat: adapt LCM for flow models (leejet#1413)

e77e4c4

fix: correct image to image DDIM and TCD (leejet#1410)

7023fc4

refactor: move model file IO into dedicated module (leejet#1442)

6614334

feat: add restricted torch legacy checkpoint loading (leejet#1443)

0a7ae07

feat: support safetensors export in convert mode (leejet#1444)

44cca3d

feat: add SLG Unconditinal support

686c314

Ported from leejet#593 . Co-authored-by: Stéphane du Hamel <stephduh@live.fr>

feat: add APG support

321918c

Ported from leejet#593 . Co-authored-by: Stéphane du Hamel <stephduh@live.fr>

llm fixes

15bc343

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

apg update#9

apg update#9
wbruna wants to merge 19 commits intostduhpf:nucleusfrom
wbruna:sd_apg_update_202604

wbruna commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

wbruna commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants