Correct timestep for img2img initial noise addition by wfng92 · Pull Request #1946 · invoke-ai/InvokeAI

wfng92 · 2022-12-12T08:33:51Z

There seems to be difference between huggingface's diffusers and compvis's implementation for img2img as the wrong timestep is used for the additive noise for image-2-image.

Summary from the relevant PR raised by the HuggingFace team:

t+1 is used as the timestep to add noise to the original image, but [0, ..., t] is used afterwards for the denoising process. We should however also use t when adding the noise to the original image.
This can be quite easily verified by doing the following. Run a img2img with a small number of update steps and a very low strength because then differences between t and t+1 become quite clear.

From

torch.tensor([t_enc]).to(self.model.device)

to

torch.tensor([t_enc - 1]).to(self.model.device)

For example, using the following init_img and configuration:

prompt: A fantasy landscape, trending on artstation
strength: 0.1
seed: 42
sampler_name: DDIM
steps: 10

The existing code will generate an image with the following noise:

The new implementation will produce an image with lesser noise:

This PR affects only the img2img script. I realized that text2img2img and inpainting share the same code base but I am not sure if the same implementation should be applied as well.

lstein · 2022-12-12T14:16:38Z

Thanks so much for finding this problem. I will test it out the proposed solution later today.

lstein

Thank you for finding and fixing this. I have applied your fix to the inpaint and txt2img2img paths as well.

Correct timestep for img2img initial noise addition

18c4c9a

apply fix to inpaint and txt2img2img as well

06443a1

lstein approved these changes Dec 12, 2022

View reviewed changes

Merge branch 'main' into patch-3

6ac4184

lstein enabled auto-merge (rebase) December 13, 2022 19:32

Merge branch 'main' into patch-3

9fa1bed

lstein disabled auto-merge December 15, 2022 20:59

lstein merged commit 2a2f38a into invoke-ai:main Dec 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correct timestep for img2img initial noise addition#1946

Correct timestep for img2img initial noise addition#1946
lstein merged 4 commits intoinvoke-ai:mainfrom
wfng92:patch-3

wfng92 commented Dec 12, 2022

Uh oh!

lstein commented Dec 12, 2022

Uh oh!

lstein left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

wfng92 commented Dec 12, 2022

Uh oh!

lstein commented Dec 12, 2022

Uh oh!

lstein left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants