
Add a way to preview an image's interim status

Open martinlexow opened this issue 2 years ago • 5 comments

It would be great if we could get the data of each intermediate step as an image.

This way we could build a preview in our UIs like this:

[animated preview: "catAI" showing intermediate generation steps]

This would improve the user experience, especially on iPhones and iPads, where computing times are usually longer than on a Mac.

martinlexow avatar Dec 27 '22 16:12 martinlexow

As an example, in my sample iOS app, I display the intermediate images step by step.

  • https://github.com/ynagatomo/ARDiffMuseum

ynagatomo avatar Dec 27 '22 22:12 ynagatomo

Yes I’d love this

icpryde avatar Dec 29 '22 15:12 icpryde

@ynagatomo Thanks so much for your example code! Could you point us in the right direction: where have you implemented the preview method? I wasn't able to find it.

martinlexow avatar Dec 29 '22 15:12 martinlexow

Hi. In the ImageGenerator.swift:

  1. Set the progress handler when calling the `generateImages()` method:

     ```swift
     let cgImages = try sdPipeline.generateImages(prompt: param.prompt,
                                                  negativePrompt: param.negativePrompt,
                                                  imageCount: param.imageCount,
                                                  stepCount: param.stepCount,
                                                  seed: UInt32(param.seed),
                                                  guidanceScale: param.guidanceScale,
                                                  disableSafety: param.disableSafety,
                                                  progressHandler: self.progressHandler) // <-- progress handler
     ```

  2. The apple/ml-stable-diffusion library provides the intermediate images via the progress handler's `StableDiffusionPipeline.Progress` parameter:

     ```swift
     nonisolated func progressHandler(progress: StableDiffusionPipeline.Progress) -> Bool {
         debugLog("IG: Progress: step / stepCount = \(progress.step) / \(progress.stepCount)")

         if ProcessInfo.processInfo.isiOSAppOnMac {
             let generatedImages = GeneratedImages(
                 parameter: GenerationParameter(prompt: progress.prompt,
                                                negativePrompt: "unknown", // progress does not provide this now
                                                guidanceScale: 0.0, // progress does not provide this now
                                                seed: 0,
                                                stepCount: progress.stepCount,
                                                imageCount: progress.currentImages.count,
                                                disableSafety: progress.isSafetyEnabled),
                 images: progress.currentImages.compactMap {
                     if let cgImage = $0 {
                         return UIImage(cgImage: cgImage)
                     } else {
                         return nil
                     }
                 })

             DispatchQueue.main.async {
                 self.setGeneratedImages(generatedImages)
                 self.setProgressStep(step: progress.step, stepCount: progress.stepCount)
             }
         } else {
             DispatchQueue.main.async {
                 self.setProgressStep(step: progress.step, stepCount: progress.stepCount)
             }
         }

         return true // continue
     }
     ```

In my sample code, I convert the intermediate images (CGImage) to UIImage and display them in SwiftUI views.
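As a minimal sketch of that last step (this view and its property names are illustrative, not taken from the sample app), the UIImage produced in the handler could be shown like this:

```swift
import SwiftUI

// Illustrative sketch: a view that shows the latest intermediate image
// published by the generator, with a step indicator underneath.
struct IntermediatePreviewView: View {
    let image: UIImage?   // latest intermediate image, if any has been decoded yet
    let step: Int
    let stepCount: Int

    var body: some View {
        VStack {
            if let image {
                Image(uiImage: image)
                    .resizable()
                    .scaledToFit()
            } else {
                ProgressView() // nothing decoded yet
            }
            Text("Step \(step) / \(stepCount)")
        }
    }
}
```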

ynagatomo avatar Dec 30 '22 00:12 ynagatomo

Getting the intermediate image takes time, so the above code, as the simplest case, skips it on iPhone and iPad and only does it on macOS. For example, on an iPad Pro 2020 (A12Z), generating one image (512x512 px) with 20 steps takes:

  • 7 min 20 sec with 20 intermediate images displayed
  • 2 min without displaying intermediate images

I recommend adding UIs to enable/disable intermediate-image display and to set the step interval for displaying intermediate images. (I will do this in my project later: https://github.com/ynagatomo/ARDiffMuseum)
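That step-interval idea could be sketched roughly as follows (a hedged sketch, not from the sample app: `previewInterval`, `showIntermediateImages`, and `publishImages` are hypothetical names). The point is to touch `currentImages` (which triggers the costly latent-to-image decode) only on selected steps:

```swift
// Hypothetical throttling sketch: decode and publish intermediate images
// only every `previewInterval` steps, while still updating the step counter.
let previewInterval = 5            // user-configurable step interval (assumed)
let showIntermediateImages = true  // user-configurable toggle (assumed)

nonisolated func progressHandler(progress: StableDiffusionPipeline.Progress) -> Bool {
    let isPreviewStep = progress.step % previewInterval == 0

    if showIntermediateImages && isPreviewStep {
        // Only access currentImages here: the decode is the expensive part.
        let images = progress.currentImages.compactMap { cgImage in
            cgImage.map { UIImage(cgImage: $0) }
        }
        DispatchQueue.main.async {
            self.publishImages(images) // hypothetical UI-update method
        }
    }

    DispatchQueue.main.async {
        self.setProgressStep(step: progress.step, stepCount: progress.stepCount)
    }
    return true // continue generation
}
```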

ynagatomo avatar Dec 30 '22 04:12 ynagatomo