### Describe the bug Awsome works to train ControlNet with Diffuser from the [tutorial](https://github.com/huggingface/diffusers/tree/main/examples/controlnet) I run the code and compare it with the original training code [here](https://github.com/lllyasviel/ControlNet/blob/main/docs/train.md). It turns out to be the convergence is slower than using the diffuser on Fill5k than the original training code. ### Reproduction training on Fill5k with batch size 5 ### Logs _No response_ ### System Info - `diffusers` version: 0.15.0.dev0 - Platform: Linux-5.4.0-139-generic-x86_64-with-glibc2.17 - Python version: 3.8.16 - PyTorch version (GPU?): 1.13.1+cu117 (True) - Huggingface_hub version: 0.13.3 - Transformers version: 4.27.3 - Accelerate version: 0.18.0.dev0 - xFormers version: 0.0.16