|
13 | 13 | - [Hugging Face Diffusers](#hugging-face-diffusers)
|
14 | 14 |
|
15 | 15 | # The Repo Formerly Known As "Dreambooth"
|
16 |
| -## ...now more accurately described as "Unfrozen Model Textual Inversion for Stable Diffusion" |
17 | 16 | 
|
18 | 17 |
|
19 | 18 | ## <a name="notes-by-joe-penna"></a> Notes by Joe Penna
|
@@ -88,28 +87,10 @@ Now, if you wanna try to do this... please read the warnings below first:
|
88 | 87 | - Follow the instructions in the workbook and start training
|
89 | 88 |
|
90 | 89 | # <a name="text-vs-dreamb"></a> Textual Inversion vs. Dreambooth
|
91 |
| -The majority of the code in this repo was written by Rinon Gal et. al, the authors of the Textual Inversion research paper. |
| 90 | +The majority of the code in this repo was written by Rinon Gal et. al, the authors of the Textual Inversion research paper. Though a few ideas about regularization images and prior loss preservation (ideas from "Dreambooth") were added in, out of respect to both the MIT team and the Google researchers, I'm renaming this fork to: |
| 91 | +*"The Repo Formerly Known As "Dreambooth""*. |
92 | 92 |
|
93 |
| -A few bits about regularization images were added that we all thought were super important -- all the researchers included! |
94 |
| - |
95 |
| -...until my images were trained under the class "dog": |
96 |
| -<br><img src="https://media.discordapp.net/attachments/1024716296610385981/1024933960083587102/unknown.png" width="200"> |
97 |
| - |
98 |
| -...and under the nonsensical class "§¶•" instead of "man" or "woman" or "person": |
99 |
| -<br><img src="https://media.discordapp.net/attachments/1024716296610385981/1024934146415529984/unknown.png" width="200"> |
100 |
| - |
101 |
| -...and with completely blank regularization images: |
102 |
| -<br><img src="https://media.discordapp.net/attachments/1023293330601287711/1024933371102629898/IMG_7579.JPG" width="200"> |
103 |
| - |
104 |
| -And here's what `"photograph of an apple"` looked like before I messed with code a bit: |
105 |
| -<br><img src="https://media.discordapp.net/attachments/1018943815370952855/1018946569850069052/unknown.png" width="200"> |
106 |
| - |
107 |
| -We're not realizing the "regularization class" bits of this code have no effect, and that there is little to no prior preservation loss. |
108 |
| - |
109 |
| -So, out of respect to both the MIT team and the Google researchers, I'm renaming this fork to: |
110 |
| -*"Unfrozen Model Textual Inversion for Stable Diffusion"*. |
111 |
| - |
112 |
| -For an alternate implementation that attempts prior loss preservation, please see ["Alternate Option"](#hugging-face-diffusers) below. |
| 93 | +For an alternate implementation , please see ["Alternate Option"](#hugging-face-diffusers) below. |
113 | 94 |
|
114 | 95 |
|
115 | 96 | # <a name="using-the-generated-model"></a> Using the generated model
|
@@ -205,10 +186,8 @@ No problem. We can fix that with the prompt:
|
205 | 186 |
|
206 | 187 | # <a name="hugging-face-diffusers"></a> Hugging Face Diffusers - Alternate Option
|
207 | 188 |
|
208 |
| -Note: This is a diffuser implementation, and use is much more complicated than using a *.ckpy file. |
209 |
| - |
210 |
| -At the moment, there is no way to use the diffusers model with most repos (e.g. AUTOMATIC1111, HLKY, DeForum, etc) |
| 189 | +Dreambooth is now supported in HuggingFace Diffusers for training with Stable Diffusion. |
211 | 190 |
|
212 |
| -Dreambooth is now supported in Hugging Face diffusers for training with stable diffusion, try it out in the colab: |
| 191 | +Try it out here: |
213 | 192 |
|
214 | 193 | [](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_dreambooth_training.ipynb)
|
0 commit comments