Skip to content

Commit 30f7804

Browse files
authored
Merge pull request #1 from JoePenna/main
Bring README.md up to speed with master JoePenna
2 parents aa777fd + a3aa68b commit 30f7804

File tree

1 file changed

+5
-26
lines changed

1 file changed

+5
-26
lines changed

Diff for: README.md

+5-26
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,6 @@
1313
- [Hugging Face Diffusers](#hugging-face-diffusers)
1414

1515
# The Repo Formerly Known As "Dreambooth"
16-
## ...now more accurately described as "Unfrozen Model Textual Inversion for Stable Diffusion"
1716
![image](https://user-images.githubusercontent.com/100188076/192390551-cb89364f-af57-4aed-8f3d-f9eb9b61cf95.png)
1817

1918
## <a name="notes-by-joe-penna"></a> Notes by Joe Penna
@@ -88,28 +87,10 @@ Now, if you wanna try to do this... please read the warnings below first:
8887
- Follow the instructions in the workbook and start training
8988

9089
# <a name="text-vs-dreamb"></a> Textual Inversion vs. Dreambooth
91-
The majority of the code in this repo was written by Rinon Gal et. al, the authors of the Textual Inversion research paper.
90+
The majority of the code in this repo was written by Rinon Gal et. al, the authors of the Textual Inversion research paper. Though a few ideas about regularization images and prior loss preservation (ideas from "Dreambooth") were added in, out of respect to both the MIT team and the Google researchers, I'm renaming this fork to:
91+
*"The Repo Formerly Known As "Dreambooth""*.
9292

93-
A few bits about regularization images were added that we all thought were super important -- all the researchers included!
94-
95-
...until my images were trained under the class "dog":
96-
<br><img src="https://media.discordapp.net/attachments/1024716296610385981/1024933960083587102/unknown.png" width="200">
97-
98-
...and under the nonsensical class "§¶•" instead of "man" or "woman" or "person":
99-
<br><img src="https://media.discordapp.net/attachments/1024716296610385981/1024934146415529984/unknown.png" width="200">
100-
101-
...and with completely blank regularization images:
102-
<br><img src="https://media.discordapp.net/attachments/1023293330601287711/1024933371102629898/IMG_7579.JPG" width="200">
103-
104-
And here's what `"photograph of an apple"` looked like before I messed with code a bit:
105-
<br><img src="https://media.discordapp.net/attachments/1018943815370952855/1018946569850069052/unknown.png" width="200">
106-
107-
We're not realizing the "regularization class" bits of this code have no effect, and that there is little to no prior preservation loss.
108-
109-
So, out of respect to both the MIT team and the Google researchers, I'm renaming this fork to:
110-
*"Unfrozen Model Textual Inversion for Stable Diffusion"*.
111-
112-
For an alternate implementation that attempts prior loss preservation, please see ["Alternate Option"](#hugging-face-diffusers) below.
93+
For an alternate implementation , please see ["Alternate Option"](#hugging-face-diffusers) below.
11394

11495

11596
# <a name="using-the-generated-model"></a> Using the generated model
@@ -205,10 +186,8 @@ No problem. We can fix that with the prompt:
205186

206187
# <a name="hugging-face-diffusers"></a> Hugging Face Diffusers - Alternate Option
207188

208-
Note: This is a diffuser implementation, and use is much more complicated than using a *.ckpy file.
209-
210-
At the moment, there is no way to use the diffusers model with most repos (e.g. AUTOMATIC1111, HLKY, DeForum, etc)
189+
Dreambooth is now supported in HuggingFace Diffusers for training with Stable Diffusion.
211190

212-
Dreambooth is now supported in Hugging Face diffusers for training with stable diffusion, try it out in the colab:
191+
Try it out here:
213192

214193
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_dreambooth_training.ipynb)

0 commit comments

Comments
 (0)