Update README.md
Browse files
README.md
CHANGED
@@ -22,7 +22,24 @@ alpha: pretrain on 2M dataset, smaller batch size. Limited ability
|
|
22 |
beta: pretrain on 5.3M dataset, larger batch size. More stable, better ability with only a few information provided.
|
23 |
|
24 |
## Examples
|
25 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
## Model arch
|
28 |
This version of DTG is trained from scratch with 400M param LLaMA arch.(In my personal preference I will call it NanoLLaMA)
|
|
|
22 |
beta: pretrain on 5.3M dataset, larger batch size. More stable, better ability with only a few information provided.
|
23 |
|
24 |
## Examples
|
25 |
+
Base prompt:
|
26 |
+
```
|
27 |
+
1girl,
|
28 |
+
|
29 |
+
daring tact \(umamusume\), umamusume,
|
30 |
+
|
31 |
+
kxl-delta-style1,
|
32 |
+
|
33 |
+
horse girl, horse tail, horse ears, cafe, table, chair,
|
34 |
+
|
35 |
+
masterpiece, newest, absurdres, safe
|
36 |
+
```
|
37 |
+
||Without DTG|DTG-Alpha|DTG-Beta|
|
38 |
+
|-|-|-|-|
|
39 |
+
|Prompts|Base prompt|Base propmt + "plant, necktie, tail, indoors, skirt, looking at viewer, cup, lounge chair, green theme, book, alternate costume, potted plant, hair ornament, blue jacket, blush, medium hair, black necktie, green eyes, jacket, animal ears, black hair, round eyewear, bookshelf, adjusting eyewear, ahoge, smile, solo, window, brown hair, crossed legs, glasses, closed mouth, book stack,"| base propmt + "jacket, sitting on table, food, tail, collar, horse racing, black hair, boots, school bag, bag, full body, blue eyes, hair ornament, animal ears, ahoge, sitting, thighhighs, blurry background, looking at viewer, school uniform, long hair, blurry, cup, window, crossed legs, alternate costume, medium breasts, breasts, calendar \(object\), casual, door, solo, disposable cup,"|
|
40 |
+
|Result image||||
|
41 |
+
|Performance| |It can generate image with more elements and details, but the coherence with character is not good|Way better than alpha, also provide lot more details and better composition|
|
42 |
+
|
43 |
|
44 |
## Model arch
|
45 |
This version of DTG is trained from scratch with 400M param LLaMA arch.(In my personal preference I will call it NanoLLaMA)
|