Skip to content

Latest commit

 

History

History
186 lines (97 loc) · 11 KB

File metadata and controls

186 lines (97 loc) · 11 KB

Merging models

OK. Now it is being popular in LLM community. Meanwhile, it is faded in SD community. Crap.

Blue pill

Blue pill, but in academic paper

24022701.jpg

Github repos (mergers / algorithm implementations) excluding MBW / LBW

MBW / LBW related topics

Not "merge" but worth mentioning

Nice merges

  • majicMIX realistic: Better cosplay model (series). The "chained" mix is great, covering nice models from all "dimensions".

  • Chilloutmix: Cosplay model. However there is no cosplayer in dataset. Just merging "real photo" and "anime" together.

  • AbyssOrangeMix2: Realistic anime style. More focus on muscle and proportions, which is lack in most anime models. Merging "real photo" and "anime" also.

  • PastelMix: At least there is a clear theme, without owning the dataset.

  • Lawlas's yiffymix: There is way too many speices to train. AI will get confused. yiffy-e18 is an example.

  • AnythingV3: SOTA for hitting the perfect spot of the market desire.

  • Bayesian Merger, SD-Silicon: A model using auto RL to select merging hyperparameters.

My merge (shameless advertisement)

Merge by attention blocks (exclusive)

targets:
  - index: ["attentions"]
    targets:
      - targets:
          - index: ["attn1"]
  • "CC" found that there is no clear pattern per model, as some models contribute by "FF", meanwhile some others are "sattn / xattn". Twitter post.

The original and eatly stage on merging

The mergin method that was forgotten (nice try?)

Start ranting

Try to read thesis and don't try to dream about the blackbox.

img/photo_2023-01-01_02-45-25.jpg

img/photo_2023-01-01_03-02-49.jpg

img/0af5496675d3d85b8879bf46b3602b79e3a3c7c0160f98448b01c16b5242801f.jpg

  • So colorful...

img/23031201.png

img/23031202.png