Retain alpha in `pil_resize` for `--alpha_mask` #1619

emcmanus · 2024-09-19T21:31:05Z

Currently `pil_resize()` drops the alpha channel when `--alpha_mask` is supplied, but only when the image does not exceed the bucket size in both width and height.

This codepath is entered on the last line, here:

```
def trim_and_resize_if_required(
    random_crop: bool, image: np.ndarray, reso, resized_size: Tuple[int, int]
) -> Tuple[np.ndarray, Tuple[int, int], Tuple[int, int, int, int]]:
    image_height, image_width = image.shape[0:2]
    original_size = (image_width, image_height)  # size before resize

    if image_width != resized_size[0] or image_height != resized_size[1]:
        # resize
        if image_width > resized_size[0] and image_height > resized_size[1]:
            image = cv2.resize(image, resized_size, interpolation=cv2.INTER_AREA)  # resize with cv2 because we want INTER_AREA
        else:
            image = pil_resize(image, resized_size)
```
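
One way to keep the alpha channel through a Pillow resize can be sketched as below. This is a minimal illustration, not the code merged in this PR: `pil_resize_keep_alpha` is a hypothetical name, the sketch uses only NumPy and Pillow, and it assumes an H x W x C `uint8` array whose 4th channel, if present, is alpha.

```python
import numpy as np
from PIL import Image


def pil_resize_keep_alpha(image: np.ndarray, size, resample=Image.LANCZOS) -> np.ndarray:
    """Resize an H x W x C uint8 array with Pillow (hypothetical helper).

    `size` is (width, height), as in the excerpt above. A 4-channel input
    is treated as RGBA by Pillow, so the output keeps its 4th channel.
    """
    # Image.fromarray infers "RGB" for 3 channels and "RGBA" for 4 channels,
    # so no explicit 3-channel conversion is applied that would discard alpha.
    pil_image = Image.fromarray(image)
    resized = pil_image.resize(size, resample)
    return np.array(resized)


# Usage: upscale a small RGBA image and check that alpha is still there.
rgba = np.zeros((8, 6, 4), dtype=np.uint8)
rgba[..., 3] = 255  # fully opaque
resized_rgba = pil_resize_keep_alpha(rgba, (12, 16))
print(resized_rgba.shape)  # 4 channels are expected in the last dimension
```

Only the position of the alpha channel matters for this sketch; the order of the three color channels is passed through unchanged.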


kohya-ss · 2024-09-20T13:24:43Z

Thank you for this!

Maru-mee · 2024-09-22T06:46:55Z

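Unless I am mistaken, this change was applied only to sd3 and has not been reflected in the dev version.
However, the same problem (*1) appears to occur in the dev version as well, so I would appreciate a merge there if possible.
It is related to PR #1632, and I would like to have it resolved first.

*1 The problem is as follows:
PIL drops the alpha channel, leaving a 3-channel image
→ when alpha_mask is created,
`if image.shape[2] == 4:` is not satisfied, and execution branches to
else:
alpha_mask = torch.ones_like(image[:, :, 0], dtype=torch.float32) # [H,W]
where it is forcibly stopped.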
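
The channel check described in *1 can be sketched as follows. This is only an illustration under stated assumptions: `make_alpha_mask` is a hypothetical helper, NumPy stands in for torch, and the fallback simply returns an all-ones mask rather than reproducing the failure reported above.

```python
import numpy as np


def make_alpha_mask(image: np.ndarray) -> np.ndarray:
    """Return an H x W float32 mask in [0, 1] for an H x W x C uint8 image."""
    if image.shape[2] == 4:
        # Alpha channel present: scale it from [0, 255] to [0, 1].
        return image[:, :, 3].astype(np.float32) / 255.0
    # Alpha channel missing (e.g. lost during a resize): this branch is taken
    # instead, and the per-pixel mask information is gone.
    return np.ones(image.shape[:2], dtype=np.float32)


# Usage: the same picture with and without its alpha channel.
rgba = np.zeros((4, 4, 4), dtype=np.uint8)
rgba[:2, :, 3] = 255          # top half opaque, bottom half transparent
mask_from_rgba = make_alpha_mask(rgba)
mask_from_rgb = make_alpha_mask(rgba[:, :, :3])  # alpha already dropped
```

With the 4-channel input the mask follows the alpha values; with the 3-channel input every pixel ends up in the `else` branch's uniform mask.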

kohya-ss · 2024-09-23T12:17:24Z

I have made the same change on the dev branch as well.

Maru-mee · 2024-09-24T13:24:45Z

Thank you very much!

emcmanus added 2 commits September 19, 2024 14:30

Update utils.py

de4bb65

Cleanup

kohya-ss merged commit 95ff9db into kohya-ss:sd3 Sep 20, 2024
1 check passed

Maru-mee mentioned this pull request Sep 22, 2024

fix import and make npz for alpha_mask #1632

Closed

kohya-ss added a commit that referenced this pull request Sep 23, 2024

retain alpha in pil_resize backport #1619

29177d2
