revamp model card to work by default and provide quanto hints #1133

bghira · 2024-11-10T12:53:35Z

No description provided.

bghira · 2024-11-10T14:07:50Z

new example model card code snippet:

import torch
from diffusers import DiffusionPipeline
from lycoris import create_lycoris_from_weights


def download_adapter(repo_id: str):
    import os
    from huggingface_hub import hf_hub_download
    adapter_filename = "pytorch_lora_weights.safetensors"
    cache_dir = os.environ.get('HF_PATH', os.path.expanduser('~/.cache/huggingface/hub/models'))
    cleaned_adapter_path = repo_id.replace("/", "_").replace("\\", "_").replace(":", "_")
    path_to_adapter = os.path.join(cache_dir, cleaned_adapter_path)
    path_to_adapter_file = os.path.join(path_to_adapter, adapter_filename)
    os.makedirs(path_to_adapter, exist_ok=True)
    hf_hub_download(
        repo_id=repo_id, filename=adapter_filename, local_dir=path_to_adapter
    )

    return path_to_adapter_file
    
model_id = 'stabilityai/stable-diffusion-3.5-medium'
adapter_repo_id = 'bghira/sd35m-photo-SLG-shift0.8-attn_ff'
adapter_filename = 'pytorch_lora_weights.safetensors'
adapter_file_path = download_adapter(repo_id=adapter_repo_id)
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
lora_scale = 1.0
wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_file_path, pipeline.transformer)
wrapper.merge_to()

prompt = "A test prompt"


## Optional: quantise the model to save VRAM.
## Note: the model was quantised during training, so it is recommended to do the same at inference time.
from optimum.quanto import quantize, freeze, qint8
quantize(pipeline.transformer, weights=qint8)
freeze(pipeline.transformer)
    
# Pick the best available device once and reuse it for the pipeline and the generator.
device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
pipeline.to(device) # the pipeline is already in its target precision level
image = pipeline(
    prompt=prompt,
    num_inference_steps=50,
    generator=torch.Generator(device=device).manual_seed(1641421826),
    width=512,
    height=512,
    guidance_scale=7.5,
).images[0]
image.save("output.png", format="PNG")
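
The path handling inside `download_adapter` can be checked on its own, without network access or a GPU. The sketch below is an illustration only and not part of the model card template: `adapter_cache_path` is a hypothetical helper name, and `/tmp/hf-cache` is an assumed cache directory. It mirrors the character replacement and path joining used in the snippet above.

```python
import os


def adapter_cache_path(
    repo_id: str,
    cache_dir: str,
    filename: str = "pytorch_lora_weights.safetensors",
) -> str:
    """Return the local path where the adapter file for repo_id would be stored.

    Hypothetical helper: it only builds the path and does not download anything.
    """
    # Replace separators that would otherwise create nested or invalid directories.
    cleaned = repo_id.replace("/", "_").replace("\\", "_").replace(":", "_")
    return os.path.join(cache_dir, cleaned, filename)


# Assumed cache directory, for illustration only.
print(adapter_cache_path("bghira/sd35m-photo-SLG-shift0.8-attn_ff", "/tmp/hf-cache"))
```

Keeping the path logic separate from the `hf_hub_download` call makes it possible to verify where the adapter will land before any download happens.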

bghira added 6 commits November 8, 2024 09:53

validation: move prompt embeds to device/inference dtype

7141ec3

Merge branch 'main' of ssh://github.com/bghira/SimpleTuner into feature/sd3-model-card-details

03afab1

Merge branch 'feature/flow-matching-uniform-sampling' of ssh://github.com/bghira/SimpleTuner into feature/sd3-model-card-details

f0892bb

sd3: update qs

fd33cf1

model card should not have extra comma when guidance rescale is disabled

814f972

add more lycoris and quanto details to the model card

49eb37a

bghira changed the title ~~fix typo in model card example code when no rescale is enabled~~ revamp model card to work by default and provide quanto hints Nov 10, 2024

bghira merged commit c2701f6 into main Nov 10, 2024
1 check passed

bghira deleted the feature/sd3-model-card-details branch November 10, 2024 14:08

bghira restored the feature/sd3-model-card-details branch November 10, 2024 15:13

bghira deleted the feature/sd3-model-card-details branch November 11, 2024 17:55
