diff --git a/README.md b/README.md
index 9f44d37..740368b 100644
--- a/README.md
+++ b/README.md
@@ -18,7 +18,7 @@ Specifically, I have a broad interest in systems (e.g., OSDI, SOSP, NSDI, ATC, E
 
 ## Changelogs
 
-* 07/2024: Organize the papers of [ATC 2024](reading-notes/conference/atc-2024.md), [OSDI 2024](reading-notes/conference/osdi-2024.md), [NSDI 2024](reading-notes/conference/nsdi-2024.md), [CVPR 2024](reading-notes/conference/cvpr-2024.md), [ISCA 2024](reading-notes/conference/isca-2024.md); create a new paper list of [Systems for diffusion models](paper-list/systems-for-ml/diffusion-models.md); update the paper list of [Systems for LLMs](paper-list/systems-for-ml/llm.md), [Systems for DLRMs](paper-list/systems-for-ml/dlrm.md), [Resource Scheduler](paper-list/systems-for-ml/resource-scheduler.md).
+* 07/2024: Organize the papers of [SIGCOMM 2024](reading-notes/conference/sigcomm-2024.md), [ICML 2024](reading-notes/conference/icml-2024.md), [ATC 2024](reading-notes/conference/atc-2024.md), [OSDI 2024](reading-notes/conference/osdi-2024.md), [NSDI 2024](reading-notes/conference/nsdi-2024.md), [CVPR 2024](reading-notes/conference/cvpr-2024.md), [ISCA 2024](reading-notes/conference/isca-2024.md); create a new paper list of [Systems for diffusion models](paper-list/systems-for-ml/diffusion-models.md); update the paper list of [Systems for LLMs](paper-list/systems-for-ml/llm.md), [Systems for DLRMs](paper-list/systems-for-ml/dlrm.md), [Resource Scheduler](paper-list/systems-for-ml/resource-scheduler.md).
 
 ## Epilogue
 
diff --git a/SUMMARY.md b/SUMMARY.md
index c7f6ca3..cc6bb29 100644
--- a/SUMMARY.md
+++ b/SUMMARY.md
@@ -5,11 +5,11 @@
 ## Paper List
 
 * [Systems for ML](paper-list/systems-for-ml/README.md)
-  * [Large Language Model (LLM)](paper-list/systems-for-ml/llm.md)
   * [Data Processing](paper-list/systems-for-ml/data-processing.md)
   * [Deep Learning Training](paper-list/systems-for-ml/deep-learning-training.md)
   * [Resource Scheduler](paper-list/systems-for-ml/resource-scheduler.md)
   * [Model Serving](paper-list/systems-for-ml/model-serving.md)
+  * [Large Language Model (LLM)](paper-list/systems-for-ml/llm.md)
   * [Diffusion Models](paper-list/systems-for-ml/diffusion-models.md)
   * [Deep Learning Recommendation Model (DLRM)](paper-list/systems-for-ml/dlrm.md)
   * [Mixture of Experts (MoE)](paper-list/systems-for-ml/moe.md)
diff --git a/paper-list/systems-for-ml/diffusion-models.md b/paper-list/systems-for-ml/diffusion-models.md
index 5e42131..f1fe27f 100644
--- a/paper-list/systems-for-ml/diffusion-models.md
+++ b/paper-list/systems-for-ml/diffusion-models.md
@@ -2,6 +2,8 @@
 
 ## Serving Diffusion Models
 
+* PipeFusion: Displaced Patch Pipeline Parallelism for Inference of Diffusion Transformer Models (arXiv:2405.14430) \[[arXiv](https://arxiv.org/abs/2405.14430)] \[[Code](https://github.com/xdit-project/xDiT)]
+  * Tencent & HKU
 * Cache Me if You Can: Accelerating Diffusion Models through Block Caching ([CVPR 2024](../../reading-notes/conference/cvpr-2024.md)) \[[Paper](https://openaccess.thecvf.com/content/CVPR2024/html/Wimbauer\_Cache\_Me\_if\_You\_Can\_Accelerating\_Diffusion\_Models\_through\_Block\_CVPR\_2024\_paper.html)] \[[Homepage](https://fwmb.github.io/blockcaching/)]
   * Meta & TUM & MCML & Oxford
 * CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model ([CVPR 2024](../../reading-notes/conference/cvpr-2024.md)) \[[Paper](https://openaccess.thecvf.com/content/CVPR2024/html/Zeng\_CAT-DM\_Controllable\_Accelerated\_Virtual\_Try-on\_with\_Diffusion\_Model\_CVPR\_2024\_paper.html)] \[[Code](https://github.com/zengjianhao/CAT-DM)]