diff --git a/dev/.documenter-siteinfo.json b/dev/.documenter-siteinfo.json index a800b2c0..c6c7a66d 100644 --- a/dev/.documenter-siteinfo.json +++ b/dev/.documenter-siteinfo.json @@ -1 +1 @@ -{"documenter":{"julia_version":"1.6.7","generation_timestamp":"2024-11-04T12:53:23","documenter_version":"1.7.0"}} \ No newline at end of file +{"documenter":{"julia_version":"1.6.7","generation_timestamp":"2024-12-15T17:22:58","documenter_version":"1.8.0"}} \ No newline at end of file diff --git a/dev/api/densenet/index.html b/dev/api/densenet/index.html index 3e106626..2ec834b2 100644 --- a/dev/api/densenet/index.html +++ b/dev/api/densenet/index.html @@ -1,5 +1,5 @@ DenseNet · Metalhead.jl

DenseNet

This is the API reference for the DenseNet model present in Metalhead.jl.

The higher level model

Metalhead.DenseNetType
DenseNet(config::Int; pretrain = false, growth_rate = 32,
         reduction = 0.5, inchannels = 3, nclasses = 1000)

Create a DenseNet model with specified configuration. Currently supported values are (121, 161, 169, 201) (reference).

Arguments

  • config: the configuration of the model
  • pretrain: whether to load the model with pre-trained weights for ImageNet.
  • growth_rate: the output feature map growth rate of dense blocks (i.e. k in the reference)
  • reduction: the factor by which the number of feature maps is scaled across each transition
  • inchannels: the number of input channels
  • nclasses: the number of output classes
Warning

DenseNet does not currently support pretrained weights.

See also Metalhead.densenet.

source
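As a quick usage sketch (not part of the docstring above; it assumes Metalhead and Flux are installed and the standard 224×224 ImageNet input size):

using Metalhead, Flux

model = DenseNet(121)               # DenseNet-121 with the default 1000-class head
x = rand(Float32, 224, 224, 3, 1)   # a dummy WHCN image batch
size(model(x))                      # expected to be (1000, 1)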

The core function

Metalhead.densenetFunction
densenet(nblocks::AbstractVector{Int}; growth_rate = 32,
          reduction = 0.5, dropout_prob = nothing, inchannels = 3,
         nclasses = 1000)

Create a DenseNet model (reference).

Arguments

  • nblocks: number of dense blocks between transitions
  • growth_rate: the output feature map growth rate of dense blocks (i.e. k in the reference)
  • reduction: the factor by which the number of feature maps is scaled across each transition
  • dropout_prob: the dropout probability for the classifier head. Set to nothing to disable dropout
  • inchannels: the number of input channels
  • nclasses: the number of output classes
source
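As a hedged sketch of this mid-level entry point: the block counts below are the standard DenseNet-121 configuration, so this should roughly match what DenseNet(121) builds internally:

using Metalhead

# (6, 12, 24, 16) dense-block sizes correspond to DenseNet-121
model = Metalhead.densenet([6, 12, 24, 16]; growth_rate = 32, reduction = 0.5, nclasses = 1000)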
diff --git a/dev/api/efficientnet/index.html b/dev/api/efficientnet/index.html index 0f2872c6..f0da212d 100644 --- a/dev/api/efficientnet/index.html +++ b/dev/api/efficientnet/index.html @@ -1,6 +1,6 @@ EfficientNet family of models · Metalhead.jl

EfficientNet family of models

This is the API reference for the EfficientNet family of models supported by Metalhead.jl.

The higher-level model constructors

Metalhead.EfficientNetType
EfficientNet(config::Symbol; pretrain::Bool = false, inchannels::Integer = 3,
             nclasses::Integer = 1000)

Create an EfficientNet model (reference).

Arguments

  • config: size of the model. Can be one of [:b0, :b1, :b2, :b3, :b4, :b5, :b6, :b7, :b8].
  • pretrain: set to true to load the pre-trained weights for ImageNet
  • inchannels: number of input channels.
  • nclasses: number of output classes.
Warning

EfficientNet does not currently support pretrained weights.

See also Metalhead.efficientnet.

source
Metalhead.EfficientNetv2Type
EfficientNetv2(config::Symbol; pretrain::Bool = false, inchannels::Integer = 3,
               nclasses::Integer = 1000)

Create an EfficientNetv2 model (reference).

Arguments

  • config: size of the network (one of [:small, :medium, :large, :xlarge])
  • pretrain: whether to load the pre-trained weights for ImageNet
  • inchannels: number of input channels
  • nclasses: number of output classes
Warning

EfficientNetv2 does not currently support pretrained weights.

See also efficientnet.

source
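A minimal usage sketch (assuming Metalhead and Flux are available); both constructors take the configuration as a Symbol:

using Metalhead, Flux

effnet = EfficientNet(:b0)           # smallest EfficientNet configuration
effnetv2 = EfficientNetv2(:small)    # EfficientNetv2-S
x = rand(Float32, 224, 224, 3, 1)
size(effnet(x))                      # expected to be (1000, 1)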

The mid-level functions

Metalhead.efficientnetFunction
efficientnet(config::Symbol; norm_layer = BatchNorm, stochastic_depth_prob = 0.2,
             dropout_prob = nothing, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an EfficientNet model. (reference).

Arguments

  • config: size of the model. Can be one of [:b0, :b1, :b2, :b3, :b4, :b5, :b6, :b7, :b8].
  • norm_layer: normalization layer to use.
  • stochastic_depth_prob: probability of stochastic depth. Set to nothing to disable stochastic depth.
  • dropout_prob: probability of dropout in the classifier head. Set to nothing to disable dropout.
  • inchannels: number of input channels.
  • nclasses: number of output classes.
source
Metalhead.efficientnetv2Function
efficientnetv2(config::Symbol; norm_layer = BatchNorm, stochastic_depth_prob = 0.2,
               dropout_prob = nothing, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an EfficientNetv2 model. (reference).

Arguments

  • config: size of the network (one of [:small, :medium, :large, :xlarge])
  • norm_layer: normalization layer to use.
  • stochastic_depth_prob: probability of stochastic depth. Set to nothing to disable stochastic depth.
  • dropout_prob: probability of dropout in the classifier head. Set to nothing to disable dropout.
  • inchannels: number of input channels.
  • nclasses: number of output classes.
source
diff --git a/dev/api/hybrid/index.html b/dev/api/hybrid/index.html index 164d25fd..3ad7aa0c 100644 --- a/dev/api/hybrid/index.html +++ b/dev/api/hybrid/index.html @@ -1,7 +1,7 @@ Hybrid CNN architectures · Metalhead.jl

Hybrid CNN architectures

These models are hybrid CNN architectures that borrow certain ideas from vision transformer models.

The higher-level model constructors

Metalhead.ConvMixerType
ConvMixer(config::Symbol; pretrain::Bool = false, inchannels::Integer = 3,
          nclasses::Integer = 1000)

Creates a ConvMixer model. (reference)

Arguments

  • config: the size of the model, either :base, :small or :large
  • pretrain: whether to load the pre-trained weights for ImageNet
  • inchannels: number of input channels
  • nclasses: number of classes in the output
Warning

ConvMixer does not currently support pretrained weights.

See also Metalhead.convmixer.

source
Metalhead.ConvNeXtType
ConvNeXt(config::Symbol; pretrain::Bool = true, inchannels::Integer = 3,
         nclasses::Integer = 1000)

Creates a ConvNeXt model. (reference)

Arguments

  • config: The size of the model, one of tiny, small, base, large or xlarge.
  • pretrain: set to true to load pre-trained weights for ImageNet
  • inchannels: number of input channels
  • nclasses: number of output classes
Warning

ConvNeXt does not currently support pretrained weights.

See also Metalhead.convnext.

source
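A minimal usage sketch for these constructors (pretrain is passed explicitly below because pretrained weights are not yet available):

using Metalhead

mixer = ConvMixer(:small)                     # ConvMixer with the :small configuration
convnext = ConvNeXt(:tiny; pretrain = false)  # ConvNeXt-T without pretrained weights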

The mid-level functions

Metalhead.convmixerFunction
convmixer(planes::Integer, depth::Integer; kernel_size::Dims{2} = (9, 9),
           patch_size::Dims{2} = (7, 7), activation = gelu,
          inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a ConvMixer model. (reference)

Arguments

  • planes: number of planes in the output of each block
  • depth: number of layers
  • kernel_size: kernel size of the convolutional layers
  • patch_size: size of the patches
  • activation: activation function used after the convolutional layers
  • inchannels: number of input channels
  • nclasses: number of classes in the output
source
Metalhead.convnextFunction
convnext(config::Symbol; stochastic_depth_prob = 0.0, layerscale_init = 1.0f-6,
         inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a ConvNeXt model. (reference)

Arguments

  • config: The size of the model, one of tiny, small, base, large or xlarge.
  • stochastic_depth_prob: Stochastic depth probability.
  • layerscale_init: Initial value for LayerScale (reference)
  • inchannels: number of input channels.
  • nclasses: number of output classes
source
diff --git a/dev/api/inception/index.html b/dev/api/inception/index.html index f1df9543..0ac66adf 100644 --- a/dev/api/inception/index.html +++ b/dev/api/inception/index.html @@ -1,4 +1,4 @@ -Inception family of models · Metalhead.jl

Inception family of models

This is the API reference for the Inception family of models supported by Metalhead.jl.

The higher-level model constructors

Metalhead.GoogLeNetType
GoogLeNet(; pretrain::Bool = false, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an Inception-v1 model (commonly referred to as GoogLeNet) (reference).

Arguments

  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • nclasses: the number of output classes
  • batchnorm: set to true to use batch normalization after each convolution
  • bias: set to true to use bias in the convolution layers
Warning

GoogLeNet does not currently support pretrained weights.

See also Metalhead.googlenet.

source
Metalhead.Inceptionv3Type
Inceptionv3(; pretrain::Bool = false, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an Inception-v3 model (reference).

Arguments

  • pretrain: set to true to load the pre-trained weights for ImageNet
  • inchannels: number of input channels
  • nclasses: the number of output classes
Warning

Inceptionv3 does not currently support pretrained weights.

See also Metalhead.inceptionv3.

source
Metalhead.Inceptionv4Type
Inceptionv4(; pretrain::Bool = false, inchannels::Integer = 3,
            nclasses::Integer = 1000)

Creates an Inceptionv4 model. (reference)

Arguments

  • pretrain: set to true to load the pre-trained weights for ImageNet
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
Warning

Inceptionv4 does not currently support pretrained weights.

See also Metalhead.inceptionv4.

source
Metalhead.InceptionResNetv2Type
InceptionResNetv2(; pretrain::Bool = false, inchannels::Integer = 3, 
                  nclasses::Integer = 1000)

Creates an InceptionResNetv2 model. (reference)

Arguments

  • pretrain: set to true to load the pre-trained weights for ImageNet
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
Warning

InceptionResNetv2 does not currently support pretrained weights.

See also Metalhead.inceptionresnetv2.

source
Metalhead.XceptionType
Xception(; pretrain::Bool = false, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates an Xception model. (reference)

Arguments

  • pretrain: set to true to load the pre-trained weights for ImageNet.
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
Warning

Xception does not currently support pretrained weights.

See also Metalhead.xception.

source
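A minimal usage sketch of these constructors (assuming Metalhead is installed); all of them default to 3 input channels and 1000 output classes:

using Metalhead

googlenet_model = GoogLeNet()                      # Inception-v1
inceptionv3_model = Inceptionv3(; nclasses = 10)   # Inception-v3 with a 10-class head
xception_model = Xception()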

The mid-level functions

Metalhead.googlenetFunction
googlenet(; dropout_prob = 0.4, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an Inception-v1 model (commonly referred to as GoogLeNet) (reference).

Arguments

  • dropout_prob: the dropout probability in the classifier head. Set to nothing to disable dropout.
  • inchannels: the number of input channels
  • nclasses: the number of output classes
  • batchnorm: set to true to include batch normalization after each convolution
  • bias: set to true to use bias in the convolution layers
source
Metalhead.inceptionv3Function
inceptionv3(; dropout_prob = 0.2, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an Inception-v3 model (reference).

Arguments

  • dropout_prob: the dropout probability in the classifier head. Set to nothing to disable dropout.
  • inchannels: number of input feature maps
  • nclasses: the number of output classes
source
Metalhead.inceptionv4Function
inceptionv4(; dropout_prob = nothing, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an Inceptionv4 model. (reference)

Arguments

  • dropout_prob: probability of dropout in classifier head. Set to nothing to disable dropout.
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
source
Metalhead.inceptionresnetv2Function
inceptionresnetv2(; inchannels::Integer = 3, dropout_prob = nothing, nclasses::Integer = 1000)

Creates an InceptionResNetv2 model. (reference)

Arguments

  • dropout_prob: probability of dropout in classifier head. Set to nothing to disable dropout.
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
source
Metalhead.xceptionFunction
xception(; dropout_prob = nothing, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates an Xception model. (reference)

Arguments

  • dropout_prob: probability of dropout in classifier head. Set to nothing to disable dropout.
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
source
diff --git a/dev/api/layers_adv/index.html b/dev/api/layers_adv/index.html index dd63b8dd..36f7cbaa 100644 --- a/dev/api/layers_adv/index.html +++ b/dev/api/layers_adv/index.html @@ -1,17 +1,17 @@ More advanced layers · Metalhead.jl

More advanced layers

This page contains the API reference for some more advanced layers present in the Layers module. These layers are used in Metalhead.jl to build more complex models, and can also be used by the user to build custom models. For a more basic introduction to the Layers module, please refer to the introduction guide for the Layers module.

Squeeze-and-excitation blocks

These are used in models like SE-ResNet and SE-ResNeXt, as well as in the design of inverted residual blocks used in the MobileNet and EfficientNet family of models.

Metalhead.Layers.squeeze_exciteFunction
squeeze_excite(inplanes::Integer; reduction::Real = 16, round_fn = _round_channels, 
               norm_layer = identity, activation = relu, gate_activation = sigmoid)

Creates a squeeze-and-excitation layer used in MobileNets, EfficientNets and SE-ResNets.

Arguments

  • inplanes: The number of input feature maps
  • reduction: The reduction factor for the number of hidden feature maps in the squeeze and excite layer. The number of hidden feature maps is calculated as round_fn(inplanes / reduction).
  • round_fn: The function to round the number of reduced feature maps.
  • activation: The activation function for the first convolution layer
  • gate_activation: The activation function for the gate layer
  • norm_layer: The normalization layer to be used after the convolution layers
  • rd_planes: The number of hidden feature maps in a squeeze and excite layer
source
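A small, hedged usage sketch (the output shape noted in the comment is an assumption based on the layer's channel-rescaling behaviour):

using Flux, Metalhead

se = Metalhead.Layers.squeeze_excite(64; reduction = 16)
x = rand(Float32, 7, 7, 64, 1)   # a 7×7 feature map with 64 channels
size(se(x))                      # expected to be (7, 7, 64, 1)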

Inverted residual blocks

These blocks are designed to be used in the MobileNet and EfficientNet family of convolutional neural networks.

Metalhead.Layers.dwsep_conv_normFunction
dwsep_conv_norm(kernel_size::Dims{2}, inplanes::Integer, outplanes::Integer,
                 activation = relu; norm_layer = BatchNorm, stride::Integer = 1,
                bias::Bool = !(norm_layer !== identity), pad::Integer = 0, [bias, weight, init])

Create a depthwise separable convolution chain as used in MobileNetv1. This is a sequence of layers:

  • a kernel_size depthwise convolution from inplanes => inplanes
  • a (batch) normalisation layer + activation (if norm_layer !== identity; otherwise activation is applied to the convolution output)
  • a kernel_size convolution from inplanes => outplanes
  • a (batch) normalisation layer + activation (if norm_layer !== identity; otherwise activation is applied to the convolution output)

See Fig. 3 in reference.

Arguments

  • kernel_size: size of the convolution kernel (tuple)
  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • activation: the activation function for the final layer
  • norm_layer: the normalisation layer used. Note that using identity as the normalisation layer will result in no normalisation being applied.
  • bias: whether to use bias in the convolution layers.
  • stride: stride of the first convolution kernel
  • pad: padding of the first convolution kernel
  • weight, init: initialization for the convolution kernel (see Flux.Conv)
source
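A hedged usage sketch, assuming that (like conv_norm) this returns a vector of layers meant to be splatted into a Chain:

using Flux, Metalhead

# depthwise separable 3×3 block taking 3 channels to 32, with stride 2
layers = Metalhead.Layers.dwsep_conv_norm((3, 3), 3, 32; stride = 2, pad = 1)
model = Chain(layers...)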
Metalhead.Layers.mbconvFunction
mbconv(kernel_size::Dims{2}, inplanes::Integer, explanes::Integer,
        outplanes::Integer, activation = relu; stride::Integer,
        reduction::Union{Nothing, Real} = nothing,
       se_round_fn = x -> round(Int, x), norm_layer = BatchNorm, kwargs...)

Create a basic inverted residual block for MobileNet and EfficientNet variants. This is a sequence of layers:

  • a 1x1 convolution from inplanes => explanes followed by a (batch) normalisation layer

  • activation if inplanes != explanes

  • a kernel_size depthwise separable convolution from explanes => explanes

  • a (batch) normalisation layer

  • a squeeze-and-excitation block (if reduction != nothing) from explanes => se_round_fn(explanes / reduction) and back to explanes

  • a 1x1 convolution from explanes => outplanes

  • a (batch) normalisation layer + activation

Warning

This function does not handle the residual connection by default. The user must add this manually to use this block as a standalone. To construct a model, check out the builders, which handle the residual connection and other details.

First introduced in the MobileNetv2 paper. (See Fig. 3 in reference.)

Arguments

  • kernel_size: kernel size of the convolutional layers
  • inplanes: number of input feature maps
  • explanes: The number of expanded feature maps. This is the number of feature maps after the first 1x1 convolution.
  • outplanes: The number of output feature maps
  • activation: The activation function for the first two convolution layers
  • stride: The stride of the convolutional kernel, has to be either 1 or 2
  • reduction: The reduction factor for the number of hidden feature maps in a squeeze and excite layer (see squeeze_excite)
  • se_round_fn: The function to round the number of reduced feature maps in the squeeze and excite layer
  • norm_layer: The normalization layer to use
source
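A hedged sketch of how the residual connection mentioned in the warning might be added manually, for the stride-1 case where inplanes == outplanes:

using Flux, Metalhead

# inverted residual block: 64 => 256 (expanded) => 64, with a squeeze-and-excite stage
block = Metalhead.Layers.mbconv((3, 3), 64, 256, 64, swish; stride = 1, reduction = 4)
res_block = SkipConnection(block, +)   # residual connection added by the user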
Metalhead.Layers.fused_mbconvFunction
fused_mbconv(kernel_size::Dims{2}, inplanes::Integer, explanes::Integer,
              outplanes::Integer, activation = relu;
             stride::Integer, norm_layer = BatchNorm)

Create a fused inverted residual block.

This is a sequence of layers:

  • a kernel_size depthwise separable convolution from explanes => explanes
  • a (batch) normalisation layer
  • a 1x1 convolution from explanes => outplanes followed by a (batch) normalisation layer + activation if inplanes != explanes
Warning

This function does not handle the residual connection by default. The user must add this manually to use this block as a standalone. To construct a model, check out the builders, which handle the residual connection and other details.

Originally introduced by Google in EfficientNet-EdgeTPU: Creating Accelerator-Optimized Neural Networks with AutoML. Later used in the EfficientNetv2 paper.

Arguments

  • kernel_size: kernel size of the convolutional layers
  • inplanes: number of input feature maps
  • explanes: The number of expanded feature maps
  • outplanes: The number of output feature maps
  • activation: The activation function for the first two convolution layers
  • stride: The stride of the convolutional kernel, has to be either 1 or 2
  • norm_layer: The normalization layer to use
source

The Layers module contains specific layers that are used to build vision transformer (ViT)-inspired models:

Metalhead.Layers.MultiHeadSelfAttentionType
MultiHeadSelfAttention(planes::Integer, nheads::Integer = 8; qkv_bias::Bool = false, 
            attn_dropout_prob = 0., proj_dropout_prob = 0.)

Multi-head self-attention layer.

Arguments

  • planes: number of input channels
  • nheads: number of heads
  • qkv_bias: whether to use bias in the layer to get the query, key and value
  • attn_dropout_prob: dropout probability after the self-attention layer
  • proj_dropout_prob: dropout probability after the projection layer
source
Metalhead.Layers.ClassTokensType
ClassTokens(planes::Integer; init = Flux.zeros32)

Appends class tokens to an input with embedding dimension planes for use in many vision transformer models.

source
Metalhead.Layers.ViPosEmbeddingType
ViPosEmbedding(embedsize::Integer, npatches::Integer; 
               init = (dims::Dims{2}) -> rand(Float32, dims))

Positional embedding layer used by many vision transformer-like models.

source
Metalhead.Layers.PatchEmbeddingFunction
PatchEmbedding(imsize::Dims{2} = (224, 224); inchannels::Integer = 3,
                patch_size::Dims{2} = (16, 16), embedplanes = 768,
               norm_layer = planes -> identity, flatten = true)

Patch embedding layer used by many vision transformer-like models to split the input image into patches.

Arguments

  • imsize: the size of the input image
  • inchannels: number of input channels
  • patch_size: the size of the patches
  • embedplanes: the number of channels in the embedding
  • norm_layer: the normalization layer - by default the identity function but otherwise takes a single argument constructor for a normalization layer like LayerNorm or BatchNorm
  • flatten: set true to flatten the input spatial dimensions after the embedding
source
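A hedged sketch of how these layers compose into a ViT-style stem; the shapes in the comments assume 224×224 inputs split into 16×16 patches (196 patches plus one class token):

using Flux, Metalhead

patchify = Metalhead.Layers.PatchEmbedding((224, 224); patch_size = (16, 16), embedplanes = 768)
stem = Chain(patchify,
             Metalhead.Layers.ClassTokens(768),           # append a learnable class token
             Metalhead.Layers.ViPosEmbedding(768, 197))   # 196 patches + 1 class token
x = rand(Float32, 224, 224, 3, 1)
size(stem(x))   # expected to be (768, 197, 1)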

Apart from this, the Layers module also contains certain blocks used in MLPMixer-style models:

Metalhead.Layers.gated_mlp_blockFunction
gated_mlp(gate_layer, inplanes::Integer, hidden_planes::Integer, 
          outplanes::Integer = inplanes; dropout_prob = 0.0, activation = gelu)

Feedforward block based on the implementation in the paper "Pay Attention to MLPs". (reference)

Arguments

  • gate_layer: Layer to use for the gating.
  • inplanes: Number of dimensions in the input.
  • hidden_planes: Number of dimensions in the intermediate layer.
  • outplanes: Number of dimensions in the output - by default it is the same as inplanes.
  • dropout_prob: Dropout probability.
  • activation: Activation function to use.
source
Metalhead.Layers.mlp_blockFunction
mlp_block(inplanes::Integer, hidden_planes::Integer, outplanes::Integer = inplanes; 
          dropout_prob = 0., activation = gelu)

Feedforward block used in many MLPMixer-like and vision-transformer models.

Arguments

  • inplanes: Number of dimensions in the input.
  • hidden_planes: Number of dimensions in the intermediate layer.
  • outplanes: Number of dimensions in the output - by default it is the same as inplanes.
  • dropout_prob: Dropout probability.
  • activation: Activation function to use.
source
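A hedged usage sketch; since Flux's Dense layers act on the first dimension, these blocks can be applied directly to (embedding, patches, batch) arrays:

using Flux, Metalhead

block = Metalhead.Layers.mlp_block(192, 768)   # 192 => 768 => 192 feedforward block
tokens = rand(Float32, 192, 196, 1)            # (embedding, patches, batch)
size(block(tokens))                            # expected to be (192, 196, 1)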

Miscellaneous utilities for layers

These are some miscellaneous utilities present in the Layers module, and are used with other custom/inbuilt layers to make certain common operations in neural networks easier.

Metalhead.Layers.inputscaleFunction
inputscale(λ; activation = identity)

Scales the input by a scalar λ and applies an activation function to it. Equivalent to activation.(λ .* x).

source
Metalhead.Layers.actaddFunction
actadd(activation = relu, xs...)

Convenience function for summing up the input arrays after applying an activation function to them. Useful as the connection argument for the block function in Metalhead.resnet.

source
Metalhead.Layers.addactFunction
addact(activation = relu, xs...)

Convenience function for applying an activation function to the output after summing up the input arrays. Useful as the connection argument for the block function in Metalhead.resnet.

source
Metalhead.Layers.cat_channelsFunction
cat_channels(x, y, zs...)

Concatenate x and y (and any zs) along the channel dimension (third dimension). Equivalent to cat(x, y, zs...; dims=3). Convenient reduction operator for use with Parallel.

source
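A hedged sketch of the typical Inception-style use of cat_channels as the reduction in a Parallel block:

using Flux, Metalhead

branches = Parallel(Metalhead.Layers.cat_channels,
                    Conv((1, 1), 16 => 8),
                    Conv((3, 3), 16 => 8; pad = 1))
x = rand(Float32, 32, 32, 16, 1)
size(branches(x))   # expected to be (32, 32, 16, 1), i.e. 8 + 8 channels concatenated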
Metalhead.Layers.flatten_chainsFunction
flatten_chains(m::Chain)
flatten_chains(m)

Convenience function for traversing nested layers of a Chain object and flattening them into a single iterator.

source
Metalhead.Layers.swapdimsFunction
swapdims(perm)

Convenience function that returns a closure which permutes the dimensions of an array. perm is a vector or tuple specifying a permutation of the input dimensions. Equivalent to permutedims(x, perm).

source
diff --git a/dev/api/layers_intro/index.html b/dev/api/layers_intro/index.html index acbb05b2..8bd2ccc7 100644 --- a/dev/api/layers_intro/index.html +++ b/dev/api/layers_intro/index.html @@ -3,16 +3,16 @@ using Metalhead.Layers

Convolution + Normalisation: the conv_norm layer

One of the most common patterns in modern neural networks is to have a convolutional layer followed by a normalisation layer. Most major deep learning libraries have a way to combine these two layers into a single layer. In Metalhead.jl, this is done with the Metalhead.Layers.conv_norm layer. The function signature for this is given below:

Metalhead.Layers.conv_normFunction
conv_norm(kernel_size::Dims{2}, inplanes::Integer, outplanes::Integer,
           activation = relu; norm_layer = BatchNorm, revnorm::Bool = false,
           preact::Bool = false, stride::Integer = 1, pad::Integer = 0,
          dilation::Integer = 1, groups::Integer = 1, [bias, weight, init])

Create a convolution + normalisation layer pair with activation.

Arguments

  • kernel_size: size of the convolution kernel (tuple)
  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • activation: the activation function for the final layer
  • norm_layer: the normalisation layer used. Note that using identity as the normalisation layer will result in no normalisation being applied. (This is only compatible with preact and revnorm both set to false.)
  • revnorm: set to true to place the normalisation layer before the convolution
  • preact: set to true to place the activation function before the normalisation layer (only compatible with revnorm = false)
  • bias: bias for the convolution kernel. This is set to false by default if norm_layer is not identity and true otherwise.
  • stride: stride of the convolution kernel
  • pad: padding of the convolution kernel
  • dilation: dilation of the convolution kernel
  • groups: groups for the convolution kernel
  • weight, init: initialization for the convolution kernel (see Flux.Conv)
source

To know more about the exact details of each of these parameters, you can refer to the documentation for this function. For now, we will focus on some common use cases. For example, if you want to create a convolutional layer with a kernel size of 3x3, with 32 input channels and 64 output channels, along with a BatchNorm layer, you can do the following:

conv_norm((3, 3), 32, 64)

This returns a Vector with the desired layers. To use it in a model, the user should splat it into a Chain. For example:

Chain(Dense(3, 32), conv_norm((3, 3), 32, 64)..., Dense(64, 10))

The default activation function for conv_norm is relu, and the default normalisation layer is BatchNorm. To use a different activation function, you can just pass it in as a positional argument. For example, to use a sigmoid activation function:

conv_norm((3, 3), 32, 64, sigmoid)

Let's try something else. Suppose you want to use a GroupNorm layer instead of a BatchNorm layer. Note that norm_layer is a keyword argument in the function signature of conv_norm as shown above. Then we can write:

conv_norm((3, 3), 32, 64; norm_layer = GroupNorm)

What if you want to change certain specific parameters of the norm_layer? For example, what if you want to change the number of groups in the GroupNorm layer?

# defining the norm layer
 norm_layer = planes -> GroupNorm(planes, 4)
 # passing it to the conv_norm layer
 conv_norm((3, 3), 32, 64; norm_layer = norm_layer)

One of Julia's features is that functions are first-class objects, and can be passed around as arguments to other functions. Here, we have created an anonymous function that takes in the number of planes as an argument, and returns a GroupNorm layer with 4 groups. This is then passed to the norm_layer keyword argument of the conv_norm layer. Using anonymous functions allows us to configure the layers in a very flexible manner, and this is a common pattern in Metalhead.jl.

Let's take a slightly more complicated example. TensorFlow uses different defaults for its normalisation layers. In particular, it uses an epsilon value of 1e-3 for BatchNorm layers. If you want to use the same defaults as TensorFlow, you can do the following:

# note that 1e-3 is not a Float32 and Flux is optimized for Float32, so we use 1.0f-3
 conv_norm((3, 3), 32, 64; norm_layer = planes -> BatchNorm(planes, eps = 1.0f-3))

which, incidentally, is very similar to the code Metalhead uses internally for the Metalhead.Layers.basic_conv_bn layer that is used in the Inception family of models.

Metalhead.Layers.basic_conv_bnFunction
basic_conv_bn(kernel_size::Dims{2}, inplanes, outplanes, activation = relu;
              kwargs...)

Returns a convolution + batch normalisation pair with activation as used by the Inception family of models with default values matching those used in the official TensorFlow implementation.

Arguments

  • kernel_size: size of the convolution kernel (tuple)
  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • activation: the activation function for the final layer
  • batchnorm: set to true to include batch normalization after each convolution
  • kwargs: keyword arguments passed to conv_norm
source

Normalisation layers

The Layers module provides some custom normalisation functions that are not present in Flux.

Metalhead.Layers.LayerScaleFunction
LayerScale(planes::Integer, λ)

Creates a Flux.Scale layer that performs "LayerScale" (reference).

Arguments

  • planes: Size of channel dimension in the input.
  • λ: initialisation value for the learnable diagonal matrix.
source
Metalhead.Layers.LayerNormV2Type
LayerNormV2(size..., λ=identity; affine=true, eps=1f-5)

Same as Flux's LayerNorm but eps is added before taking the square root in the denominator. Therefore, LayerNormV2 matches pytorch's LayerNorm.

source
Metalhead.Layers.ChannelLayerNormType
ChannelLayerNorm(sz::Integer, λ = identity; eps = 1.0f-6)

A variant of LayerNorm where the input is normalised along the channel dimension. The input is expected to have channel dimension with size sz. It also applies a learnable shift and rescaling after the normalization.

Note that this is specifically for inputs with 4 dimensions in the format (H, W, C, N) where H, W are the height and width of the input, C is the number of channels, and N is the batch size.

source

There is also a utility function, prenorm, which applies a normalisation layer before a given block and simply returns a Chain with the normalisation layer and the block. This is useful for creating Vision Transformers (ViT)-like models.

Metalhead.Layers.prenormFunction
prenorm(planes, block; norm_layer = LayerNorm)

Utility function to apply a normalization layer before a block.

Arguments

  • planes: Size of dimension to normalize.
  • block: The block before which the normalization layer is applied.
  • norm_layer: The normalization layer to use.
source
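A hedged sketch of the pre-norm transformer pattern this enables: normalisation applied before self-attention, wrapped in a residual connection:

using Flux, Metalhead

attn = Metalhead.Layers.MultiHeadSelfAttention(768, 12)
block = SkipConnection(Metalhead.Layers.prenorm(768, attn), +)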

Dropout layers

The Layers module provides two dropout-like layers not present in Flux:

Metalhead.Layers.DropBlockType
DropBlock(drop_block_prob = 0.1, block_size = 7, gamma_scale = 1.0, [rng])

The DropBlock layer. While training, it zeroes out contiguous regions of size block_size in the input. During inference, it simply returns the input x. It can be used in two ways: either with all blocks having the same survival probability or with a linear scaling rule across the blocks. This is performed only at training time. At test time, the DropBlock layer is equivalent to identity.

(reference)

Arguments

  • drop_block_prob: probability of dropping a block. If nothing is passed, it returns identity. Note that some literature uses the term "survival probability" instead, which is equivalent to 1 - drop_block_prob.
  • block_size: size of the block to drop
  • gamma_scale: multiplicative factor for gamma used. For the calculation of gamma, refer to the paper.
  • rng: can be used to pass in a custom RNG instead of the default. Custom RNGs are only supported on the CPU.
source
Metalhead.Layers.StochasticDepthFunction
StochasticDepth(p, mode = :row; [rng])

Implements Stochastic Depth. This is a Dropout layer from Flux that drops values with probability p. (reference)

This layer can be used to drop certain blocks in a residual structure and allow them to propagate completely through the skip connection. It can be used in two ways: either with all blocks having the same survival probability or with a linear scaling rule across the blocks. This is performed only at training time. At test time, the StochasticDepth layer is equivalent to identity.

Arguments

  • p: probability of Stochastic Depth. Note that some literature uses the term "survival probability" instead, which is equivalent to 1 - p.
  • mode: Either :batch or :row. :batch randomly zeroes the entire input, while :row zeroes randomly selected rows from the batch. The default is :row.
  • rng: can be used to pass in a custom RNG instead of the default. See Flux.Dropout for more information on the behaviour of this argument. Custom RNGs are only supported on the CPU.
source

DropBlock also has a functional variant present in the Layers module:

Metalhead.Layers.dropblockFunction
dropblock([rng], x::AbstractArray{T, 4}, drop_block_prob, block_size,
          gamma_scale, active::Bool = true)

The dropblock function. If active is true, for each input, it zeroes out contiguous regions of size block_size in the input. Otherwise, it simply returns the input x.

Arguments

  • rng: can be used to pass in a custom RNG instead of the default. Custom RNGs are only supported on the CPU.
  • x: input array
  • drop_block_prob: probability of dropping a block. If nothing is passed, it returns identity.
  • block_size: size of the block to drop
  • gamma_scale: multiplicative factor for gamma used. For the calculations, refer to the paper.

If you are not a package developer, you most likely do not want this function. Use DropBlock instead.

source

Both DropBlock and StochasticDepth are used along with probability values that vary based on a linear schedule across the structure of the model (see the respective papers for more details). The Layers module provides a utility function to create such a schedule as well:

Metalhead.Layers.linear_schedulerFunction
linear_scheduler(drop_prob = 0.0; start_value = 0.0, depth)
linear_scheduler(drop_prob::Nothing; depth::Integer)

Returns the dropout probabilities for a given depth using the linear scaling rule. Note that this returns evenly spaced values between start_value and drop_prob, not including drop_prob. If drop_prob is nothing, it returns a Vector of length depth with all values equal to nothing.

source
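An illustrative (hedged) example of the values this produces, following the description above:

using Metalhead

Metalhead.Layers.linear_scheduler(0.2; depth = 4)
# expected to be a vector like [0.0, 0.05, 0.1, 0.15]
Metalhead.Layers.linear_scheduler(nothing; depth = 4)
# expected to be [nothing, nothing, nothing, nothing]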

The Metalhead.resnet function which powers the ResNet family of models in Metalhead.jl is configured to allow the use of both these layers. For examples, check out the guide for using the ResNet family in Metalhead here. These layers can also be used by the user to construct other custom models.

Pooling layers

The Layers module provides a Metalhead.Layers.AdaptiveMeanMaxPool layer, which is inspired by a similar layer present in timm.

Metalhead.Layers.AdaptiveMeanMaxPoolFunction
AdaptiveMeanMaxPool([connection = +], output_size::Tuple = (1, 1))

A type of adaptive pooling layer which uses both mean and max pooling and combines them to produce a single output. Note that this is equivalent to Parallel(connection, AdaptiveMeanPool(output_size), AdaptiveMaxPool(output_size)). When connection is not specified, it defaults to +.

Arguments

  • connection: The connection type to use.
  • output_size: The size of the output after pooling.
source
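A hedged sketch of a classifier head built with this pooling layer (the expected shapes in the comments are assumptions):

using Flux, Metalhead

pool = Metalhead.Layers.AdaptiveMeanMaxPool()   # defaults to an output size of (1, 1)
head = Chain(pool, Flux.flatten, Dense(512 => 1000))
x = rand(Float32, 7, 7, 512, 1)
size(head(x))   # expected to be (1000, 1)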

Many mid-level model functions in Metalhead.jl have been written to support passing custom pooling layers to them if applicable (either in the model itself or in the classifier head). For example, the Metalhead.resnet function supports this, and examples of this can be found in the guide for using the ResNet family in Metalhead here.

Classifier creation

Metalhead provides a function to create a classifier for neural network models that is quite flexible, and is used by the library extensively to create the classifier "head" for networks. This function is called Metalhead.Layers.create_classifier and is documented below:

Metalhead.Layers.create_classifierFunction
create_classifier(inplanes::Integer, nclasses::Integer, activation = identity;
+              kwargs...)

Returns a convolution + batch normalisation pair with activation as used by the Inception family of models with default values matching those used in the official TensorFlow implementation.

Arguments

  • kernel_size: size of the convolution kernel (tuple)
  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • activation: the activation function for the final layer
  • batchnorm: set to true to include batch normalization after each convolution
  • kwargs: keyword arguments passed to conv_norm
source

Normalisation layers

The Layers module provides some custom normalisation functions that are not present in Flux.

Metalhead.Layers.LayerScaleFunction
LayerScale(planes::Integer, λ)

Creates a Flux.Scale layer that performs "LayerScale" (reference).

Arguments

  • planes: Size of channel dimension in the input.
  • λ: initialisation value for the learnable diagonal matrix.
source
Metalhead.Layers.LayerNormV2Type
LayerNormV2(size..., λ=identity; affine=true, eps=1f-5)

Same as Flux's LayerNorm but eps is added before taking the square root in the denominator. Therefore, LayerNormV2 matches pytorch's LayerNorm.

source
Metalhead.Layers.ChannelLayerNormType
ChannelLayerNorm(sz::Integer, λ = identity; eps = 1.0f-6)

A variant of LayerNorm where the input is normalised along the channel dimension. The input is expected to have channel dimension with size sz. It also applies a learnable shift and rescaling after the normalization.

Note that this is specifically for inputs with 4 dimensions in the format (H, W, C, N) where H, W are the height and width of the input, C is the number of channels, and N is the batch size.

source
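
For illustration, here is a small usage sketch (not taken from the package documentation; the sizes are illustrative and follow the signatures shown above) that applies both layers to random inputs:

using Metalhead, Flux

# LayerNormV2 follows Flux's LayerNorm calling convention; here it normalises
# a 16-dimensional feature vector per column.
ln = Metalhead.Layers.LayerNormV2(16)
size(ln(rand(Float32, 16, 10)))            # (16, 10)

# ChannelLayerNorm expects (H, W, C, N) inputs and normalises along C.
cln = Metalhead.Layers.ChannelLayerNorm(16)
size(cln(rand(Float32, 8, 8, 16, 2)))      # (8, 8, 16, 2)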

There is also a utility function, prenorm, which applies a normalisation layer before a given block and simply returns a Chain with the normalisation layer and the block. This is useful for creating Vision Transformers (ViT)-like models.

Metalhead.Layers.prenormFunction
prenorm(planes, block; norm_layer = LayerNorm)

Utility function to apply a normalization layer before a block.

Arguments

  • planes: Size of dimension to normalize.
  • block: The block before which the normalization layer is applied.
  • norm_layer: The normalization layer to use.
source
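
A minimal sketch of prenorm, assuming a Dense block of width 64 (the numbers are illustrative only):

using Metalhead, Flux

# prenorm wraps the block in a Chain with the normalisation layer first,
# as in pre-norm transformer blocks.
block = Metalhead.Layers.prenorm(64, Dense(64, 64, gelu))
size(block(rand(Float32, 64, 32)))         # (64, 32)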

Dropout layers

The Layers module provides two dropout-like layers not present in Flux:

Metalhead.Layers.DropBlockType
DropBlock(drop_block_prob = 0.1, block_size = 7, gamma_scale = 1.0, [rng])

The DropBlock layer. While training, it zeroes out contiguous regions of size block_size in the input; at test time, it is equivalent to identity and simply returns the input x. It can be used either with all blocks having the same drop probability or with a linear scaling rule for the probability across the blocks.

(reference)

Arguments

  • drop_block_prob: probability of dropping a block. If nothing is passed, it returns identity. Note that some literature uses the term "survival probability" instead, which is equivalent to 1 - drop_block_prob.
  • block_size: size of the block to drop
  • gamma_scale: multiplicative factor for gamma used. For the calculation of gamma, refer to the paper.
  • rng: can be used to pass in a custom RNG instead of the default. Custom RNGs are only supported on the CPU.
source
Metalhead.Layers.StochasticDepthFunction
StochasticDepth(p, mode = :row; [rng])

Implements Stochastic Depth. This is a Dropout layer from Flux that drops values with probability p. (reference)

This layer can be used to drop certain blocks in a residual structure and allow them to propagate completely through the skip connection. It can be used in two ways: either with all blocks having the same survival probability or with a linear scaling rule across the blocks. This is performed only at training time. At test time, the StochasticDepth layer is equivalent to identity.

Arguments

  • p: probability of Stochastic Depth. Note that some literature uses the term "survival probability" instead, which is equivalent to 1 - p.
  • mode: Either :batch or :row. :batch randomly zeroes the entire input, while :row zeroes randomly selected rows (samples) from the batch. The default is :row.
  • rng: can be used to pass in a custom RNG instead of the default. See Flux.Dropout for more information on the behaviour of this argument. Custom RNGs are only supported on the CPU.
source
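
As a rough usage sketch (constructor arguments follow the docstrings above; the sizes are illustrative):

using Metalhead

# DropBlock with drop_block_prob = 0.1 and block_size = 5; StochasticDepth with
# p = 0.2 in :row mode. Both preserve the shape of the input.
db = Metalhead.Layers.DropBlock(0.1, 5)
sd = Metalhead.Layers.StochasticDepth(0.2, :row)

x = rand(Float32, 16, 16, 8, 4)
size(db(x)), size(sd(x))                   # both (16, 16, 8, 4)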

DropBlock also has a functional variant present in the Layers module:

Metalhead.Layers.dropblockFunction
dropblock([rng], x::AbstractArray{T, 4}, drop_block_prob, block_size,
+          gamma_scale, active::Bool = true)

The dropblock function. If active is true, for each input, it zeroes out contiguous regions of size block_size in the input. Otherwise, it simply returns the input x.

Arguments

  • rng: can be used to pass in a custom RNG instead of the default. Custom RNGs are only supported on the CPU.
  • x: input array
  • drop_block_prob: probability of dropping a block. If nothing is passed, it returns identity.
  • block_size: size of the block to drop
  • gamma_scale: multiplicative factor for gamma used. For the calculations, refer to the paper.

If you are not a package developer, you most likely do not want this function. Use DropBlock instead.

source

Both DropBlock and StochasticDepth are used along with probability values that vary based on a linear schedule across the structure of the model (see the respective papers for more details). The Layers module provides a utility function to create such a schedule as well:

Metalhead.Layers.linear_schedulerFunction
linear_scheduler(drop_prob = 0.0; start_value = 0.0, depth)
+linear_scheduler(drop_prob::Nothing; depth::Integer)

Returns the dropout probabilities for a given depth using the linear scaling rule. Note that this returns evenly spaced values between start_value and drop_prob, not including drop_prob. If drop_prob is nothing, it returns a Vector of length depth with all values equal to nothing.

source
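
For example, a hypothetical model with four blocks could be given a schedule like this (a sketch of the documented behaviour):

using Metalhead

# Evenly spaced values from 0.0 up to, but not including, 0.2.
Metalhead.Layers.linear_scheduler(0.2; depth = 4)

# Passing `nothing` disables the layers: a length-4 vector of `nothing`s.
Metalhead.Layers.linear_scheduler(nothing; depth = 4)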

The Metalhead.resnet function which powers the ResNet family of models in Metalhead.jl is configured to allow the use of both these layers. For examples, check out the guide for using the ResNet family in Metalhead here. These layers can also be used by the user to construct other custom models.

Pooling layers

The Layers module provides a Metalhead.Layers.AdaptiveMeanMaxPool layer, which is inspired by a similar layer present in timm.

Metalhead.Layers.AdaptiveMeanMaxPoolFunction
AdaptiveMeanMaxPool([connection = +], output_size::Tuple = (1, 1))

A type of adaptive pooling layer which uses both mean and max pooling and combines them to produce a single output. Note that this is equivalent to Parallel(connection, AdaptiveMeanPool(output_size), AdaptiveMaxPool(output_size)). When connection is not specified, it defaults to +.

Arguments

  • connection: The connection type to use.
  • output_size: The size of the output after pooling.
source

Many mid-level model functions in Metalhead.jl have been written to support passing custom pooling layers to them if applicable (either in the model itself or in the classifier head). For example, the Metalhead.resnet function supports this, and examples of this can be found in the guide for using the ResNet family in Metalhead here.
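
As a small sketch (the channel and image sizes are made up for illustration), the layer can be dropped into a classifier head like any other pooling layer:

using Metalhead, Flux

# Mean and max pooling are combined with `+` by default, so the number of
# channels is unchanged (32 here).
pool = Metalhead.Layers.AdaptiveMeanMaxPool((1, 1))
head = Chain(pool, Flux.flatten, Dense(32, 10))
size(head(rand(Float32, 14, 14, 32, 8)))   # (10, 8)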

Classifier creation

Metalhead provides a function to create a classifier for neural network models that is quite flexible, and is used by the library extensively to create the classifier "head" for networks. This function is called Metalhead.Layers.create_classifier and is documented below:

Metalhead.Layers.create_classifierFunction
create_classifier(inplanes::Integer, nclasses::Integer, activation = identity;
                   use_conv::Bool = false, pool_layer = AdaptiveMeanPool((1, 1)), 
-                  dropout_prob = nothing)

Creates a classifier head to be used for models.

Arguments

  • inplanes: number of input feature maps
  • nclasses: number of output classes
  • activation: activation function to use
  • use_conv: whether to use a 1x1 convolutional layer instead of a Dense layer.
  • pool_layer: pooling layer to use. This is passed in with the layer instantiated with any arguments that are needed i.e. as AdaptiveMeanPool((1, 1)), for example.
  • dropout_prob: dropout probability used in the classifier head. Set to nothing to disable dropout.
source
create_classifier(inplanes::Integer, hidden_planes::Integer, nclasses::Integer,
                   activations::NTuple{2} = (relu, identity);
                   use_conv::NTuple{2, Bool} = (false, false),
-                  pool_layer = AdaptiveMeanPool((1, 1)), dropout_prob = nothing)

Creates a classifier head to be used for models with an extra hidden layer.

Arguments

  • inplanes: number of input feature maps
  • hidden_planes: number of hidden feature maps
  • nclasses: number of output classes
  • activations: activation functions to use for the hidden and output layers. This is a tuple of two elements, the first being the activation function for the hidden layer and the second for the output layer.
  • use_conv: whether to use a 1x1 convolutional layer instead of a Dense layer. This is a tuple of two booleans, the first for the hidden layer and the second for the output layer.
  • pool_layer: pooling layer to use. This is passed in with the layer instantiated with any arguments that are needed i.e. as AdaptiveMeanPool((1, 1)), for example.
  • dropout_prob: dropout probability used in the classifier head. Set to nothing to disable dropout.
source

Due to the power of multiple dispatch in Julia, the above function can be called with two different signatures - one of which creates a classifier with no hidden layers, and the other which creates a classifier with a single hidden layer. The function signature for both is documented above, and the user can choose the one that is most convenient for them. Both are used in Metalhead.jl - the latter is used in MobileNetv3, and the former is used almost everywhere else.
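
A short sketch of both signatures, with illustrative sizes (512 input feature maps, a 1280-wide hidden layer, and hardswish chosen only as an example activation):

using Metalhead, Flux

# No hidden layer: the head maps 512 feature maps to 1000 classes.
head = Metalhead.Layers.create_classifier(512, 1000)
size(head(rand(Float32, 7, 7, 512, 2)))     # (1000, 2)

# With an extra hidden layer, in the style used by MobileNetv3.
head2 = Metalhead.Layers.create_classifier(512, 1280, 1000, (hardswish, identity))
size(head2(rand(Float32, 7, 7, 512, 2)))    # (1000, 2)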

diff --git a/dev/api/mixers/index.html b/dev/api/mixers/index.html index 3f3f787a..caccb7d4 100644 --- a/dev/api/mixers/index.html +++ b/dev/api/mixers/index.html @@ -1,13 +1,13 @@ MLPMixer-like models · Metalhead.jl

MLPMixer-like models

This is the API reference for the MLPMixer-like models supported by Metalhead.jl.

The higher-level model constructors

Metalhead.MLPMixerType
MLPMixer(config::Symbol; patch_size::Dims{2} = (16, 16), imsize::Dims{2} = (224, 224),
-         inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a model with the MLPMixer architecture. (reference).

Arguments

  • config: the size of the model - one of :small, :base, :large or :huge
  • patch_size: the size of the patches
  • imsize: the size of the input image
  • stochastic_depth_prob: Stochastic depth probability
  • inchannels: the number of input channels
  • nclasses: number of output classes

See also Metalhead.mlpmixer.

source
Metalhead.ResMLPType
ResMLP(config::Symbol; patch_size::Dims{2} = (16, 16), imsize::Dims{2} = (224, 224),
-       inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a model with the ResMLP architecture. (reference).

Arguments

  • config: the size of the model - one of :small, :base, :large or :huge
  • patch_size: the size of the patches
  • imsize: the size of the input image
  • inchannels: the number of input channels
  • nclasses: number of output classes

See also Metalhead.mlpmixer.

source
Metalhead.gMLPType
gMLP(config::Symbol; patch_size::Dims{2} = (16, 16), imsize::Dims{2} = (224, 224),
-     inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a model with the gMLP architecture. (reference).

Arguments

  • config: the size of the model - one of :small, :base, :large or :huge
  • patch_size: the size of the patches
  • imsize: the size of the input image
  • inchannels: the number of input channels
  • nclasses: number of output classes

See also Metalhead.mlpmixer.

source
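
As a quick construction sketch (weights are randomly initialised; the input is a dummy 224×224 RGB batch):

using Metalhead

# The three constructors share the same interface; ResMLP and gMLP accept the
# same configs and keyword arguments.
model = MLPMixer(:small; nclasses = 100)
size(model(rand(Float32, 224, 224, 3, 1)))  # (100, 1)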

The core MLPMixer function

Metalhead.mlpmixerFunction
mlpmixer(block, imsize::Dims{2} = (224, 224); inchannels::Integer = 3, norm_layer = LayerNorm,
          patch_size::Dims{2} = (16, 16), embedplanes = 512, stochastic_depth_prob = 0.,
-         depth::Integer = 12, nclasses::Integer = 1000, kwargs...)

Creates a model with the MLPMixer architecture. (reference).

Arguments

  • block: the type of mixer block to use in the model - architecture dependent (a constructor of the form block(embedplanes, npatches; stochastic_depth_prob, kwargs...))
  • imsize: the size of the input image
  • inchannels: the number of input channels
  • norm_layer: the normalization layer to use in the model
  • patch_size: the size of the patches
  • embedplanes: the number of channels after the patch embedding (denotes the hidden dimension)
  • stochastic_depth_prob: Stochastic depth probability
  • depth: the number of blocks in the model
  • nclasses: number of output classes
  • kwargs: additional arguments (if any) to pass to the mixer block. Will use the defaults if not specified.
source
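
A hedged sketch of calling the mid-level function directly, reusing Metalhead.mixerblock to build a much smaller mixer (all sizes here are illustrative):

using Metalhead

model = Metalhead.mlpmixer(Metalhead.mixerblock, (224, 224);
                           patch_size = (32, 32), embedplanes = 128,
                           depth = 4, nclasses = 10)
size(model(rand(Float32, 224, 224, 3, 1)))  # (10, 1)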

The block functions

Metalhead.mixerblockFunction
mixerblock(planes::Integer, npatches::Integer; mlp_layer = mlp_block,
            mlp_ratio = (0.5, 4.0), dropout_prob = 0.0, stochastic_depth_prob = 0.0,
-           activation = gelu)

Creates a feedforward block for the MLPMixer architecture. (reference)

Arguments

  • planes: the number of planes in the block
  • npatches: the number of patches of the input
  • mlp_ratio: number(s) that determine(s) the number of hidden channels in the token mixing MLP and/or the channel mixing MLP as a ratio to the number of planes in the block.
  • mlp_layer: the MLP layer to use in the block
  • dropout_prob: the dropout probability to use in the MLP blocks
  • stochastic_depth_prob: Stochastic depth probability
  • activation: the activation function to use in the MLP blocks
source
Metalhead.resmixerblockFunction
resmixerblock(planes, npatches; dropout_prob = 0., stochastic_depth_prob = 0., mlp_ratio = 4.0,
-              activation = gelu, layerscale_init = 1e-4)

Creates a block for the ResMixer architecture. (reference).

Arguments

  • planes: the number of planes in the block
  • npatches: the number of patches of the input
  • mlp_ratio: ratio of the number of hidden channels in the channel mixing MLP to the number of planes in the block
  • mlp_layer: the MLP block to use
  • dropout_prob: the dropout probability to use in the MLP blocks
  • stochastic_depth_prob: Stochastic depth probability
  • activation: the activation function to use in the MLP blocks
  • layerscale_init: initialisation constant for the LayerScale
source
Metalhead.SpatialGatingUnitType
SpatialGatingUnit(planes::Integer, npatches::Integer; norm_layer = LayerNorm)

Creates a spatial gating unit as described in the gMLP paper. (reference)

Arguments

  • planes: the number of planes in the block
  • npatches: the number of patches of the input
  • norm_layer: the normalisation layer to use
source
Metalhead.spatialgatingblockFunction
spatialgatingblock(planes::Integer, npatches::Integer; mlp_ratio = 4.0,
                    norm_layer = LayerNorm, mlp_layer = gated_mlp_block,
                    dropout_prob = 0.0, stochastic_depth_prob = 0.0,
-                   activation = gelu)

Creates a feedforward block based on the gMLP model architecture described in the paper. (reference)

Arguments

  • planes: the number of planes in the block
  • npatches: the number of patches of the input
  • mlp_ratio: ratio of the number of hidden channels in the channel mixing MLP to the number of planes in the block
  • norm_layer: the normalisation layer to use
  • dropout_prob: the dropout probability to use in the MLP blocks
  • stochastic_depth_prob: Stochastic depth probability
  • activation: the activation function to use in the MLP blocks
source
diff --git a/dev/api/mobilenet/index.html index 1b31073a..dfde2747 100644 --- a/dev/api/mobilenet/index.html +++ b/dev/api/mobilenet/index.html @@ -1,10 +1,10 @@ MobileNet family of models · Metalhead.jl

MobileNet family of models

This is the API reference for the MobileNet family of models supported by Metalhead.jl.

The higher-level model constructors

Metalhead.MobileNetv1Type
MobileNetv1(width_mult::Real = 1; pretrain::Bool = false,
-            inchannels::Integer = 3, nclasses::Integer = 1000)

Create a MobileNetv1 model with the baseline configuration (reference).

Arguments

  • width_mult: Controls the number of output feature maps in each block (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
  • pretrain: Whether to load the pre-trained weights for ImageNet
  • inchannels: The number of input channels.
  • nclasses: The number of output classes
Warning

MobileNetv1 does not currently support pretrained weights.

See also Metalhead.mobilenetv1.

source
Metalhead.MobileNetv2Type
MobileNetv2(width_mult = 1.0; inchannels::Integer = 3, pretrain::Bool = false,
-            nclasses::Integer = 1000)

Create a MobileNetv2 model with the specified configuration. (reference).

Arguments

  • width_mult: Controls the number of output feature maps in each block (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
  • pretrain: Whether to load the pre-trained weights for ImageNet
  • inchannels: The number of input channels.
  • nclasses: The number of output classes
Warning

MobileNetv2 does not currently support pretrained weights.

See also Metalhead.mobilenetv2.

source
Metalhead.MobileNetv3Type
MobileNetv3(config::Symbol; width_mult::Real = 1, pretrain::Bool = false,
-            inchannels::Integer = 3, nclasses::Integer = 1000)

Create a MobileNetv3 model with the specified configuration. (reference). Set pretrain = true to load the model with pre-trained weights for ImageNet.

Arguments

  • config: :small or :large for the size of the model (see paper).
  • width_mult: Controls the number of output feature maps in each block (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
  • pretrain: whether to load the pre-trained weights for ImageNet
  • inchannels: number of input channels
  • nclasses: the number of output classes
Warning

MobileNetv3 does not currently support pretrained weights.

See also Metalhead.mobilenetv3.

source
Metalhead.MNASNetType
MNASNet(config::Symbol; width_mult::Real = 1, pretrain::Bool = false,
-        inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a MNASNet model with the specified configuration. (reference)

Arguments

  • config: configuration of the model. One of B1, A1 or small. B1 is without squeeze-and-excite layers, A1 is with squeeze-and-excite layers, and small is a smaller version of A1.
  • width_mult: Controls the number of output feature maps in each block (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
  • pretrain: Whether to load the pre-trained weights for ImageNet
  • inchannels: The number of input channels.
  • nclasses: The number of output classes
Warning

MNASNet does not currently support pretrained weights.

See also Metalhead.mnasnet.

source
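
As a construction sketch (no pretrained weights are loaded; the width multipliers and class counts are illustrative):

using Metalhead

m1 = MobileNetv1(0.75)                      # width multiplier 0.75
m2 = MobileNetv2()                          # defaults: width_mult = 1, 1000 classes
m3 = MobileNetv3(:small; nclasses = 10)
mn = MNASNet(:A1)

size(m3(rand(Float32, 224, 224, 3, 1)))     # (10, 1)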

The mid-level functions

Metalhead.mobilenetv1Function
mobilenetv1(width_mult::Real = 1; inplanes::Integer = 32, dropout_prob = nothing,
-            inchannels::Integer = 3, nclasses::Integer = 1000)

Create a MobileNetv1 model. (reference).

Arguments

  • width_mult: Controls the number of output feature maps in each block (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
  • inplanes: Number of input channels to the first convolution layer
  • dropout_prob: Dropout probability for the classifier head. Set to nothing to disable dropout.
  • inchannels: Number of input channels.
  • nclasses: Number of output classes.
source
Metalhead.mobilenetv2Function
mobilenetv2(width_mult::Real = 1; max_width::Integer = 1280,
             inplanes::Integer = 32, dropout_prob = 0.2,
             inchannels::Integer = 3, nclasses::Integer = 1000)

Create a MobileNetv2 model. (reference).

Arguments

- `width_mult`: Controls the number of output feature maps in each block
 (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
@@ -12,11 +12,11 @@
 - `inplanes`: Number of input channels to the first convolution layer
 - `dropout_prob`: Dropout probability for the classifier head. Set to `nothing` to disable dropout.
 - `inchannels`: Number of input channels.
-- `nclasses`: Number of output classes.
source
Metalhead.mobilenetv3Function
mobilenetv3(config::Symbol; width_mult::Real = 1, dropout_prob = 0.2,
             inchannels::Integer = 3, nclasses::Integer = 1000)

Create a MobileNetv3 model with the specified configuration. (reference).

Arguments

- `config`: The configuration of the model. Can be either `small` or `large`.
 - `width_mult`: Controls the number of output feature maps in each block
   (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
 - `dropout_prob`: Dropout probability for the classifier head. Set to `nothing` to disable dropout.
 - `inchannels`: The number of input channels.
-- `nclasses`: The number of output classes.
source
Metalhead.mnasnetFunction
mnasnet(config::Symbol; width_mult::Real = 1, max_width::Integer = 1280,
-        dropout_prob = 0.2, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an MNASNet model. (reference)

Arguments

  • config: configuration of the model. One of B1, A1 or small. B1 is without squeeze-and-excite layers, A1 is with squeeze-and-excite layers, and small is a smaller version of A1.
  • width_mult: Controls the number of output feature maps in each block (with 1 being the default in the paper; this is usually a value between 0.1 and 1.4)
  • max_width: Controls the maximum number of output feature maps in each block
  • dropout_prob: Dropout probability for the classifier head. Set to nothing to disable dropout.
  • inchannels: Number of input channels.
  • nclasses: Number of output classes.
source
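
A small sketch of the mid-level functions, which build the model directly rather than going through the exported wrapper types (the keyword values are illustrative):

using Metalhead

mnas = Metalhead.mnasnet(:A1; width_mult = 0.5, nclasses = 100)
mbv3 = Metalhead.mobilenetv3(:large; dropout_prob = nothing, nclasses = 10)
size(mbv3(rand(Float32, 224, 224, 3, 1)))   # (10, 1)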
diff --git a/dev/api/others/index.html b/dev/api/others/index.html index f29b0d5c..2944437c 100644 --- a/dev/api/others/index.html +++ b/dev/api/others/index.html @@ -1,14 +1,14 @@ Other models · Metalhead.jl

Other models

This is the API reference for some of the models supported by Metalhead.jl that do not fit into the other categories.

The higher-level model constructors

Metalhead.AlexNetType
AlexNet(; pretrain::Bool = false, inchannels::Integer = 3,
-        nclasses::Integer = 1000)

Create an AlexNet model. (reference).

Arguments

  • pretrain: set to true to load pre-trained weights for ImageNet
  • inchannels: The number of input channels.
  • nclasses: the number of output classes
Warning

AlexNet does not currently support pretrained weights.

See also alexnet.

source
Metalhead.VGGType
VGG(depth::Integer; pretrain::Bool = false, batchnorm::Bool = false,
-    inchannels::Integer = 3, nclasses::Integer = 1000)

Create a VGG style model with specified depth. (reference).

Warning

VGG does not currently support pretrained weights for the batchnorm = true option.

Arguments

  • depth: the depth of the VGG model. Must be one of [11, 13, 16, 19].
  • pretrain: set to true to load pre-trained model weights for ImageNet
  • batchnorm: set to true to use batch normalization after each convolution
  • inchannels: number of input channels
  • nclasses: number of output classes

See also vgg.

source
Metalhead.SqueezeNetType
SqueezeNet(; pretrain::Bool = false, inchannels::Integer = 3,
-           nclasses::Integer = 1000)

Create a SqueezeNet (reference).

Arguments

  • pretrain: set to true to load the pre-trained weights for ImageNet
  • inchannels: number of input channels.
  • nclasses: the number of output classes.

See also squeezenet.

source
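
A quick construction sketch for these models (no pretrained weights are loaded; the class count for SqueezeNet is illustrative):

using Metalhead

alex  = AlexNet()
vgg16 = VGG(16; batchnorm = true)
sq    = SqueezeNet(; nclasses = 10)
size(sq(rand(Float32, 224, 224, 3, 1)))     # (10, 1)
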
Metalhead.UNetType
UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,
-     encoder_backbone = Metalhead.backbone(DenseNet(121)); pretrain::Bool = false)

Creates a UNet model with an encoder built from the specified backbone. By default it uses a DenseNet backbone, but any ResNet-like Metalhead model can be used for the encoder. (reference).

Arguments

  • imsize: size of input image
  • inchannels: number of channels in input image
  • outplanes: number of output feature planes.
  • encoder_backbone: The backbone layers of specified model to be used as encoder. For example, Metalhead.backbone(Metalhead.ResNet(18)) can be passed to instantiate a UNet with layers of resnet18 as encoder.
  • pretrain: Whether to load the pre-trained weights for ImageNet
Warning

UNet does not currently support pretrained weights.

See also Metalhead.unet.

source
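
Following the docstring, a sketch of a UNet with a ResNet-18 encoder and a single output plane (for example a binary segmentation mask):

using Metalhead

unet_model = UNet((256, 256), 3, 1, Metalhead.backbone(Metalhead.ResNet(18)))
size(unet_model(rand(Float32, 256, 256, 3, 1)))   # expected: (256, 256, 1, 1)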

The mid-level functions

Metalhead.alexnetFunction
alexnet(; dropout_prob = 0.5, inchannels::Integer = 3, nclasses::Integer = 1000)

Create an AlexNet model (reference).

Arguments

  • dropout_prob: dropout probability for the classifier
  • inchannels: The number of input channels.
  • nclasses: the number of output classes
source
Metalhead.vggFunction
vgg(imsize::Dims{2}; config, batchnorm::Bool = false, fcsize::Integer = 4096,
-    dropout_prob = 0.0, inchannels::Integer = 3, nclasses::Integer = 1000)

Create a VGG model (reference).

Arguments

  • imsize: input image width and height as a tuple
  • config: the configuration for the convolution layers (see Metalhead.vgg_convolutional_layers)
  • inchannels: number of input channels
  • batchnorm: set to true to use batch normalization after each convolution
  • nclasses: number of output classes
  • fcsize: intermediate fully connected layer size (see Metalhead.vgg_classifier_layers)
  • dropout_prob: dropout level between fully connected layers
source
Metalhead.squeezenetFunction
squeezenet(; dropout_prob = 0.5, inchannels::Integer = 3, nclasses::Integer = 1000)

Create a SqueezeNet model. (reference).

Arguments

  • dropout_prob: dropout probability for the classifier head. Set to nothing to disable dropout.
  • inchannels: number of input channels.
  • nclasses: the number of output classes.
source
Metalhead.unetFunction
unet(encoder_backbone, imgdims, outplanes::Integer, final::Any = unet_final_block,
      fdownscale::Integer = 0)

Creates a UNet model with the specified convolutional backbone. The backbone of any Metalhead ResNet-like model can be used as the encoder (reference).

Arguments

- `encoder_backbone`: The backbone layers of specified model to be used as encoder.
 	For example, `Metalhead.backbone(Metalhead.ResNet(18))` can be passed 
 	to instantiate a UNet with layers of resnet18 as encoder.
 - `inputsize`: size of input image
 - `outplanes`: number of output feature planes
 - `final`: final block as described in original paper
-- `fdownscale`: downscale factor
source

Block-level functions

Metalhead.vgg_blockFunction
vgg_block(ifilters, ofilters, depth, batchnorm)

A VGG block of convolution layers (reference).

Arguments

  • ifilters: number of input feature maps
  • ofilters: number of output feature maps
  • depth: number of convolution/convolution + batch norm layers
  • batchnorm: set to true to include batch normalization after each convolution
source
Metalhead.vgg_convolutional_layersFunction
vgg_convolutional_layers(config, batchnorm, inchannels)

Create VGG convolution layers (reference).

Arguments

  • config: vector of tuples (output_channels, num_convolutions) for each block (see Metalhead.vgg_block)
  • batchnorm: set to true to include batch normalization after each convolution
  • inchannels: number of input channels
source
Metalhead.vgg_classifier_layersFunction
vgg_classifier_layers(imsize, nclasses, fcsize, dropout_prob)

Create VGG classifier (fully connected) layers (reference).

Arguments

  • imsize: tuple (width, height, channels) indicating the size after the convolution layers (see Metalhead.vgg_convolutional_layers)
  • nclasses: number of output classes
  • fcsize: input and output size of the intermediate fully connected layer
  • dropout_prob: the dropout level between each fully connected layer
source
diff --git a/dev/api/resnet/index.html b/dev/api/resnet/index.html index 94a08cb8..088c34ee 100644 --- a/dev/api/resnet/index.html +++ b/dev/api/resnet/index.html @@ -1,11 +1,11 @@ -ResNet-like models · Metalhead.jl

ResNet-like models

This is the API reference for the ResNet inspired model structures present in Metalhead.jl.

The higher-level model constructors

Metalhead.ResNetType
ResNet(depth::Integer; pretrain::Bool = false, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a ResNet model with the specified depth. (reference)

Arguments

  • depth: one of [18, 34, 50, 101, 152]. The depth of the ResNet model.
  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • inchannels: The number of input channels.
  • nclasses: the number of output classes

Advanced users who want more configuration options will be better served by using resnet.

source
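
For example (the pretrained call downloads ImageNet weights on first use; the 10-class variant is randomly initialised):

using Metalhead

resnet50 = ResNet(50; pretrain = true)
resnet18 = ResNet(18; nclasses = 10)
size(resnet18(rand(Float32, 224, 224, 3, 1)))   # (10, 1)
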
Metalhead.WideResNetType
WideResNet(depth::Integer; pretrain::Bool = false, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a Wide ResNet model with the specified depth. The model is the same as ResNet except that the number of bottleneck channels is twice as large in every block. The number of channels in the outer 1x1 convolutions is the same. (reference)

Arguments

  • depth: one of [18, 34, 50, 101, 152]. The depth of the Wide ResNet model.
  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • inchannels: The number of input channels.
  • nclasses: The number of output classes

Advanced users who want more configuration options will be better served by using resnet.

source
Metalhead.ResNeXtType
ResNeXt(depth::Integer; pretrain::Bool = false, cardinality::Integer = 32,
-        base_width::Integer = 4, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a ResNeXt model with the specified depth, cardinality, and base width. (reference)

Arguments

  • depth: one of [50, 101, 152]. The depth of the ResNeXt model.

  • pretrain: set to true to load the model with pre-trained weights for ImageNet. Supported configurations are:

    • depth 50, cardinality of 32 and base width of 4.
    • depth 101, cardinality of 32 and base width of 8.
    • depth 101, cardinality of 64 and base width of 4.
  • cardinality: the number of groups to be used in the 3x3 convolution in each block.

  • base_width: the number of feature maps in each group.

  • inchannels: the number of input channels.

  • nclasses: the number of output classes

Advanced users who want more configuration options will be better served by using resnet.

source
Metalhead.SEResNetType
SEResNet(depth::Integer; pretrain::Bool = false, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a SEResNet model with the specified depth. (reference)

Arguments

  • depth: one of [18, 34, 50, 101, 152]. The depth of the SEResNet model.
  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • inchannels: the number of input channels.
  • nclasses: the number of output classes
Warning

SEResNet does not currently support pretrained weights.

Advanced users who want more configuration options will be better served by using resnet.

source
Metalhead.SEResNeXtType
SEResNeXt(depth::Integer; pretrain::Bool = false, cardinality::Integer = 32,
-          base_width::Integer = 4, inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a SEResNeXt model with the specified depth, cardinality, and base width. (reference)

Arguments

  • depth: one of [50, 101, 152]. The depth of the SEResNeXt model.
  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • cardinality: the number of groups to be used in the 3x3 convolution in each block.
  • base_width: the number of feature maps in each group.
  • inchannels: the number of input channels
  • nclasses: the number of output classes
Warning

SEResNeXt does not currently support pretrained weights.

Advanced users who want more configuration options will be better served by using resnet.

source
Metalhead.Res2NetType
Res2Net(depth::Integer; pretrain::Bool = false, scale::Integer = 4,
         base_width::Integer = 26, inchannels::Integer = 3,
-        nclasses::Integer = 1000)

Creates a Res2Net model with the specified depth, scale, and base width. (reference)

Arguments

  • depth: one of [50, 101, 152]. The depth of the Res2Net model.
  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • scale: the number of feature groups in the block. See the paper for more details.
  • base_width: the number of feature maps in each group.
  • inchannels: the number of input channels.
  • nclasses: the number of output classes
Warning

Res2Net does not currently support pretrained weights.

Advanced users who want more configuration options will be better served by using resnet.

source
Metalhead.Res2NeXtType
Res2NeXt(depth::Integer; pretrain::Bool = false, scale::Integer = 4,
          base_width::Integer = 4, cardinality::Integer = 8,
-         inchannels::Integer = 3, nclasses::Integer = 1000)

Creates a Res2NeXt model with the specified depth, scale, base width and cardinality. (reference)

Arguments

  • depth: one of [50, 101, 152]. The depth of the Res2Net model.
  • pretrain: set to true to load the model with pre-trained weights for ImageNet
  • scale: the number of feature groups in the block. See the paper for more details.
  • base_width: the number of feature maps in each group.
  • cardinality: the number of groups in the 3x3 convolutions.
  • inchannels: the number of input channels.
  • nclasses: the number of output classes
Warning

Res2NeXt does not currently support pretrained weights.

Advanced users who want more configuration options will be better served by using resnet.

source

The mid-level function

Metalhead.resnetFunction
resnet(block_type, block_repeats::AbstractVector{<:Integer},
        downsample_opt::NTuple{2, Any} = (downsample_conv, downsample_identity);
        cardinality::Integer = 1, base_width::Integer = 64,
        inplanes::Integer = 64, reduction_factor::Integer = 1,
@@ -15,28 +15,28 @@
        use_conv::Bool = false, dropblock_prob = nothing,
        stochastic_depth_prob = nothing, dropout_prob = nothing,
        imsize::Dims{2} = (256, 256), inchannels::Integer = 3,
-       nclasses::Integer = 1000, kwargs...)

Creates a generic ResNet-like model that is used to create the higher-level model constructors like ResNet, Wide ResNet, ResNeXt and Res2Net. For an even more generic model API, see Metalhead.build_resnet.

Arguments

  • block_type: The type of block to be used in the model. This can be one of Metalhead.basicblock, Metalhead.bottleneck and Metalhead.bottle2neck. basicblock is used in the original ResNet paper for ResNet-18 and ResNet-34, and bottleneck is used in the original ResNet-50 and ResNet-101 models, as well as for the Wide ResNet and ResNeXt models. bottle2neck is introduced in the Res2Net paper.
  • block_repeats: A Vector of integers specifying the number of times each block is repeated in each stage of the ResNet model. For example, [3, 4, 6, 3] is the configuration used in ResNet-50, which has 3 blocks in the first stage, 4 blocks in the second stage, 6 blocks in the third stage and 3 blocks in the fourth stage.
  • downsample_opt: A NTuple of two callbacks that are used to determine the downsampling operation to be used in the model. The first callback is used to determine the convolutional operation to be used in the downsampling operation and the second callback is used to determine the identity operation to be used in the downsampling operation.
  • cardinality: The number of groups to be used in the 3x3 convolutional layer in the bottleneck block. This is usually modified from the default value of 1 in the ResNet models to 32 or 64 in the ResNeXt models.
  • base_width: The base width of the convolutional layer in the blocks of the model.
  • inplanes: The number of input channels in the first convolutional layer.
  • reduction_factor: The reduction factor used in the model.
  • connection: This is a function that determines the residual connection in the model. For resnets, either of Metalhead.Layers.addact or Metalhead.Layers.actadd is recommended. These decide whether the residual connection is added before or after the activation function.
  • norm_layer: The normalisation layer to be used in the model.
  • revnorm: set to true to place the normalisation layers before the convolutions
  • attn_fn: A callback that is used to determine the attention function to be used in the model. See Metalhead.Layers.squeeze_excite for an example.
  • pool_layer: A fully-instantiated pooling layer passed in to be used by the classifier head. For example, AdaptiveMeanPool((1, 1)) is used in the ResNet family by default, but something like MeanPool((3, 3)) should also work provided the dimensions after applying the pooling layer are compatible with the rest of the classifier head.
  • use_conv: Set to true to use convolutions instead of identity operations in the model.
  • dropblock_prob: DropBlock probability to be used in the model. Set to nothing to disable DropBlock. See Metalhead.DropBlock for more details.
  • stochastic_depth_prob: StochasticDepth probability to be used in the model. Set to nothing to disable StochasticDepth. See Metalhead.StochasticDepth for more details.
  • dropout_prob: Dropout probability to be used in the classifier head. Set to nothing to disable Dropout.
  • imsize: The size of the input (height, width).
  • inchannels: The number of input channels.
  • nclasses: The number of output classes.
  • kwargs: Additional keyword arguments to be passed to the block builder (note: ignore this argument if you are not sure what it does. To know more about how this works, check out the section of the documentation that talks about builders in Metalhead and specifically for the ResNet block functions).
source

Lower-level functions and builders

Block functions

Metalhead.basicblockFunction
basicblock(inplanes::Integer, planes::Integer; stride::Integer = 1,
+       nclasses::Integer = 1000, kwargs...)

Creates a generic ResNet-like model that is used to create the higher-level model constructors such as ResNet, Wide ResNet, ResNeXt and Res2Net. For an even more generic model API, see Metalhead.build_resnet.

Arguments

  • block_type: The type of block to be used in the model. This can be one of Metalhead.basicblock, Metalhead.bottleneck and Metalhead.bottle2neck. basicblock is used in the original ResNet paper for ResNet-18 and ResNet-34, and bottleneck is used in the original ResNet-50 and ResNet-101 models, as well as for the Wide ResNet and ResNeXt models. bottle2neck is introduced in the Res2Net paper.
  • block_repeats: A Vector of integers specifying the number of times each block is repeated in each stage of the ResNet model. For example, [3, 4, 6, 3] is the configuration used in ResNet-50, which has 3 blocks in the first stage, 4 blocks in the second stage, 6 blocks in the third stage and 3 blocks in the fourth stage.
  • downsample_opt: An NTuple of two callbacks that determine the downsampling operations used in the model. The first callback determines the convolutional downsampling operation, and the second determines the identity downsampling operation.
  • cardinality: The number of groups to be used in the 3x3 convolutional layer in the bottleneck block. This is usually modified from the default value of 1 in the ResNet models to 32 or 64 in the ResNeXt models.
  • base_width: The base width of the convolutional layer in the blocks of the model.
  • inplanes: The number of input channels in the first convolutional layer.
  • reduction_factor: The reduction factor used in the model.
  • connection: This is a function that determines the residual connection in the model. For resnets, either of Metalhead.Layers.addact or Metalhead.Layers.actadd is recommended. These decide whether the residual connection is added before or after the activation function.
  • norm_layer: The normalisation layer to be used in the model.
  • revnorm: set to true to place the normalisation layers before the convolutions
  • attn_fn: A callback that is used to determine the attention function to be used in the model. See Metalhead.Layers.squeeze_excite for an example.
  • pool_layer: A fully-instantiated pooling layer passed in to be used by the classifier head. For example, AdaptiveMeanPool((1, 1)) is used in the ResNet family by default, but something like MeanPool((3, 3)) should also work provided the dimensions after applying the pooling layer are compatible with the rest of the classifier head.
  • use_conv: Set to true to use convolutions instead of identity operations in the model.
  • dropblock_prob: DropBlock probability to be used in the model. Set to nothing to disable DropBlock. See Metalhead.DropBlock for more details.
  • stochastic_depth_prob: StochasticDepth probability to be used in the model. Set to nothing to disable StochasticDepth. See Metalhead.StochasticDepth for more details.
  • dropout_prob: Dropout probability to be used in the classifier head. Set to nothing to disable Dropout.
  • imsize: The size of the input (height, width).
  • inchannels: The number of input channels.
  • nclasses: The number of output classes.
  • kwargs: Additional keyword arguments to be passed to the block builder (note: ignore this argument if you are not sure what it does. To know more about how this works, check out the section of the documentation that talks about builders in Metalhead and specifically for the ResNet block functions).
source
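
For instance, a ResNet-50-style network can be sketched directly with this function using only the options documented above (see the how-to guide for richer configurations):

using Metalhead

# bottleneck blocks repeated [3, 4, 6, 3] times, as in ResNet-50
model = Metalhead.resnet(Metalhead.bottleneck, [3, 4, 6, 3]; nclasses = 1000)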

Lower-level functions and builders

Block functions

Metalhead.basicblockFunction
basicblock(inplanes::Integer, planes::Integer; stride::Integer = 1,
            reduction_factor::Integer = 1, activation = relu,
            norm_layer = BatchNorm, revnorm::Bool = false,
            drop_block = identity, drop_path = identity,
-           attn_fn = planes -> identity)

Creates a basic residual block (see reference). This function creates the layers. For more configuration options and to see the function used to build the block for the model, see Metalhead.basicblock_builder.

Arguments

  • inplanes: number of input feature maps
  • planes: number of feature maps for the block
  • stride: the stride of the block
  • reduction_factor: the factor by which the input feature maps are reduced before the first convolution.

  • activation: the activation function to use.
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
  • drop_block: the drop block layer
  • drop_path: the drop path layer
  • attn_fn: the attention function to use. See squeeze_excite for an example.
source
Metalhead.bottleneckFunction
bottleneck(inplanes::Integer, planes::Integer; stride::Integer,
+           attn_fn = planes -> identity)

Creates a basic residual block (see reference). This function creates the layers. For more configuration options and to see the function used to build the block for the model, see Metalhead.basicblock_builder.

Arguments

  • inplanes: number of input feature maps
  • planes: number of feature maps for the block
  • stride: the stride of the block
  • reduction_factor: the factor by which the input feature maps are reduced before the first convolution.

  • activation: the activation function to use.
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
  • drop_block: the drop block layer
  • drop_path: the drop path layer
  • attn_fn: the attention function to use. See squeeze_excite for an example.
source
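
A rough sketch of calling this function on its own follows; normally the builders described below call it for you, and the residual connection is added separately:

using Metalhead

# layers of a single basic block with 64 input and 64 output feature maps
layers = Metalhead.basicblock(64, 64; stride = 1)
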
Metalhead.bottleneckFunction
bottleneck(inplanes::Integer, planes::Integer; stride::Integer,
            cardinality::Integer = 1, base_width::Integer = 64,
            reduction_factor::Integer = 1, activation = relu,
            norm_layer = BatchNorm, revnorm::Bool = false,
            drop_block = identity, drop_path = identity,
-           attn_fn = planes -> identity)

Creates a bottleneck residual block (see reference). This function creates the layers. For more configuration options and to see the function used to build the block for the model, see Metalhead.bottleneck_builder.

Arguments

  • inplanes: number of input feature maps
  • planes: number of feature maps for the block
  • stride: the stride of the block
  • cardinality: the number of groups in the convolution.
  • base_width: the number of output feature maps for each convolutional group.
  • reduction_factor: the factor by which the input feature maps are reduced before the first convolution.
  • activation: the activation function to use.
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
  • drop_block: the drop block layer
  • drop_path: the drop path layer
  • attn_fn: the attention function to use. See squeeze_excite for an example.
source
Metalhead.bottle2neckFunction
bottle2neck(inplanes::Integer, planes::Integer; stride::Integer = 1,
+           attn_fn = planes -> identity)

Creates a bottleneck residual block (see reference). This function creates the layers. For more configuration options and to see the function used to build the block for the model, see Metalhead.bottleneck_builder.

Arguments

  • inplanes: number of input feature maps
  • planes: number of feature maps for the block
  • stride: the stride of the block
  • cardinality: the number of groups in the convolution.
  • base_width: the number of output feature maps for each convolutional group.
  • reduction_factor: the factor by which the input feature maps are reduced before the first convolution.
  • activation: the activation function to use.
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
  • drop_block: the drop block layer
  • drop_path: the drop path layer
  • attn_fn: the attention function to use. See squeeze_excite for an example.
source
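
A minimal sketch showing how the grouped-convolution options give a ResNeXt-flavoured block (the values are illustrative):

using Metalhead

# bottleneck block with 32 groups and a base width of 4, as in ResNeXt-style models
layers = Metalhead.bottleneck(256, 64; stride = 1, cardinality = 32, base_width = 4)
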
Metalhead.bottle2neckFunction
bottle2neck(inplanes::Integer, planes::Integer; stride::Integer = 1,
             cardinality::Integer = 1, base_width::Integer = 26,
             scale::Integer = 4, activation = relu, norm_layer = BatchNorm,
-            revnorm::Bool = false, attn_fn = planes -> identity)

Creates a bottleneck block as described in the Res2Net paper. (reference) This function creates the layers. For more configuration options and to see the function used to build the block for the model, see Metalhead.bottle2neck_builder.

Arguments

  • inplanes: number of input feature maps
  • planes: number of feature maps for the block
  • stride: the stride of the block
  • cardinality: the number of groups in the 3x3 convolutions.
  • base_width: the number of output feature maps for each convolutional group.
  • scale: the number of feature groups in the block. See the paper for more details.
  • activation: the activation function to use.
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the batch norm before the convolution
  • attn_fn: the attention function to use. See squeeze_excite for an example.
source

Downsampling functions

Metalhead.downsample_identityFunction
downsample_identity(inplanes::Integer, outplanes::Integer; kwargs...)

Creates an identity downsample layer. This returns identity if inplanes == outplanes. If outplanes > inplanes, it maps the input to outplanes channels using a 1x1 max pooling layer and zero padding.

Warning

This does not currently support the scenario where inplanes > outplanes.

Arguments

  • inplanes: number of input feature maps
  • outplanes: number of output feature maps

Note that kwargs are ignored and only included for compatibility with other downsample layers.

source
Metalhead.downsample_convFunction
downsample_conv(inplanes::Integer, outplanes::Integer; stride::Integer = 1,
-                norm_layer = BatchNorm, revnorm::Bool = false)

Creates a 1x1 convolutional downsample layer as used in ResNet.

Arguments

  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • stride: the stride of the convolution
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
source
Metalhead.downsample_poolFunction
downsample_pool(inplanes::Integer, outplanes::Integer; stride::Integer = 1,
-                norm_layer = BatchNorm, revnorm::Bool = false)

Creates a pooling-based downsample layer as described in the Bag of Tricks paper. This adds an average pooling layer of size (2, 2) with the given stride, followed by a 1x1 convolution.

Arguments

  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • stride: the stride of the convolution
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
source

Block builders

Metalhead.basicblock_builderFunction
basicblock_builder(block_repeats::AbstractVector{<:Integer};
+            revnorm::Bool = false, attn_fn = planes -> identity)

Creates a bottleneck block as described in the Res2Net paper. (reference) This function creates the layers. For more configuration options and to see the function used to build the block for the model, see Metalhead.bottle2neck_builder.

Arguments

  • inplanes: number of input feature maps
  • planes: number of feature maps for the block
  • stride: the stride of the block
  • cardinality: the number of groups in the 3x3 convolutions.
  • base_width: the number of output feature maps for each convolutional group.
  • scale: the number of feature groups in the block. See the paper for more details.
  • activation: the activation function to use.
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the batch norm before the convolution
  • attn_fn: the attention function to use. See squeeze_excite for an example.
source
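
As a sketch, a single Res2Net-style block using the defaults shown in the signature (illustrative only):

using Metalhead

# bottle2neck block with 4 feature groups ("scale") and a base width of 26
layers = Metalhead.bottle2neck(256, 64; stride = 1, base_width = 26, scale = 4)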

Downsampling functions

Metalhead.downsample_identityFunction
downsample_identity(inplanes::Integer, outplanes::Integer; kwargs...)

Creates an identity downsample layer. This returns identity if inplanes == outplanes. If outplanes > inplanes, it maps the input to outplanes channels using a 1x1 max pooling layer and zero padding.

Warning

This does not currently support the scenario where inplanes > outplanes.

Arguments

  • inplanes: number of input feature maps
  • outplanes: number of output feature maps

Note that kwargs are ignored and only included for compatibility with other downsample layers.

source
Metalhead.downsample_convFunction
downsample_conv(inplanes::Integer, outplanes::Integer; stride::Integer = 1,
+                norm_layer = BatchNorm, revnorm::Bool = false)

Creates a 1x1 convolutional downsample layer as used in ResNet.

Arguments

  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • stride: the stride of the convolution
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
source
Metalhead.downsample_poolFunction
downsample_pool(inplanes::Integer, outplanes::Integer; stride::Integer = 1,
+                norm_layer = BatchNorm, revnorm::Bool = false)

Creates a pooling-based downsample layer as described in the Bag of Tricks paper. This adds an average pooling layer of size (2, 2) with the given stride, followed by a 1x1 convolution.

Arguments

  • inplanes: number of input feature maps
  • outplanes: number of output feature maps
  • stride: the stride of the convolution
  • norm_layer: the normalization layer to use.
  • revnorm: set to true to place the normalisation layer before the convolution
source
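
A small sketch contrasting the three downsample helpers (an illustration, not library test code):

using Metalhead

down_id   = Metalhead.downsample_identity(64, 64)            # identity, since inplanes == outplanes
down_conv = Metalhead.downsample_conv(64, 128; stride = 2)   # 1x1 convolution + normalisation
down_pool = Metalhead.downsample_pool(64, 128; stride = 2)   # mean pooling followed by a 1x1 convolution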

Block builders

Metalhead.basicblock_builderFunction
basicblock_builder(block_repeats::AbstractVector{<:Integer};
                    inplanes::Integer = 64, reduction_factor::Integer = 1,
                    expansion::Integer = 1, norm_layer = BatchNorm,
                    revnorm::Bool = false, activation = relu,
                    attn_fn = planes -> identity,
                    dropblock_prob = nothing, stochastic_depth_prob = nothing,
                    stride_fn = resnet_stride, planes_fn = resnet_planes,
-                   downsample_tuple = (downsample_conv, downsample_identity))

Builder for creating a basic block for a ResNet model. (reference)

Arguments

  • block_repeats: number of repeats of a block in each stage

  • inplanes: number of input channels

  • reduction_factor: reduction factor for the number of channels in each stage

  • expansion: expansion factor for the number of channels for the block

  • norm_layer: normalization layer to use

  • revnorm: set to true to place normalization layer before the convolution

  • activation: activation function to use

  • attn_fn: attention function to use

  • dropblock_prob: dropblock probability. Set to nothing to disable DropBlock

  • stochastic_depth_prob: stochastic depth probability. Set to nothing to disable StochasticDepth

  • stride_fn: callback for computing the stride of the block

  • planes_fn: callback for computing the number of channels in each block

  • downsample_tuple: two-element tuple of downsample functions to use. The first one is used when the number of channels changes in the block, the second one is used when the number of channels stays the same.

source
Metalhead.bottleneck_builderFunction
bottleneck_builder(block_repeats::AbstractVector{<:Integer};
+                   downsample_tuple = (downsample_conv, downsample_identity))

Builder for creating a basic block for a ResNet model. (reference)

Arguments

  • block_repeats: number of repeats of a block in each stage

  • inplanes: number of input channels

  • reduction_factor: reduction factor for the number of channels in each stage

  • expansion: expansion factor for the number of channels for the block

  • norm_layer: normalization layer to use

  • revnorm: set to true to place normalization layer before the convolution

  • activation: activation function to use

  • attn_fn: attention function to use

  • dropblock_prob: dropblock probability. Set to nothing to disable DropBlock

  • stochastic_depth_prob: stochastic depth probability. Set to nothing to disable StochasticDepth

  • stride_fn: callback for computing the stride of the block

  • planes_fn: callback for computing the number of channels in each block

  • downsample_tuple: two-element tuple of downsample functions to use. The first one is used when the number of channels changes in the block, the second one is used when the number of channels stays the same.

source
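
As an illustration, a builder like this is created once per model and then handed to Metalhead.build_resnet; a fuller end-to-end sketch appears below, after the build_resnet documentation:

using Metalhead

block_repeats = [2, 2, 2, 2]                              # ResNet-18-like configuration
get_layers = Metalhead.basicblock_builder(block_repeats)  # callable of (stage_idx, block_idx)
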
Metalhead.bottleneck_builderFunction
bottleneck_builder(block_repeats::AbstractVector{<:Integer};
                    inplanes::Integer = 64, cardinality::Integer = 1,
                    base_width::Integer = 64, reduction_factor::Integer = 1,
                    expansion::Integer = 4, norm_layer = BatchNorm,
@@ -44,13 +44,13 @@
                    attn_fn = planes -> identity, dropblock_prob = nothing,
                    stochastic_depth_prob = nothing, stride_fn = resnet_stride,
                    planes_fn = resnet_planes,
-                   downsample_tuple = (downsample_conv, downsample_identity))

Builder for creating a bottleneck block for a ResNet/ResNeXt model. (reference)

Arguments

  • block_repeats: number of repeats of a block in each stage
  • inplanes: number of input channels
  • cardinality: number of groups for the convolutional layer
  • base_width: base width for the convolutional layer
  • reduction_factor: reduction factor for the number of channels in each stage
  • expansion: expansion factor for the number of channels for the block
  • norm_layer: normalization layer to use
  • revnorm: set to true to place normalization layer before the convolution
  • activation: activation function to use
  • attn_fn: attention function to use
  • dropblock_prob: dropblock probability. Set to nothing to disable DropBlock
  • stochastic_depth_prob: stochastic depth probability. Set to nothing to disable StochasticDepth
  • stride_fn: callback for computing the stride of the block
  • planes_fn: callback for computing the number of channels in each block
  • downsample_tuple: two-element tuple of downsample functions to use. The first one is used when the number of channels changes in the block, the second one is used when the number of channels stays the same.
source
Metalhead.bottle2neck_builderFunction
bottle2neck_builder(block_repeats::AbstractVector{<:Integer};
+                   downsample_tuple = (downsample_conv, downsample_identity))

Builder for creating a bottleneck block for a ResNet/ResNeXt model. (reference)

Arguments

  • block_repeats: number of repeats of a block in each stage
  • inplanes: number of input channels
  • cardinality: number of groups for the convolutional layer
  • base_width: base width for the convolutional layer
  • reduction_factor: reduction factor for the number of channels in each stage
  • expansion: expansion factor for the number of channels for the block
  • norm_layer: normalization layer to use
  • revnorm: set to true to place normalization layer before the convolution
  • activation: activation function to use
  • attn_fn: attention function to use
  • dropblock_prob: dropblock probability. Set to nothing to disable DropBlock
  • stochastic_depth_prob: stochastic depth probability. Set to nothing to disable StochasticDepth
  • stride_fn: callback for computing the stride of the block
  • planes_fn: callback for computing the number of channels in each block
  • downsample_tuple: two-element tuple of downsample functions to use. The first one is used when the number of channels changes in the block, the second one is used when the number of channels stays the same.
source
Metalhead.bottle2neck_builderFunction
bottle2neck_builder(block_repeats::AbstractVector{<:Integer};
                     inplanes::Integer = 64, cardinality::Integer = 1,
                     base_width::Integer = 26, scale::Integer = 4,
                     expansion::Integer = 4, norm_layer = BatchNorm,
                     revnorm::Bool = false, activation = relu,
                     attn_fn = planes -> identity, stride_fn = resnet_stride,
                     planes_fn = resnet_planes,
-                    downsample_tuple = (downsample_conv, downsample_identity))

Builder for creating a bottle2neck block for a Res2Net model. (reference)

Arguments

  • block_repeats: number of repeats of a block in each stage
  • inplanes: number of input channels
  • cardinality: number of groups for the convolutional layer
  • base_width: base width for the convolutional layer
  • scale: scale for the number of channels in each block
  • expansion: expansion factor for the number of channels for the block
  • norm_layer: normalization layer to use
  • revnorm: set to true to place normalization layer before the convolution
  • activation: activation function to use
  • attn_fn: attention function to use
  • stride_fn: callback for computing the stride of the block
  • planes_fn: callback for computing the number of channels in each block
  • downsample_tuple: two-element tuple of downsample functions to use. The first one is used when the number of channels changes in the block, the second one is used when the number of channels stays the same.
source

Generic ResNet model builder

Metalhead.build_resnetFunction
build_resnet(img_dims, stem, get_layers, block_repeats::AbstractVector{<:Integer},
-             connection, classifier_fn)

Creates a generic ResNet-like model.

Info

This is a very generic and flexible, but low-level, function that can be used to create any of the ResNet variants. For a more user-friendly function, see Metalhead.resnet.

Arguments

  • img_dims: The dimensions of the input image. This is used to determine the number of feature maps to be passed to the classifier. This should be a tuple of the form (height, width, channels).
  • stem: The stem of the ResNet model. The stem should be created outside of this function and passed in as an argument. This is done to allow for more flexibility in creating the stem. resnet_stem is a helper function that Metalhead provides which is recommended for creating the stem.
  • get_layers is a function that takes in two inputs - the stage_idx, or the index of the stage, and the block_idx, or the index of the block within the stage. It returns a tuple of layers. If the tuple returned by get_layers has more than one element, then connection is used to splat this tuple into Parallel - if not, then the only element of the tuple is directly inserted into the network. get_layers is a very specific function and should not be created on its own. Instead, use one of the builders provided by Metalhead to create it.
  • block_repeats: This is a Vector of integers that specifies the number of repeats of each block in each stage.
  • connection: This is a function that determines the residual connection in the model. For resnets, either of Metalhead.Layers.addact or Metalhead.Layers.actadd is recommended.
  • classifier_fn: This is a function that takes in the number of feature maps and returns a classifier. This is usually built as a closure using a function like Metalhead.create_classifier. For example, if the number of output classes is nclasses, then the function can be defined as channels -> create_classifier(channels, nclasses).
source

Utility callbacks

Metalhead.resnet_planesFunction
resnet_planes(block_repeats::AbstractVector{<:Integer})

Default callback for determining the number of channels in each block in a ResNet model.

Arguments

block_repeats: A Vector of integers specifying the number of times each block is repeated in each stage of the ResNet model. For example, [3, 4, 6, 3] is the configuration used in ResNet-50, which has 3 blocks in the first stage, 4 blocks in the second stage, 6 blocks in the third stage and 3 blocks in the fourth stage.

source
Metalhead.resnet_strideFunction
resnet_stride(stage_idx::Integer, block_idx::Integer)

Default callback for determining the stride of a block in a ResNet model. Returns 2 for the first block in every stage except the first stage and 1 for all other blocks.

Arguments

  • stage_idx: The index of the stage in the ResNet model.
  • block_idx: The index of the block in the stage.
source
Metalhead.resnet_stemFunction
resnet_stem(; stem_type = :default, inchannels::Integer = 3, replace_stem_pool = false,
-              norm_layer = BatchNorm, activation = relu)

Builds a stem to be used in a ResNet model. See the stem argument of resnet for details on how to use this function.

Arguments

  • stem_type: The type of stem to be built. One of [:default, :deep, :deep_tiered].

    • :default: Builds a stem based on the default ResNet stem, which consists of a single 7x7 convolution with stride 2 and a normalisation layer followed by a 3x3 max pooling layer with stride 2.
    • :deep: This borrows ideas from other papers (InceptionResNetv2, for example) in using a deeper stem with 3 successive 3x3 convolutions having normalisation layers after each one. This is followed by a 3x3 max pooling layer with stride 2.
    • :deep_tiered: A variant of the :deep stem that has a larger width in the second convolution. This is an experimental variant from the timm library in Python that shows performance improvements over the :deep stem in some cases.
  • inchannels: number of input channels

  • replace_stem_pool: Set to true to replace the max pooling layer in the stem with a 3x3 convolution + normalization with a stride of two.

  • norm_layer: The normalisation layer used in the stem.

  • activation: The activation function used in the stem.

source
+ downsample_tuple = (downsample_conv, downsample_identity))

Builder for creating a bottle2neck block for a Res2Net model. (reference)

Arguments

  • block_repeats: number of repeats of a block in each stage
  • inplanes: number of input channels
  • cardinality: number of groups for the convolutional layer
  • base_width: base width for the convolutional layer
  • scale: scale for the number of channels in each block
  • expansion: expansion factor for the number of channels for the block
  • norm_layer: normalization layer to use
  • revnorm: set to true to place normalization layer before the convolution
  • activation: activation function to use
  • attn_fn: attention function to use
  • stride_fn: callback for computing the stride of the block
  • planes_fn: callback for computing the number of channels in each block
  • downsample_tuple: two-element tuple of downsample functions to use. The first one is used when the number of channels changes in the block, the second one is used when the number of channels stays the same.
source

Generic ResNet model builder

Metalhead.build_resnetFunction
build_resnet(img_dims, stem, get_layers, block_repeats::AbstractVector{<:Integer},
+             connection, classifier_fn)

Creates a generic ResNet-like model.

Info

This is a very generic and flexible, but low-level, function that can be used to create any of the ResNet variants. For a more user-friendly function, see Metalhead.resnet.

Arguments

  • img_dims: The dimensions of the input image. This is used to determine the number of feature maps to be passed to the classifier. This should be a tuple of the form (height, width, channels).
  • stem: The stem of the ResNet model. The stem should be created outside of this function and passed in as an argument. This is done to allow for more flexibility in creating the stem. resnet_stem is a helper function that Metalhead provides which is recommended for creating the stem.
  • get_layers is a function that takes in two inputs - the stage_idx, or the index of the stage, and the block_idx, or the index of the block within the stage. It returns a tuple of layers. If the tuple returned by get_layers has more than one element, then connection is used to splat this tuple into Parallel - if not, then the only element of the tuple is directly inserted into the network. get_layers is a very specific function and should not be created on its own. Instead, use one of the builders provided by Metalhead to create it.
  • block_repeats: This is a Vector of integers that specifies the number of repeats of each block in each stage.
  • connection: This is a function that determines the residual connection in the model. For resnets, either of Metalhead.Layers.addact or Metalhead.Layers.actadd is recommended.
  • classifier_fn: This is a function that takes in the number of feature maps and returns a classifier. This is usually built as a closure using a function like Metalhead.create_classifier. For example, if the number of output classes is nclasses, then the function can be defined as channels -> create_classifier(channels, nclasses).
source
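
Putting the pieces together, here is a rough sketch of how a small ResNet-18-like model could be assembled from these low-level parts. The connection and classifier closures below are our own illustrations based on the argument descriptions above; the exact closures used internally by Metalhead.resnet may differ:

using Metalhead, Flux

block_repeats = [2, 2, 2, 2]
stem = Metalhead.resnet_stem(; inchannels = 3)
get_layers = Metalhead.basicblock_builder(block_repeats)
connection = (x, y) -> relu.(x .+ y)                     # simple add-then-activate residual connection
classifier_fn = channels -> Metalhead.create_classifier(channels, 1000)
model = Metalhead.build_resnet((256, 256, 3), stem, get_layers, block_repeats,
                               connection, classifier_fn)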

Utility callbacks

Metalhead.resnet_planesFunction
resnet_planes(block_repeats::AbstractVector{<:Integer})

Default callback for determining the number of channels in each block in a ResNet model.

Arguments

block_repeats: A Vector of integers specifying the number of times each block is repeated in each stage of the ResNet model. For example, [3, 4, 6, 3] is the configuration used in ResNet-50, which has 3 blocks in the first stage, 4 blocks in the second stage, 6 blocks in the third stage and 3 blocks in the fourth stage.

source
Metalhead.resnet_strideFunction
resnet_stride(stage_idx::Integer, block_idx::Integer)

Default callback for determining the stride of a block in a ResNet model. Returns 2 for the first block in every stage except the first stage and 1 for all other blocks.

Arguments

  • stage_idx: The index of the stage in the ResNet model.
  • block_idx: The index of the block in the stage.
source
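
For example, following the rule described above:

using Metalhead

Metalhead.resnet_stride(1, 1)   # 1: the first stage never downsamples
Metalhead.resnet_stride(2, 1)   # 2: first block of every later stage
Metalhead.resnet_stride(2, 2)   # 1: remaining blocks in a stage
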
Metalhead.resnet_stemFunction
resnet_stem(; stem_type = :default, inchannels::Integer = 3, replace_stem_pool = false,
+              norm_layer = BatchNorm, activation = relu)

Builds a stem to be used in a ResNet model. See the stem argument of resnet for details on how to use this function.

Arguments

  • stem_type: The type of stem to be built. One of [:default, :deep, :deep_tiered].

    • :default: Builds a stem based on the default ResNet stem, which consists of a single 7x7 convolution with stride 2 and a normalisation layer followed by a 3x3 max pooling layer with stride 2.
    • :deep: This borrows ideas from other papers (InceptionResNetv2, for example) in using a deeper stem with 3 successive 3x3 convolutions having normalisation layers after each one. This is followed by a 3x3 max pooling layer with stride 2.
    • :deep_tiered: A variant of the :deep stem that has a larger width in the second convolution. This is an experimental variant from the timm library in Python that shows performance improvements over the :deep stem in some cases.
  • inchannels: number of input channels

  • replace_stem_pool: Set to true to replace the max pooling layer in the stem with a 3x3 convolution + normalization with a stride of two.

  • norm_layer: The normalisation layer used in the stem.

  • activation: The activation function used in the stem.

source
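
A brief sketch of building stems with the options documented above (the values are illustrative):

using Metalhead

stem = Metalhead.resnet_stem()                           # default 7x7 convolution stem
deep_stem = Metalhead.resnet_stem(; stem_type = :deep)   # three successive 3x3 convolutions
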
diff --git a/dev/api/utilities/index.html b/dev/api/utilities/index.html index 191bb312..fc3deb8c 100644 --- a/dev/api/utilities/index.html +++ b/dev/api/utilities/index.html @@ -1,2 +1,2 @@ -Model Utilities · Metalhead.jl

Model utilities

Metalhead provides some utility functions for making it easier to work with the models inside the library or to build new ones. The API reference for these is documented below.

Metalhead.backboneFunction
backbone(model)

This function returns the backbone of a model that can be used for feature extraction. A Flux.Chain is returned, which can be indexed/sliced into to get the desired layer(s). Note that the model used here as input must be the "camel-cased" version of the model, e.g. ResNet instead of resnet.

source
Metalhead.classifierFunction
classifier(model)

This function returns the classifier head of a model. This is sometimes useful for fine-tuning a model on a different dataset. A Flux.Chain is returned, which can be indexed/sliced into to get the desired layer(s). Note that the model used here as input must be the "camel-cased" version of the model, e.g. ResNet instead of resnet.

source
+Model Utilities · Metalhead.jl

Model utilities

Metalhead provides some utility functions for making it easier to work with the models inside the library or to build new ones. The API reference for these is documented below.

Metalhead.backboneFunction
backbone(model)

This function returns the backbone of a model that can be used for feature extraction. A Flux.Chain is returned, which can be indexed/sliced into to get the desired layer(s). Note that the model used here as input must be the "camel-cased" version of the model, e.g. ResNet instead of resnet.

source
Metalhead.classifierFunction
classifier(model)

This function returns the classifier head of a model. This is sometimes useful for fine-tuning a model on a different dataset. A Flux.Chain is returned, which can be indexed/sliced into to get the desired layer(s). Note that the model used here as input must be the "camel-cased" version of the model, e.g. ResNet instead of resnet.

source
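
A short sketch of how these two utilities are typically used together, for example when preparing a model for fine-tuning:

using Metalhead, Flux

model = ResNet(18)
features = Metalhead.backbone(model)     # Flux.Chain usable for feature extraction
head = Metalhead.classifier(model)       # classifier head, e.g. as a starting point for fine-tuning
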
diff --git a/dev/api/vit/index.html b/dev/api/vit/index.html index 8ef1feca..ff0df044 100644 --- a/dev/api/vit/index.html +++ b/dev/api/vit/index.html @@ -1,5 +1,5 @@ Vision Transformer models · Metalhead.jl

Vision Transformer models

This is the API reference for the Vision Transformer models supported by Metalhead.jl.

The higher-level model constructors

Metalhead.ViTType
ViT(config::Symbol = base; imsize::Dims{2} = (224, 224), inchannels::Integer = 3,
-    patch_size::Dims{2} = (16, 16), pool = :class, nclasses::Integer = 1000)

Creates a Vision Transformer (ViT) model. (reference).

Arguments

  • config: the model configuration, one of [:tiny, :small, :base, :large, :huge, :giant, :gigantic]
  • imsize: image size
  • inchannels: number of input channels
  • patch_size: size of the patches
  • pool: pooling type, either :class or :mean
  • nclasses: number of classes in the output

See also Metalhead.vit.

source

The mid-level functions

Metalhead.vitFunction
vit(imsize::Dims{2} = (256, 256); inchannels::Integer = 3, patch_size::Dims{2} = (16, 16),
+    patch_size::Dims{2} = (16, 16), pool = :class, nclasses::Integer = 1000)

Creates a Vision Transformer (ViT) model. (reference).

Arguments

  • config: the model configuration, one of [:tiny, :small, :base, :large, :huge, :giant, :gigantic]
  • imsize: image size
  • inchannels: number of input channels
  • patch_size: size of the patches
  • pool: pooling type, either :class or :mean
  • nclasses: number of classes in the output

See also Metalhead.vit.

source
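
For instance, a minimal sketch using only the documented options:

using Metalhead

model = ViT(:base; imsize = (224, 224), patch_size = (16, 16), nclasses = 1000)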

The mid-level functions

Metalhead.vitFunction
vit(imsize::Dims{2} = (256, 256); inchannels::Integer = 3, patch_size::Dims{2} = (16, 16),
     embedplanes = 768, depth = 6, nheads = 16, mlp_ratio = 4.0, dropout_prob = 0.1,
-    emb_dropout_prob = 0.1, pool = :class, nclasses::Integer = 1000)

Creates a Vision Transformer (ViT) model. (reference).

Arguments

  • imsize: image size
  • inchannels: number of input channels
  • patch_size: size of the patches
  • embedplanes: the number of channels after the patch embedding
  • depth: number of blocks in the transformer
  • nheads: number of attention heads in the transformer
  • mlp_ratio: the ratio of hidden channels to embedding channels in the MLP block in the transformer
  • dropout_prob: dropout probability
  • emb_dropout_prob: dropout probability for the positional embedding layer
  • pool: pooling type, either :class or :mean
  • nclasses: number of classes in the output
source
+ emb_dropout_prob = 0.1, pool = :class, nclasses::Integer = 1000)

Creates a Vision Transformer (ViT) model. (reference).

Arguments

source

Contribute to Metalhead.jl

We welcome contributions from anyone to Metalhead.jl! Thank you for taking the time to make our ecosystem better.

You can contribute by fixing bugs, adding new models, or adding pre-trained weights. If you aren't ready to write some code, but you think you found a bug or have a feature request, please post an issue.

Before continuing, make sure you read the FluxML contributing guide for general guidelines and tips.

Fixing bugs

To fix a bug in Metalhead.jl, you can open a PR. It would be helpful to file an issue first so that we can confirm the bug.

Adding models

To add a new model architecture to Metalhead.jl, you can open a PR. Keep in mind a few guiding principles for how this package is designed:

  • reuse layers from Flux as much as possible (e.g. use Parallel before defining a Bottleneck struct)
  • adhere as closely as possible to a reference such as a published paper (i.e. the structure of your model should follow intuitively from the paper)
  • use generic functional builders (e.g. Metalhead.resnet is the underlying function that builds "ResNet-like" models)
  • use multiple dispatch to add convenience constructors that wrap your functional builder

When in doubt, just open a PR! We are more than happy to help review your code to help it align with the rest of the library. After adding a model, you might consider adding some pre-trained weights (see below).

Adding pre-trained weights

To add pre-trained weights for an existing model or new model, you can open a PR. Below, we describe the steps you should follow to get there.

All Metalhead.jl model artifacts are hosted on HuggingFace. You can find the FluxML account here. This documentation from HuggingFace will provide you with an introduction to their ModelHub. In short, the Model Hub is a collection of Git repositories, similar to Julia packages on GitHub. This means you can make a pull request to our HuggingFace repositories to upload updated weight artifacts just like you would make a PR on GitHub to upload code.

  1. Train your model or port the weights from another framework.
  2. Save the model state using BSON.jl with BSON.@save "modelname.bson" model_state=Flux.state(model). It is important that your model is saved under the key model_state.
  3. Compress the saved model as a tarball using tar -cvzf modelname.tar.gz modelname.bson.
  4. Obtain the SHAs (see the Pkg docs). Edit the Artifacts.toml file in the Metalhead.jl repository and add an entry for your model. You can leave the URL empty for now.
  5. Open a PR on Metalhead.jl. Be sure to ping a maintainer (e.g. @darsnack or @theabhirath) to let us know that you are adding a pre-trained weight. We will create a model repository on HuggingFace if it does not already exist.
  6. Open a PR to the corresponding HuggingFace repo. Do this by going to the "Community" tab in the HuggingFace repository. PRs and discussions are shown as the same thing in the HuggingFace web app. You can use your local Git program to clone the repo and make PRs if you wish. Check out the guide on PRs to HuggingFace for more information.
  7. Copy the download URL for the model file that you added to HuggingFace. Make sure to grab the URL for a specific commit and not for the main branch.
  8. Update your Metalhead.jl PR by adding the URL to the Artifacts.toml.
  9. If the tests pass for your weights, we will merge your PR! Your model should pass the acctest function in the Metalhead.jl test suite. If your model already exists in the repo, then these tests are already in place, and you can add your model configuration to the PRETRAINED_MODELS list in the runtests.jl file. Please refer to the ResNet tests as an example.

If you want to fix existing weights, then you can follow the same set of steps.

See the scripts/ folder in the repo for some helpful scripts that can be used to automate some of these steps.

+Contributing to Metalhead · Metalhead.jl

Contribute to Metalhead.jl

We welcome contributions from anyone to Metalhead.jl! Thank you for taking the time to make our ecosystem better.

You can contribute by fixing bugs, adding new models, or adding pre-trained weights. If you aren't ready to write some code, but you think you found a bug or have a feature request, please post an issue.

Before continuing, make sure you read the FluxML contributing guide for general guidelines and tips.

Fixing bugs

To fix a bug in Metalhead.jl, you can open a PR. It would be helpful to file an issue first so that we can confirm the bug.

Adding models

To add a new model architecture to Metalhead.jl, you can open a PR. Keep in mind a few guiding principles for how this package is designed:

  • reuse layers from Flux as much as possible (e.g. use Parallel before defining a Bottleneck struct)
  • adhere as closely as possible to a reference such as a published paper (i.e. the structure of your model should follow intuitively from the paper)
  • use generic functional builders (e.g. Metalhead.resnet is the underlying function that builds "ResNet-like" models)
  • use multiple dispatch to add convenience constructors that wrap your functional builder

When in doubt, just open a PR! We are more than happy to help review your code to help it align with the rest of the library. After adding a model, you might consider adding some pre-trained weights (see below).

Adding pre-trained weights

To add pre-trained weights for an existing model or new model, you can open a PR. Below, we describe the steps you should follow to get there.

All Metalhead.jl model artifacts are hosted on HuggingFace. You can find the FluxML account here. This documentation from HuggingFace will provide you with an introduction to their ModelHub. In short, the Model Hub is a collection of Git repositories, similar to Julia packages on GitHub. This means you can make a pull request to our HuggingFace repositories to upload updated weight artifacts just like you would make a PR on GitHub to upload code.

  1. Train your model or port the weights from another framework.
  2. Save the model state using BSON.jl with BSON.@save "modelname.bson" model_state=Flux.state(model). It is important that your model is saved under the key model_state.
  3. Compress the saved model as a tarball using tar -cvzf modelname.tar.gz modelname.bson.
  4. Obtain the SHAs (see the Pkg docs; a sketch of how to compute them is shown after this list). Edit the Artifacts.toml file in the Metalhead.jl repository and add an entry for your model. You can leave the URL empty for now.
  5. Open a PR on Metalhead.jl. Be sure to ping a maintainer (e.g. @darsnack or @theabhirath) to let us know that you are adding a pre-trained weight. We will create a model repository on HuggingFace if it does not already exist.
  6. Open a PR to the corresponding HuggingFace repo. Do this by going to the "Community" tab in the HuggingFace repository. PRs and discussions are shown as the same thing in the HuggingFace web app. You can use your local Git program to clone the repo and make PRs if you wish. Check out the guide on PRs to HuggingFace for more information.
  7. Copy the download URL for the model file that you added to HuggingFace. Make sure to grab the URL for a specific commit and not for the main branch.
  8. Update your Metalhead.jl PR by adding the URL to the Artifacts.toml.
  9. If the tests pass for your weights, we will merge your PR! Your model should pass the acctest function in the Metalhead.jl test suite. If your model already exists in the repo, then these tests are already in place, and you can add your model configuration to the PRETRAINED_MODELS list in the runtests.jl file. Please refer to the ResNet tests as an example.
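
To make step 4 concrete, the SHAs can be computed along the lines of the snippet in the Pkg documentation on artifacts (the filename below is a placeholder for the tarball created in step 3):

using Tar, Inflate, SHA

filename = "modelname.tar.gz"   # placeholder name, not a real artifact
println("sha256: ", bytes2hex(open(sha256, filename)))
println("git-tree-sha1: ", Tar.tree_hash(IOBuffer(inflate_gzip(filename))))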

If you want to fix existing weights, then you can follow the same set of steps.

See the scripts/ folder in the repo for some helpful scripts that can be used to automate some of these steps.

diff --git a/dev/howto/resnet/index.html b/dev/howto/resnet/index.html index 6dac49a7..c2582a84 100644 --- a/dev/howto/resnet/index.html +++ b/dev/howto/resnet/index.html @@ -8,4 +8,4 @@ stochastic_depth_prob = 0.2)

To make this a ResNeXt-like model, all we need to do is configure the cardinality and the base width:

custom_resnet = Metalhead.resnet(Metalhead.bottleneck, [3, 4, 6, 3];
                                  cardinality = 32, base_width = 4,
                                  pool_layer = AdaptiveMeanMaxPool((1, 1)),
-                                 stochastic_depth_prob = 0.2)

And we have a custom model, built with minimal effort! The documentation for Metalhead.resnet has been written with extensive care and in as much detail as possible to facilitate ease of use. Still, if you find anything difficult to understand, feel free to open an issue and we will be happy to help you out, and to improve the documentation where necessary.

+ stochastic_depth_prob = 0.2)

And we have a custom model, built with minimal effort! The documentation for Metalhead.resnet has been written with extensive care and in as much detail as possible to facilitate ease of use. Still, if you find anything difficult to understand, feel free to open an issue and we will be happy to help you out, and to improve the documentation where necessary.

diff --git a/dev/index.html b/dev/index.html index 4814a717..667d180f 100644 --- a/dev/index.html +++ b/dev/index.html @@ -1,2 +1,2 @@ -Home · Metalhead.jl

Metalhead

Dev CI Coverage

Metalhead.jl provides standard machine learning vision models for use with Flux.jl. The architectures in this package make use of pure Flux layers, and they represent best practices for creating modules like residual blocks, inception blocks, etc. in Flux. Metalhead also provides some building blocks for more complex models in the Layers module.

Installation

julia> ]add Metalhead

Getting Started

You can find the Metalhead.jl getting started guide here.

Available models

To contribute new models, see our contributing docs.

Image Classification

| Model Name | Constructor | Pre-trained? |
|:--|:--|:--|
| AlexNet | AlexNet | N |
| ConvMixer | ConvMixer | N |
| ConvNeXt | ConvNeXt | N |
| DenseNet | DenseNet | N |
| EfficientNet | EfficientNet | N |
| EfficientNetv2 | EfficientNetv2 | N |
| gMLP | gMLP | N |
| GoogLeNet | GoogLeNet | N |
| Inception-v3 | Inceptionv3 | N |
| Inception-v4 | Inceptionv4 | N |
| InceptionResNet-v2 | InceptionResNetv2 | N |
| MLPMixer | MLPMixer | N |
| MobileNetv1 | MobileNetv1 | N |
| MobileNetv2 | MobileNetv2 | N |
| MobileNetv3 | MobileNetv3 | N |
| MNASNet | MNASNet | N |
| ResMLP | ResMLP | N |
| ResNet | ResNet | Y |
| ResNeXt | ResNeXt | Y |
| SqueezeNet | SqueezeNet | Y |
| Xception | Xception | N |
| WideResNet | WideResNet | Y |
| VGG | VGG | Y |
| Vision Transformer | ViT | Y |

Other Models

| Model Name | Constructor | Pre-trained? |
|:--|:--|:--|
| UNet | UNet | N |

diff --git a/dev/tutorials/pretrained/index.html b/dev/tutorials/pretrained/index.html
diff --git a/dev/tutorials/quickstart/index.html b/dev/tutorials/quickstart/index.html

model = ResNet(18);

The API reference contains the documentation and options for each model function. These models also support loading pre-trained weights for ImageNet.

Note

Metalhead is still under active development and thus not all models have pre-trained weights supported. While we are working on expanding the footprint of the pre-trained models, if you would like to help contribute model weights yourself, please check out the contributing guide.

To use a pre-trained model, just instantiate the model with the pretrain keyword argument set to true:

using Metalhead
   
model = ResNet(18; pretrain = true);
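
As a minimal sketch of what you can do next (the random input is only a placeholder; meaningful predictions require the image preprocessing described in the pretraining guide):

using Flux, Metalhead

model = ResNet(18; pretrain = true);
x = rand(Float32, 224, 224, 3, 1)    # placeholder 224×224 RGB image in WHCN order
Flux.onecold(model(x))               # index of the highest-scoring ImageNet class for the batch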

Refer to the pretraining guide for more details on how to use pre-trained models.

More model configuration options

For users who want to use more options for model configuration, Metalhead provides a "mid-level" API for models. These are the model functions that are in lowercase such as Metalhead.resnet or Metalhead.mobilenetv3. End-users who want to experiment with model architectures should use these functions. These models do not support the option for loading pre-trained weights from ImageNet out of the box, although one can always load weights explicitly using the loadmodel! function from Flux.
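
For example, one way to combine the two APIs is to build a mid-level model and then copy pre-trained weights into it with Flux.loadmodel!. This is only a sketch: it assumes that Metalhead.resnet(Metalhead.basicblock, [2, 2, 2, 2]) with default options produces the same layer structure as ResNet(18), so that the parameter shapes line up, and that the wrapper's Chain is accessible via its layers field.

using Flux, Metalhead

backbone = Metalhead.resnet(Metalhead.basicblock, [2, 2, 2, 2])    # ResNet-18-like configuration
Flux.loadmodel!(backbone, ResNet(18; pretrain = true).layers)      # copy the pre-trained weights across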

To use any of these models, check out the docstrings for the model functions (these are documented in the API reference). Note that these functions typically require more configuration options to be passed in, but offer a lot more flexibility in terms of model architecture. Metalhead defines as many default options as possible so as to make it easier for the user to pick and choose specific options to customise.

Builders for the advanced user

For users who want the ability to customise their models as much as possible, Metalhead offers a powerful low-level interface. These are known as builders and allow the user to hack into the core of models and build them up to their liking. Most users will not need to use builders, since a large number of configuration options are already exposed at the mid-level API. However, for package developers and users who want to build customised versions of their own models, the low-level API provides the customisability required while still reducing user code.
