web-realesrgan

Run Real-ESRGAN/Real-CUGAN in the browser with tensorflow.js

Usage

Drag in or select an image, then choose a model and WebGPU/WebGL to upscale the image.

Sure, PNG with alpha channel is supported.

Online Demo

https://cappuccino.moe

Accelaration

WebGL: Enabled on most devices by default.
WebGPU: Enabled on Chrome on most devices. Much faster than WebGL. If you encounter issues. Try chrome://flags/#enable-unsafe-webgpu flag or other browsers with WebGPU support.

All models are quantized to FP16, which reduces the size of the model download and has no noticeable difference in performance compared to FP32 models. However, currently, the computation speed of FP16 is the same as FP32. Future updates from the TensorFlow.js team are expected to improve FP16 computation performance on WebGPU.

🚀️ ️Advantages

Accessible Anywhere: The tool can be conveniently run on any device, such as a mobile phone📱️, without needing to download or install any software. This makes it possible to upscale images anytime, anywhere, directly in your browser.

⚠️️ Limitations

Performance Gap: Even with WebGPU enabled, the performance of running models in the browser is slower than running them locally. WebGPU’s speed is approximately half of the speed you’d achieve with local execution. On local devices, FP16 computation provides significant acceleration, whereas in tfjs-webgpu for now, FP16 and FP32 run at the same speed.

If you need to upscale a lot of images, it is recommended to download and use the official Real-ESRGAN and Real-CUGAN repositories for local execution.

Development

npm install
npm run dev

WebGPU cannot be used over HTTP. If you're developing this project locally, you can use OpenSSL to self-sign a certificate.

See https://developer.chrome.com/docs/web-platform/webgpu/troubleshooting-tips

Models

The first run will download the models and cache them in the browser indexedDB. No need to download them again.

Unfortunately, since the transformed model must determine the input size, you need to download the model separately for each different tile size.

I've converted 4 models to tensorflow.js format, you can find the original pytorch models in xinntao/Real-ESRGAN

anime_fast RealESRGAN-animevideov3
anime_plus RealESRGAN_x4plus_anime_6B
general_fast RealESRGAN-general-x4v3
general_plus RealESRGAN_x4plus

Now supports bilibili/ailab/Real-CUGAN🎉🎉🎉, which offers nearly the same restoration quality as Real-ESRGAN but is up to 5x-10x faster. 🥳🥳🥳

Details

Type	Model	Scale	Dtype	Size	Denoise Level
Real-ESRGAN	anime_fast	4	fp16	1.3 MB	not adjustable
Real-ESRGAN	anime_plus	4	fp16	9.2 MB	not adjustable
Real-ESRGAN	general_fast	4	fp16	2.5 MB	not adjustable
Real-ESRGAN	general_plus	4	fp16	34.2 MB	not adjustable
Real-CUGAN	2x	2	fp16	2.6 MB	conservative,0x,1x,2x,3x
Real-CUGAN	4x	4	fp16	2.9 MB	conservative,0x,3x

There is something wrong with Real-CUGAN 3x series models. 3x models are not available now.

Best Practice

Model Selection Guide

With nearly the same restoration quality, Real-CUGAN is much faster than Real-ESRGAN. Real-CUGAN is a better choice in most cases.

But Real-ESRGAN may be better when the input image is very small. Here, input is a 120x120 image.

Understanding Parameters

Denoise: "Denoise" refers to the process of reducing or removing noise from an image or audio signal. Noise can be any unwanted or random interference that distorts the clarity, like graininess in an image or background static in audio.
- Conservative: This usually indicates a mild or conservative noise reduction approach, meaning it tries to keep the original details intact and avoids over-smoothing.
- No-denoise: No noise reduction is applied, so the original noise and details are preserved.
- Denoise1x: This refers to a light noise reduction, applying minimal processing to remove only some of the noise.
- Denoise2x: This represents a moderate level of noise reduction, stronger than 1x, removing more noise but still maintaining some balance with the details.
- Denoise3x: This is a strong noise reduction, which will remove most or all of the noise, but it might also cause some loss of fine details, making the image smoother.
Tile Size: The image is not enlarged by the model as a whole; instead, it is split into tiles. The model enlarges each tile sequentially, and then all the tiles are stitched together to form the entire image.
- On WebGL, larger tile sizes can make your device appear to lag during execution. If your device becomes laggy, you can reduce the tile size.
- On WebGPU, larger tile sizes can speed up the entire process.
- If your GPU can handle it, the larger the tile size, the faster the overall computation. Otherwise, smaller slices would be faster.
Example: 120x120 image cut to 64x64 tiles

Overlap: The default min_overlap is 12. When the image is split into tiles, adjacent tiles need to have overlap. Without overlap, the boundaries between two tiles can appear disconnected, resulting in a lack of smooth transitions.
- You can set the overlap to 0 to observe this phenomenon.
- If you notice horizontal or vertical dividing lines after enlarging the image, you can increase the overlap to improve the seamlessness.

Credits

Pytorch model -> ONNX -> Tensorflow saved model -> Tensorflow.js

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
demo		demo
public		public
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
env.d.ts		env.d.ts
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

web-realesrgan

Usage

Online Demo

Accelaration

🚀️ ️Advantages

⚠️️ Limitations

Development

Models

Details

Best Practice

Model Selection Guide

Understanding Parameters

Credits

About

Releases 1

Packages

Languages

License

xororz/web-realesrgan

Folders and files

Latest commit

History

Repository files navigation

web-realesrgan

Usage

Online Demo

Accelaration

🚀️ ️Advantages

⚠️️ Limitations

Development

Models

Details

Best Practice

Model Selection Guide

Understanding Parameters

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages