

[PG 2023] WaveNet: Wave-Aware Image Enhancement

Jiachen Dang, Zehao Li, Yong Zhong, Lishun Wang

[Official paper] [Official paper slides]

News

  • 11.09, 2023: Training code and logs have been released. Feel free to use them. 🚀
  • 10.12, 2023: Code and pre-trained models have been released! ⭐
  • 09.15, 2023: Our paper has been accepted by PG 2023. Code and models will be released. 🎉

Abstract: As a low-level vision task, image enhancement is widely used in various computer vision applications. Recently, multiple methods combined with CNNs, MLP, Transformer, and the Fourier transform have achieved promising results on image enhancement tasks. However, these methods cannot achieve a balance between accuracy and computational cost. In this paper, we formulate the enhancement into a signal modulation problem and propose the WaveNet architecture, which performs well in various parameters and improves the feature expression using wave-like feature representation. Specifically, to better capture wave-like feature representations, we propose to represent a pixel as a sampled value of a signal function with three wave functions (Cosine Wave (CW), Sine Wave (SW), and Gating Wave (GW)) inspired by the Fourier transform. The amplitude and phase are required to generate the wave-like features. The amplitude term includes the original contents of features, and the phase term modulates the relationship between various inputs and fixed weights. To dynamically obtain the phase and the amplitude, we build the Wave Transform Block (WTB) that adaptively generates the waves and modulates the wave superposition mode. Based on the WTB, we establish an effective architecture WaveNet for image enhancement. Extensive experiments on six real-world datasets show that our model achieves better quantitative and qualitative results than state-of-the-art methods.
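The amplitude/phase idea in the abstract can be illustrated with a small sketch. This is not the Wave Transform Block from the paper: it only shows how an amplitude and a phase yield a cosine and a sine component, and the sigmoid gate that mixes them (`gated_superposition`, `gate_logits`) is a hypothetical stand-in for the Gating Wave, assumed here purely for illustration.

```python
import math


def wave_features(amplitude, phase):
    """Return (cosine_wave, sine_wave) for paired amplitude/phase values.

    Each feature is treated as a wave a * exp(i * theta), whose real and
    imaginary parts are a * cos(theta) and a * sin(theta).
    """
    cos_wave = [a * math.cos(p) for a, p in zip(amplitude, phase)]
    sin_wave = [a * math.sin(p) for a, p in zip(amplitude, phase)]
    return cos_wave, sin_wave


def gated_superposition(cos_wave, sin_wave, gate_logits):
    """Mix the two waves with a sigmoid gate.

    ASSUMPTION: this gate is made up for the example; the paper's Gating
    Wave and superposition mode may work differently.
    """
    mixed = []
    for c, s, g in zip(cos_wave, sin_wave, gate_logits):
        w = 1.0 / (1.0 + math.exp(-g))  # gate weight in (0, 1)
        mixed.append(w * c + (1.0 - w) * s)
    return mixed


if __name__ == "__main__":
    cw, sw = wave_features([2.0, 1.0], [0.0, math.pi / 2])
    print(gated_superposition(cw, sw, [0.0, 0.0]))
```

With a phase of 0 the feature is carried entirely by the cosine component, and with a phase of π/2 entirely by the sine component, so the phase decides how the amplitude (the feature content) is split between the two waves.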


Pipeline

[Figure: WaveNet pipeline]

Installation

See INSTALL.md for the installation of dependencies required to run WaveNet.

Quick Run (Demo)

Test in a local environment:

To enhance your own images with the pre-trained models, run:

cd basicsr

python demo.py --input_dir images_folder_path --result_dir save_images_here --weights path_to_models

All pre-trained models can be found in the folder "checkpoints/<dataset>".
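To run the demo over several folders or checkpoints, the command above can be wrapped in a short script. The sketch below is an assumption, not part of the repository: the helper names `build_demo_command` and `run_demo` are hypothetical, and it uses only the three flags shown above.

```python
import subprocess
import sys


def build_demo_command(input_dir, result_dir, weights):
    """Assemble the demo.py invocation shown above as an argument list."""
    return [
        sys.executable, "demo.py",
        "--input_dir", str(input_dir),
        "--result_dir", str(result_dir),
        "--weights", str(weights),
    ]


def run_demo(input_dir, result_dir, weights, basicsr_dir="basicsr"):
    """Run demo.py from inside the basicsr folder (mirrors `cd basicsr`).

    Relative paths are resolved against basicsr_dir, so absolute paths are
    the safer choice. Raises CalledProcessError if demo.py exits non-zero.
    """
    cmd = build_demo_command(input_dir, result_dir, weights)
    return subprocess.run(cmd, cwd=basicsr_dir, check=True)


if __name__ == "__main__":
    # Placeholder arguments taken from the command above; replace them with
    # real paths before calling run_demo().
    print(" ".join(build_demo_command(
        "images_folder_path", "save_images_here", "path_to_models")))
```

Passing the arguments as a list rather than one shell string avoids quoting problems when a path contains spaces.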

Train

Training instructions for image enhancement are provided in TRAINING.md. The summary table below contains hyperlinks to the weights and training logs for easy navigation:

Model       LOL | log        FiveK | log      SID | log
WaveNet-B   weights | log    weights | log    weights | log
WaveNet-S   weights | log    weights | log    weights | log
WaveNet-T   weights | log    weights | log    weights | log

Dataset

For dataset preparation, see datasets/README.md.

Test (Evaluation)

  • To compute the PSNR, SSIM, and LPIPS of the enhanced images, see evaluation.py and run:
cd basicsr

python evaluation.py -dirA images_folder_path -dirB images_folder_path -type image_data_type --use_gpu use_gpu_or_not
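For reference, PSNR is defined as 10·log10(MAX² / MSE), where MSE is the mean squared error between the two images and MAX is the largest possible pixel value. The sketch below applies that textbook definition to flat lists of pixel values. It is not the code in evaluation.py, which may differ in details such as value range or color space, and the function name `psnr` is chosen here for illustration.

```python
import math


def psnr(reference, enhanced, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Both images are given as flat sequences of pixel values in the range
    [0, max_value]. Identical images give infinity.
    """
    if len(reference) != len(enhanced):
        raise ValueError("images must contain the same number of pixels")
    if len(reference) == 0:
        raise ValueError("images must not be empty")
    mse = sum((r - e) ** 2 for r, e in zip(reference, enhanced)) / len(reference)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    print(psnr([0, 0], [0, 255]))
```

A higher PSNR means the enhanced image is closer to the reference. SSIM and LPIPS measure structural and perceptual similarity and are not covered by this sketch.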

Result

Quantitative Evaluation on LOL/FiveK/VE-LOL/SID/SICE/DARK FACE datasets.

[Figures: quantitative results]

Qualitative Evaluation on LOL/FiveK/VE-LOL/SID/SICE/DARK FACE datasets.

[Figure: qualitative results]

Efficiency (tested on a single RTX 3090, PyTorch 1.11)

[Figure: efficiency comparison]

Citation

@inproceedings {10.2312:pg.20231267,
booktitle = {Pacific Graphics Short Papers and Posters},
editor = {Chaine, Raphaëlle and Deng, Zhigang and Kim, Min H.},
title = {{WaveNet: Wave-Aware Image Enhancement}},
author = {Dang, Jiachen and Li, Zehao and Zhong, Yong and Wang, Lishun},
year = {2023},
publisher = {The Eurographics Association},
ISBN = {978-3-03868-234-9},
DOI = {10.2312/pg.20231267}
} 

Contact

Should you have any questions, please contact [email protected]

Acknowledgment: This code is based on the BasicSR toolbox.
