Intrinsic Image Harmonization

Last update: Dec 21, 2022

Related tags

Deep Learning IntrinsicHarmony

Overview

Intrinsic Image Harmonization [Paper]

Zonghui Guo, Haiyong Zheng, Yufeng Jiang, Zhaorui Gu, Bing Zheng

Here we provide PyTorch implementation and the trained model of our framework.

Prerequisites

Linux
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Train/Test

Download iHarmony4 dataset, and our HVIDIT dataset Google Drive or BaiduCloud (access code: akbi).
Train a model:

CUDA_VISIBLE_DEVICES=0 python train.py --model retinexltifpm  --name retinexltifpm_allihd  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Test the model

CUDA_VISIBLE_DEVICES=0 python test.py --model retinexltifpm  --name retinexltifpm_allihd  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Apply a pre-trained model

Download the pretrained model from Google Drive or BaiduCloud (access code: 20m6), and put net_G.pth in the directory checkpoints/experiment. Run:

CUDA_VISIBLE_DEVICES=0 python test.py --model retinexltifpm  --name experiment  --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx

Evaluation

We provide the code in ih_evaluation.py. Run:

CUDA_VISIBLE_DEVICES=0 python evaluation/ih_evaluation.py --dataroot <dataset_dir> --result_root  results/experiment/test_latest/images/ --evaluation_type our --dataset_name ALL

Quantitative Result

Dataset	Metrics	Composite	Ours (iHarmony4)	Ours (iHarmony4+HVIDIT)
HCOCO	PSNR MSE fMSE	33.99 69.37 996.59	37.61 23.25 386.39	37.77 21.84 367.38
HAdobe5k	PSNR MSE fMSE	28.52 345.54 2051.61	36.20 42.21 296.76	36.49 39.53 266.49
HFlickr	PSNR MSE fMSE	28.43 264.35 1574.37	31.74 100.86 676.71	32.08 96.87 635.60
Hday2night	PSNR MSE fMSE	34.36 109.65 1409.98	36.48 50.64 755.88	36.60 50.37 763.33
HVIDIT	PSNR MSE fMSE	38.72 53.12 1604.41	- - -	41.83 22.49 691.06
ALL	PSNR MSE fMSE	32.07 167.39 1386.12	36.53 37.95 399.34	36.96 35.33 388.50

Bibtex

If you use this code for your research, please cite our papers.

@InProceedings{Guo_2021_CVPR,
    author    = {Guo, Zonghui and Zheng, Haiyong and Jiang, Yufeng and Gu, Zhaorui and Zheng, Bing},
    title     = {Intrinsic Image Harmonization},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {16367-16376}
}

Acknowledgement

For some of the data modules and model functions used in this source code, we need to acknowledge the repo of DoveNet and CycleGAN.

You might also like...

python library for invisible image watermark (blind image watermark)

invisible-watermark invisible-watermark is a python library and command line tool for creating invisible watermark over image.(aka. blink image waterm

572 Jan 7, 2023

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

AOT-GAN for High-Resolution Image Inpainting Arxiv Paper | AOT-GAN: Aggregated Contextual Transformations for High-Resolution Image Inpainting Yanhong

214 Jan 3, 2023

Code for Dual Contrastive Learning for Unsupervised Image-to-Image Translation, NTIRE, CVPRW 2021.

arXiv Dual Contrastive Learning Adversarial Generative Networks (DCLGAN) We provide our PyTorch implementation of DCLGAN, which is a simple yet powerf

119 Dec 4, 2022

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Deep Image Search - AI-Based Image Search Engine Deep Image Search is an AI-based image search engine that includes deep transfer learning features Ex

139 Jan 1, 2023

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

ImageProcessingTransformer Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

61 Jan 1, 2023

[CVPR 2021] Teachers Do More Than Teach: Compressing Image-to-Image Models (CAT)

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set —— PyTorch implementation This is an unofficial offici

833 Dec 28, 2022

Comments

Model Inference

Hello, is there a way to infer the model by reading an image and passing the image and its mask to the model and getting the harmonized output? Without the need to store the image's path in a text file and reading it from the text file then loading the image?

opened by AhmedHashish123 2
visdom interface is blank

first，thanks for your excellent work！ When I execute the training code, the visdom interface does not display the result picture and the training loss. it works when I execute the code of dovenet. could you tell me how to solve this problem? thanks again

opened by Ligouhi 0

Releases(v1.0)

v1.0(Feb 9, 2022)

Code version of our CVPR work [Paper].
Source code(tar.gz)
Source code(zip)

Intrinsic Image Harmonization

Related tags

Overview

Intrinsic Image Harmonization [Paper]

Prerequisites

Train/Test

Apply a pre-trained model

Evaluation

Quantitative Result

Bibtex

Acknowledgement

You might also like...

python library for invisible image watermark (blind image watermark)

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

Code for Dual Contrastive Learning for Unsupervised Image-to-Image Translation, NTIRE, CVPRW 2021.

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

[CVPR 2021] Teachers Do More Than Teach: Compressing Image-to-Image Models (CAT)

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Comments

Model Inference

visdom interface is blank

Releases(v1.0)

v1.0(Feb 9, 2022)

Owner

VISION @ OUC

Labels4Free: Unsupervised Segmentation using StyleGAN

A Large-Scale Dataset for Spinal Vertebrae Segmentation in Computed Tomography

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

PyTorch version implementation of DORN

Evaluating saliency methods on artificial data with different background types

source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT

A geometric deep learning pipeline for predicting protein interface contacts.

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.

Rethinking Transformer-based Set Prediction for Object Detection

HNECV: Heterogeneous Network Embedding via Cloud model and Variational inference

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)

Human Pose Detection on EdgeTPU

Attention-based Transformation from Latent Features to Point Clouds (AAAI 2022)

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

sense-py-AnishaBaishya created by GitHub Classroom

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Repo for the Video Person Clustering dataset, and code for the associated paper

A synthetic texture-invariant dataset for object detection of UAVs