Blind face restoration via diffusion prior is a highly ill-posed problem, which suffers from 1) slow training and inference speed, and 2) failure in preserving the original identity and recovering fine-grained facial details. In this work, we propose WaveFace to solve the problems in the frequency domain, where low- and high-frequency components decomposed by wavelet transformation are considered individually to maximize authenticity as well as efficiency. The diffusion model is applied to recover the low-frequency component only, which presents general information of the original image but 1/16 in size. To preserve the original identity, the generation is conditioned on the low-frequency component of low-quality images at each denoising step. Meanwhile, high-frequency components at multiple decomposition levels are handled by an unified network, which recovers complex facial details in a single step. Evaluations on four benchmark datasets show that: 1) WaveFace outperforms state-of-the-art methods in the authenticity, especially in terms of identity preservation, and 2) authentic images are restored with the efficiency 10x faster than existing diffusion model-based BFR methods.
We provide the synthetic CelebA-Test used in our paper and our restoration results. Please download to check more results.
@article{miao2024waveface,
author = {Miao, Yunqi and Deng, Jiankang and Han, Jungong},
title = {WaveFace: Authentic Face Restoration with Efficient Frequency Recovery},
journal = {CVPR},
year = {2024}
}