FACE QUALITY TRANSFORMER: A FACE QUALITY ASSESSMENT AND ENHANCEMENT FRAMEWORK

Image generation quality has reached impressive levels.
Advanced text-to-image models have become
remarkably good at rendering objects and depicting actions with high precision.
However, despite this progress,
the quality of generated faces remains a
critical factor for users. Even the most advanced text-to-image
diffusion models struggle to generate high-quality faces consistently. This
makes face quality estimation one of
the most important metrics for assessing generated images. In this paper, we propose
a hybrid architecture comprising an attention-based Vision Transformer
along with EfficientNet to capture the intrinsic face
deformations produced by image generation models. We also conduct a comparative
analysis of the face quality of state-of-the-art diffusion models, including DALL-E, Flux, Stable Diffusion, and Firefly.
Furthermore, we show that our proposed pipeline can be plugged into image generation
models to correct poor-quality faces in generated images through automated regeneration.
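
The automated regeneration idea described above can be illustrated with a minimal sketch. The abstract does not specify how the pipeline is wired, so everything here is an assumption made for illustration: the function name regenerate_until_acceptable, the generate and score_face callables, a score where higher means better, the 0.5 default threshold, and the strategy of retrying with a different seed are all hypothetical and are not taken from the paper.

```python
from typing import Any, Callable, Tuple


def regenerate_until_acceptable(
    generate: Callable[[str, int], Any],   # hypothetical: (prompt, seed) -> image
    score_face: Callable[[Any], float],    # hypothetical: image -> quality score
    prompt: str,
    threshold: float = 0.5,                # assumed cutoff; higher score = better face
    max_attempts: int = 4,
    base_seed: int = 0,
) -> Tuple[Any, float, int]:
    """Regenerate until the face score reaches the threshold.

    Returns (best_image, best_score, attempts_used). If no attempt reaches
    the threshold, the best-scoring image seen so far is returned.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    best_image, best_score = None, float("-inf")
    for attempt in range(max_attempts):
        # Vary the seed so that each attempt yields a different sample.
        image = generate(prompt, base_seed + attempt)
        score = score_face(image)
        if score > best_score:
            best_image, best_score = image, score
        if score >= threshold:
            return best_image, best_score, attempt + 1
    return best_image, best_score, max_attempts


# Toy stand-ins so the sketch runs without any model: the "image" is a dict
# holding its seed, and the scorer looks up a fixed score per seed.
_TOY_SCORES = [0.2, 0.4, 0.9, 0.1]


def toy_generate(prompt: str, seed: int) -> dict:
    return {"prompt": prompt, "seed": seed}


def toy_score_face(image: dict) -> float:
    return _TOY_SCORES[image["seed"] % len(_TOY_SCORES)]


if __name__ == "__main__":
    image, score, attempts = regenerate_until_acceptable(
        toy_generate, toy_score_face, "portrait of a person", threshold=0.8
    )
    print(image["seed"], score, attempts)
```

In a real setting, generate would wrap a text-to-image model and score_face would wrap a trained face quality estimator; capping the number of attempts keeps generation cost bounded when a prompt rarely yields an acceptable face.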

FaceQ_transformer.zip

Description of Database/Benchmark
