Zhao et al., 2023 - Google Patents

SDE-RAE: CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion

Document ID: 12503317981933209174
Author: Zhao H; Jin G; Jiang X; Li M
Publication year: 2023
Publication venue: Image and Vision Computing

Snippet

Abstract: Generative Adversarial Networks (GANs) have long dominated the field of image reconstruction and editing. A GAN trains its generator adversarially so that it can fool the discriminator, which pushes the generated images toward high quality. However, this …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/24—Editing, e.g. insert/delete
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image

Similar Documents

Publication	Publication Date	Title
Zhan et al.	2023	Multimodal image synthesis and editing: A survey and taxonomy
Zhang et al.	2024	MMGInpainting: Multi-modality guided image inpainting based on diffusion models
Song et al.	2025	AttriDiffuser: Adversarially enhanced diffusion model for text-to-facial attribute image synthesis
US20220405583A1 (en)	2022-12-22	Score-based generative modeling in latent space
Yang et al.	2021	Controllable sketch-to-image translation for robust face synthesis
Bolkart et al.	2015	3D faces in motion: Fully automatic registration and statistical analysis
CN113538608A (en)	2021-10-22	Generative Adversarial Network-Based Controllable Character Image Generation Method
Zhao et al.	2023	SDE-RAE: CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion
Wu et al.	2023	DeepPortraitDrawing: Generating human body images from freehand sketches
Xia et al.	2025	Collaborative contrastive learning for cross-domain gaze estimation
Liu et al.	2024	Unified generation, reconstruction, and representation: Generalized diffusion with adaptive latent encoding-decoding
Yildirim et al.	2025	Warping the residuals for image editing with StyleGAN
Yang et al.	2025	Semantic layout-guided diffusion model for high-fidelity image synthesis in ‘The Thousand Li of Rivers and Mountains’
Sudha et al.	2024	Semantic image synthesis from text: Current trends and future horizons in text-to-image generation
Farooq et al.	2025	ChildDiffusion: Unlocking the potential of generative AI and controllable augmentations for child facial data using stable diffusion and large language models
Mahajan et al.	2025	Integrating speech-to-text for image generation using generative adversarial networks
Nguyen et al.	2024	Instruction-guided editing controls for images and multimedia: A survey in LLM era
Du et al.	2025	One-for-all: towards universal domain translation with a single StyleGAN
Wang et al.	2023	Generative adversarial text-to-image generation with style image constraint
Song et al.	2020	Face attribute editing based on generative adversarial networks
Guo et al.	2024	An improved StyleGAN-based TextToFace model with local-global information fusion
Xu et al.	2026	Learning region-aware style-content feature transformations for face image beautification
Kong et al.	2021	DualPathGAN: Facial reenacted emotion synthesis
Chae et al.	2023	Semantic image synthesis with unconditional generator
Sivuk et al.	2025	Diverse semantic image editing with style codes