SDE-RAE: CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion
Zhao et al., 2023
- Document ID
- 12503317981933209174
- Author
- Zhao H
- Jin G
- Jiang X
- Li M
- Publication year
- 2023
- Publication venue
- Image and Vision Computing
Snippet
Abstract: Generative Adversarial Networks (GANs) have long dominated the field of image reconstruction and editing. A GAN trains its generator adversarially to fool the discriminator, which pushes the generated images toward high quality. However, this …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/24—Editing, e.g. insert/delete
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zhan et al. | | Multimodal image synthesis and editing: A survey and taxonomy |
| Zhang et al. | | Mmginpainting: Multi-modality guided image inpainting based on diffusion models |
| Song et al. | | AttriDiffuser: Adversarially enhanced diffusion model for text-to-facial attribute image synthesis |
| US20220405583A1 (en) | | Score-based generative modeling in latent space |
| Yang et al. | | Controllable sketch-to-image translation for robust face synthesis |
| Bolkart et al. | | 3D faces in motion: Fully automatic registration and statistical analysis |
| CN113538608A (en) | | Generative Adversarial Network-Based Controllable Character Image Generation Method |
| Zhao et al. | | SDE-RAE: CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion |
| Wu et al. | | DeepPortraitDrawing: Generating human body images from freehand sketches |
| Xia et al. | | Collaborative contrastive learning for cross-domain gaze estimation |
| Liu et al. | | Unified generation, reconstruction, and representation: Generalized diffusion with adaptive latent encoding-decoding |
| Yildirim et al. | | Warping the residuals for image editing with stylegan |
| Yang et al. | | Semantic layout-guided diffusion model for high-fidelity image synthesis in ‘The Thousand Li of Rivers and Mountains’ |
| Sudha et al. | | Semantic image synthesis from text: Current trends and future horizons in text-to-image generation |
| Farooq et al. | | ChildDiffusion: Unlocking the potential of generative AI and controllable augmentations for child facial data using stable diffusion and large language models |
| Mahajan et al. | | Integrating speech-to-text for image generation using generative adversarial networks |
| Nguyen et al. | | Instruction-guided editing controls for images and multimedia: A survey in llm era |
| Du et al. | | One-for-all: towards universal domain translation with a single stylegan |
| Wang et al. | | Generative adversarial text-to-image generation with style image constraint |
| Song et al. | | Face attribute editing based on generative adversarial networks |
| Guo et al. | | An improved stylegan-based texttoface model with local-global information fusion |
| Xu et al. | | Learning region-aware style-content feature transformations for face image beautification |
| Kong et al. | | DualPathGAN: Facial reenacted emotion synthesis |
| Chae et al. | | Semantic image synthesis with unconditional generator |
| Sivuk et al. | | Diverse semantic image editing with style codes |