Lin et al., 2025 - Google Patents
GaussianAvatar: Human avatar Gaussian splatting from monocular videosLin et al., 2025
- Document ID
- 3937029221366579905
- Author
- Lin H
- Zhan Y
- Publication year
- Publication venue
- Computers & Graphics
External Links
Snippet
Many application fields including virtual reality and movie production demand reconstructing high-quality digital human avatars from monocular videos and real-time rendering. However, existing neural radiance field (NeRF)-based methods are costly to train and render. In this …
- 238000000034 method 0 abstract description 90
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/04—Texture mapping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G06T9/001—Model-based coding, e.g. wire frame
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Chen et al. | A survey on 3d gaussian splatting | |
| Zhu et al. | Champ: Controllable and consistent human image animation with 3d parametric guidance | |
| Jiang et al. | Instantavatar: Learning avatars from monocular video in 60 seconds | |
| Tang et al. | Real-time neural radiance talking portrait synthesis via audio-spatial decomposition | |
| Liu et al. | Devrf: Fast deformable voxel radiance fields for dynamic scenes | |
| Bao et al. | 3d gaussian splatting: Survey, technologies, challenges, and opportunities | |
| Li et al. | Gaussianbody: Clothed human reconstruction via 3d gaussian splatting | |
| Liu et al. | Animatable 3d gaussian: Fast and high-quality reconstruction of multiple human avatars | |
| Chen et al. | Meshavatar: Learning high-quality triangular human avatars from multi-view videos | |
| Li et al. | Detailed 3D human body reconstruction from multi-view images combining voxel super-resolution and learned implicit representation | |
| Wang et al. | Look at the sky: Sky-aware efficient 3d gaussian splatting in the wild | |
| Deng et al. | Lumigan: Unconditional generation of relightable 3d human faces | |
| Hu et al. | Humanliff: Layer-wise 3d human generation with diffusion model | |
| CN119722919A (en) | A dynamic human body modeling method based on three-dimensional Gaussian sputtering technology | |
| CN120976449B (en) | A method and system for cross-source data 3D reconstruction based on improved Gaussian sputtering | |
| Li et al. | Diffusion-fof: Single-view clothed human reconstruction via diffusion-based fourier occupancy field | |
| Xiao et al. | NECA: Neural customizable human avatar | |
| Lin et al. | GaussianAvatar: Human avatar Gaussian splatting from monocular videos | |
| CN116452715A (en) | Dynamic hand rendering method, device and storage medium | |
| CN115761801A (en) | Three-dimensional human body posture migration method based on video time sequence information | |
| Peng et al. | RMAvatar: Photorealistic human avatar reconstruction from monocular video based on rectified mesh-embedded Gaussians | |
| Zhang et al. | Mesh-centric gaussian splatting for human avatar modelling with real-time dynamic mesh reconstruction | |
| Shin et al. | Canonicalfusion: Generating drivable 3d human avatars from multiple images | |
| Li et al. | Spa: Sparse photorealistic animation using a single rgb-d camera | |
| Hu et al. | HumanLiff: Layer-wise 3D Human Diffusion Model: S. Hu et al. |