PhD in Information Engineering
The Chinese University of Hong Kong, 2023
CameraCtrl: Enabling Camera Control for Video Diffusion Models
Conference paper
Edicho: Consistent Image Editing in the Wild
Conference paper
Interspatial Attention for Efficient 4D Human Video Generation
Article
FLARE: Feed-Forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
Conference paper
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
Conference paper
GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling
Conference paper
Video World Models with Long-term Spatial Memory
Conference paper
Representing Long Volumetric Video with Temporal Gaussian Hierarchy
Article
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
Conference paper
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Conference paper
Efficient 3D Articulated Human Generation with Layered Surface Volumes
Conference paper
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Conference paper
Flow as the Cross-Domain Manipulation Interface
Conference paper
Gaussian Shell Maps for Efficient 3D Human Generation
Conference paper
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Conference paper
One-Shot Generative Domain Adaptation
Conference paper
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction
Conference paper
Real-Time 3D-Aware Portrait Editing from a Single Image
Conference paper
Towards Text-guided 3D Scene Composition
Conference paper
GH-Feat: Learning Versatile Generative Hierarchical Features From GANs
Article
Conference paper
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase
Conference paper
Conference paper
GLeaD: Improving GANs with A Generator-Leading Task
Conference paper
Learning 3D-Aware Image Synthesis with Unknown Pose Distribution
Conference paper
Learning Modulated Transformation in GANs
Conference paper
Towards Smooth Video Composition
Conference paper
3D-aware Image Synthesis via Learning Structural and Textural Representations
Conference paper
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Conference paper
High-Fidelity GAN Inversion with Padding Space
Conference paper
Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
Conference paper
Improving GAN Equilibrium by Raising Spatial Awareness
Conference paper
Improving GANs with A Dynamic Discriminator
Conference paper
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
Conference paper
Region-Based Semantic Factorization in GANs
Conference paper
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
Conference paper
Compconv: A compact convolution module for efficient feature learning
Conference paper
Data-Efficient Instance Generation from Instance Discrimination
Conference paper
Conference paper
Generative Hierarchical Features from Synthesizing Images
Conference paper
Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering
Conference paper
Conference paper
A main/subsidiary network framework for simplifying binary neural networks
Conference paper
Dense RepPoints: Representing Visual Objects with Dense Point Sets
Conference paper
Temporal pyramid network for action recognition
Conference paper
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Efficient 3D Articulated Human Generation with Layered Surface Volumes
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction
3D-aware Image Synthesis via Learning Structural and Textural Representations
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
A main/subsidiary network framework for simplifying binary neural networks
Conference paper
Dense RepPoints: Representing Visual Objects with Dense Point Sets
Conference paper
Temporal pyramid network for action recognition
Conference paper
Update your browser to view this website correctly. Update your browser now