PhD in Computer Science
Stanford University, 2017
Article
In-Domain GAN Inversion for Faithful Reconstruction and Editability
Article
Open-Vocabulary Category-Level Object Pose and Size Estimation
Article
A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging
Conference paper
Automatic Controllable Colorization via Imagination
Conference paper
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Conference paper
Conference paper
Conference paper
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
Conference paper
Gaussian Shell Maps for Efficient 3D Human Generation
Conference paper
HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation
Conference paper
Learning High-Resolution Vector Representation from Multi-camera Images for 3D Object Detection
Conference paper
Robust Depth Enhancement via Polarization Prompt Fusion Tuning
Conference paper
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
Conference paper
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Conference paper
Deep Video Prior for Video Consistency and Propagation
Article
Defending ChatGPT against jailbreak attack via self-reminders
Article
Article
Learn to Grasp via Intention Discovery and its Application to Challenging Clutter
Article
Robust Reflection Removal With Flash-Only Cues in the Wild
Article
4D Panoptic Scene Graph Generation
Conference paper
AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections
Conference paper
Blind Video Deflickering by Neural Filtering with a Flawed Atlas
Conference paper
Bootstrap Motion Forecasting With Self-Consistent Constraints
Conference paper
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
Conference paper
DYNAFED: Tackling Client Data Heterogeneity with Global Dynamics
Conference paper
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
Conference paper
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
Conference paper
Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer
Conference paper
Conference paper
High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization
Conference paper
Human MotionFormer: Transferring Human Motions with Vision Transformers
Conference paper
Improving Video Super-Resolution with Long-Term Self-Exemplars
Conference paper
Learning 3D-Aware Image Synthesis with Unknown Pose Distribution
Conference paper
LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
Conference paper
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Conference paper
Neural Image Popularity Assessment with Retrieval-augmented Transformer
Conference paper
Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning
Conference paper
Real-time 6K Image Rescaling with Rate-distortion Optimization
Conference paper
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
Conference paper
Rotating without Seeing: Towards In-hand Dexterity through Touch
Conference paper
Scene-level Point Cloud Colorization with Semantic-and-Geometric-aware Networks
Conference paper
TextDiffuser: Diffusion Models as Text Painters
Conference paper
The devil is in the wrongly-classified samples: towards unified open-set recognition
Conference paper
Video Waterdrop Removal via Spatio-Temporal Fusion in Driving Scenes
Conference paper
Physics Assisted Deep Learning for Indoor Imaging using Phaseless Wi-Fi Measurements
Article
3D-Aware Indoor Scene Synthesis with Depth Priors
Conference paper
A Categorized Reflection Removal Dataset with Diverse Real-world Scenes
Conference paper
A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes
Conference paper
A Well-aligned Dataset for Learning Image Signal Processing on Smartphones from a High-end Camera
Conference paper
AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars
Conference paper
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation
Conference paper
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset
Conference paper
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Conference paper
Composite Photograph Harmonization with Complete Background Cues
Conference paper
Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks
Conference paper
FS6D: Few-Shot 6D Pose Estimation of Novel Objects
Conference paper
Conference paper
High-Fidelity GAN Inversion for Image Attribute Editing
Conference paper
Improving 3D-aware image synthesis with a geometry-aware discriminator
Conference paper
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Conference paper
Optimizing Image Compression via Joint Learning with Denoising
Conference paper
Optimizing Video Prediction via Video Frame Interpolation
Conference paper
Planning for Sample Efficient Imitation Learning
Conference paper
Point Cloud Compression with Sibling Context and Surface Priors
Conference paper
Real-Time Neural Character Rendering with Pose-Guided Multiplane Images
Conference paper
Real-time Streaming Video Denoising with Bidirectional Buffers
Conference paper
Region-Based Semantic Factorization in GANs
Conference paper
Restorable Image Operators with Quasi-Invertible Networks
Conference paper
Shape from Polarization for Complex Scenes in the Wild
Conference paper
Volumetric-based Contact Point Detection for 7-DoF Grasping
Conference paper
Learning to Denoise Astronomical Images with U-nets
Article
DRINet: A Dual-Representation Iterative Learning Network for Point Cloud Segmentation
Conference paper
Dual-Camera Super-Resolution with Aligned Attention Modules
Conference paper
Embedding Novel Views in a Single JPEG Image
Conference paper
Enhanced Invertible Encoding for Learned Image Compression
Conference paper
Evaluating Adversarial Robustness in Simulated Cerebellum
Conference paper
FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation
Conference paper
IICNet: A Generic Framework for Reversible Image Conversion
Conference paper
Image Inpainting with External-internal Learning and Monochromic Bottleneck
Conference paper
Internal Video Inpainting by Implicit Long-range Propagation
Conference paper
Invertible Image Signal Processing
Conference paper
Involution: Inverting the Inherence of Convolution for Visual Recognition
Conference paper
Joint Depth and Normal Estimation from Real-world Time-of-flight Raw Data
Conference paper
Learning to Predict Vehicle Trajectories with Model-based Planning
Conference paper
Conference paper
Conference paper
Normalized Human Pose Features for Human Action Video Alignment
Conference paper
Robust Federated Learning with Attack-Adaptive Aggregation
Conference paper
Robust Reflection Removal with Reflection-free Flash-only Cues
Conference paper
Safety-aware Motion Prediction with Unseen Vehicles for Autonomous Driving
Conference paper
SinIR: Efficient General Image Manipulation with Single Image Reconstruction
Conference paper
Stereo Matching by Self-supervision of Multiscopic Vision
Conference paper
Stereo Waterdrop Removal with Row-wise Dilated Attention
Conference paper
TPCN: Temporal Point Cloud Networks for Motion Forecasting
Conference paper
Unsupervised Portrait Shadow Removal via Generative Priors
Conference paper
MFuseNet: Robust Depth Estimation with Learned Multiscopic Fusion
Article
Blind Video Temporal Consistency via Deep Video Prior
Conference paper
Deep Reinforced Attention Learning for Quality-Aware Visual Recognition
Conference paper
Depth Sensing Beyond LiDAR Range
Conference paper
Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
Conference paper
Fully Convolutional Networks for Continuous Sign Language Recognition
Conference paper
Future Video Synthesis with Object Motion Prediction
Conference paper
Learning to Learn Parameterized Classification Networks for Scalable Input Images
Conference paper
PiP: Planning-Informed Trajectory Prediction for Autonomous Driving
Conference paper
Polarized Reflection Removal with Perfect Alignment in the Wild
Conference paper
PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer
Conference paper
Self-supervised Dance Video Synthesis Conditioned on Music
Conference paper
Self-supervised Object Tracking with Cycle-consistent Siamese Networks
Conference paper
Video Depth Estimation by Fusing Flow-to-Depth Proposals
Conference paper
3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis
Conference paper
Conference paper
Fully Automatic Video Colorization with Self-Regularization and Diversity
Conference paper
Hiding Video in Audio via Reversible Generative Models
Conference paper
LeapDetect: An agile platform for inspecting power transmission lines from drones
Conference paper
Conference paper
Speech Denoising with Deep Feature Losses
Conference paper
Conference paper
Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search
Conference paper
Interactive Image Segmentation with Latent Diversity
Conference paper
Conference paper
Single Image Reflection Separation with Perceptual Losses
Conference paper
Fast Image Processing with Fully-Convolutional Networks
Conference paper
Photographic Image Synthesis with Cascaded Refinement Networks
Conference paper
A redox-flow battery with an alloxazine-based organic electrolyte
Article
Dense Monocular Depth Estimation in Complex Dynamic Scenes
Conference paper
Full Flow: Optical Flow Estimation By Global Optimization over Regular Grids
Conference paper
Robust Nonrigid Registration by Convex Optimization
Conference paper
Fast MRF Optimization with Application to Depth Reconstruction
Conference paper
Article
A Simple Model for Intrinsic Image Decomposition with Depth Cues
Conference paper
Motion-aware KNN Laplacian for Video Matting
Conference paper
Conference paper
COMP4471 | Deep Learning in Computer Vision
COMP4971A | Independent Work
COMP4981 | Final Year Project
EESM5900V | Deep Learning for Vision and Multimodal Data
ELEC4240 | Deep Learning in Computer Vision
UROP1100N | Undergraduate Research Opportunities Series 1
UROP2100N | Undergraduate Research Opportunities Series 2
UROP3100N | Undergraduate Research Opportunities Series 3
UROP3200 | Undergraduate Research Opportunities with Mini-conference Experience
COMP4981 | Final Year Project
UROP1000 | Undergraduate Research Opportunities
UROP1100M | Undergraduate Research Opportunities Series 1
UROP2100M | Undergraduate Research Opportunities Series 2
COMP3071 | Honors Competitive Programming
COMP4981 | Final Year Project
COMP5214 | Advanced Deep Learning Architectures
COMP6921N | Research Project
CPEG4901 | Computer Engineering Final Year Project in COMP
CPEG4910 | Co-op Program
ELEC5680 | Advanced Deep Learning Architectures
UROP1100L | Undergraduate Research Opportunities Series 1
UROP3100L | Undergraduate Research Opportunities Series 3
COMP4471 | Deep Learning in Computer Vision
COMP4971A | Independent Work
COMP4981 | Final Year Project
COMP6921N | Research Project
ELEC4240 | Deep Learning in Computer Vision
UROP1100K | Undergraduate Research Opportunities Series 1
UROP2100K | Undergraduate Research Opportunities Series 2
COMP4971A | Independent Work
COMP4981 | Final Year Project
UROP1000 | Undergraduate Research Opportunities
UROP1100J | Undergraduate Research Opportunities Series 1
UROP2100J | Undergraduate Research Opportunities Series 2
No Teaching Assignments
CHAN, Ho Shu
(co-supervision)
Electronic and Computer Engineering
CHEN, Junming
Individualized Interdisciplinary Program (Artificial Intelligence)
LAU, Yuen Fui
Computer Science and Engineering
LIU, Zichen
Computer Science and Engineering
MA, Yue
Computer Science and Engineering
PHAM, Trung Kien
(co-supervision)
Computer Science and Engineering
YU, Runyi
Computer Science and Engineering
BAI, Qingyan
Computer Science and Engineering
CHEN, Zhili
Computer Science and Engineering
LIU, Runtao
Computer Science and Engineering
LIU, Zhaoyang
Computer Science and Engineering
RAO, Zhefan
Computer Science and Engineering
YANG, Xin
(co-supervision)
Individualized Interdisciplinary Program (Artificial Intelligence)
CAI, Junhao
Individualized Interdisciplinary Program (Robotics and Autonomous Systems)
CHEN, Jingye
Computer Science and Engineering
FANG, Biqing
(co-supervision)
Individualized Interdisciplinary Program (Artificial Intelligence)
HE, Yingqing
Individualized Interdisciplinary Program (Robotics and Autonomous Systems)
LI, Mingdong
(co-supervision)
Individualized Interdisciplinary Program (Robotics and Autonomous Systems)
LIU, Hongji
(co-supervision)
Individualized Interdisciplinary Program (Robotics and Autonomous Systems)
LIU, Hongyu
Computer Science and Engineering
QIAN, Zian
Computer Science and Engineering
ZHAO, Chao
Electronic and Computer Engineering
WEN, Qiang
Computer Science and Engineering
ZHANG, Tianjia
Individualized Interdisciplinary Program
ZHU, Jiapeng
Computer Science and Engineering
CHENG, Ka Leong
Computer Science and Engineering
FAN, Na
Computer Science and Engineering
XIE, Yueqi
Computer Science and Engineering
MENG, Guotao
Electronic and Computer Engineering
CEN, Jun
Individualized Interdisciplinary Program (Robotics and Autonomous Systems) (Completed in 2024)
CHENG, Jie
Electronic and Computer Engineering (Completed in 2024)
GAO, Rongrong
Computer Science and Engineering (Completed in 2024)
JI, Liya
Individualized Interdisciplinary Program (Robotics and Autonomous Systems) (Completed in 2024)
OUYANG, Hao
Computer Science and Engineering (Completed in 2024)
QI, Chenyang
Computer Science and Engineering (Completed in 2024)
SHI, Zifan
(co-supervision)
Computer Science and Engineering (Completed in 2024)
XIE, Jiaxin
Computer Science and Engineering (Completed in 2024)
XING, Yazhou
Computer Science and Engineering (Completed in 2024)
YE, Maosheng
Computer Science and Engineering (Completed in 2024)
WANG, Tengfei
Computer Science and Engineering (Completed in 2023)
WU, Yue
Computer Science and Engineering (Completed in 2023)
HE, Yisheng
Computer Science and Engineering (Completed in 2022)
LEI, Chenyang
Computer Science and Engineering (Completed in 2022)
SONG, Haoran
(co-supervision)
Mechanical Engineering (Completed in 2021)
ZHAN, Dekun
Electronic and Computer Engineering (Completed in 2024)
PARK, Chan Ho
Computer Science and Engineering (Completed in 2023)
WANG, Qijun
(co-supervision)
Individualized Interdisciplinary Program (Artificial Intelligence) (Completed in 2023)
YIN, Zhaoheng
Electronic and Computer Engineering (Completed in 2023)
KHANG, Minsoo
Computer Science and Engineering (Completed in 2022)
LEE, Ka Ho
Individualized Interdisciplinary Program (Robotics and Autonomous Systems) (Completed in 2022)
QIAN, Zian
Computer Science and Engineering (Completed in 2022)
LIU, Yuezhang
(co-supervision)
Computer Science and Engineering (Completed in 2021)
PARK, Chang Dae
Computer Science and Engineering (Completed in 2021)
WAN, Ching Pui
Computer Science and Engineering (Completed in 2021)
YOO, Ji Hyeong
Computer Science and Engineering (Completed in 2021)