PhD in Computer Engineering
University of Toronto, 2015
FlashPS: Efficient Generative Image Editing with Mask-aware Caching and Scheduling
Conference paper
ChatCAD: An MLLM-Guided Framework for Zero-shot CAD Drawing Restoration
Article
FedPHE: A Secure and Efficient Federated Learning via Packed Homomorphic Encryption
Article
Guest Editorial Special Issue on Federated Learning for Big Data Applications
Article
Parameter estimation of structural dynamics with neural operators enabled surrogate modeling
Article
Pheromone: Restructuring Serverless Computing With Data-Centric Function Orchestration
Article
SP-Chain: Boosting Intrashard and Cross-Shard Security and Performance in Blockchain Sharding
Article
Structural Clustering for Bipartite Graphs
Article
Top Ten Challenges Towards Agentic Neural Graph Databases
Article
BRIDGING AND MODELING CORRELATIONS IN PAIRWISE DATA FOR DIRECT PREFERENCE OPTIMIZATION
Conference paper
Effective and General Distance Computation for Approximate Nearest Neighbor Search
Conference paper
GPU-Disaggregated Serving for Deep Learning Recommendation Models at Scale
Conference paper
Conference paper
INFLUENCE-GUIDED DIFFUSION FOR DATASET DISTILLATION
Conference paper
KATZ: Efficient Workflow Serving for Diffusion Models with Many Adapters
Conference paper
Locally Balancing Signed Graphs
Conference paper
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing
Conference paper
Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity
Conference paper
Low-Contrast Medical Image Segmentation via Transformer and Boundary Perception
Article
MorphDAG: A Workload-Aware Elastic DAG-Based Blockchain
Article
Self-Supervised Feature Learning for Appliance Recognition in Nonintrusive Load Monitoring
Article
Article
Towards Efficient and Deposit-Free Blockchain-Based Spatial Crowdsourcing
Article
Dordis: Efficient Federated Learning with Dropout-Resilient Differential Privacy
Conference paper
DTC-SpMM: Bridging the Gap in Accelerating General Sparse Matrix Multiplication with Tensor Cores
Conference paper
SAFE: Intelligent Online Scheduling for Collaborative DNN Inference in Vehicular Network
Conference paper
A deep neural network for general scattering matrix
Article
Accelerating Distributed Learning in Non-Dedicated Environments
Article
Deep Learning for Approximate Nearest Neighbour Search: A Survey and Future Directions
Article
DifFormer: Multi-Resolutional Differencing Transformer With Dynamic Ranging for Time Series Analysis
Article
Expanding Reverse Nearest Neighbors
Article
GIFT: Toward Accurate and Efficient Federated Learning With Gradient-Instructed Frequency Tuning
Article
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration
Article
LMSFC: A Novel Multidimensional Index Based on Learned Monotonic Space Filling Curves
Article
Article
Scalable K-FAC Training for Deep Neural Networks with Distributed Preconditioning
Article
Towards Efficient Synchronous Federated Training: A Survey on System Optimization Strategies
Article
Unbiased quasi-hyperbolic nesterov-gradient momentum-based optimizers for accelerating convergence
Article
Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent
Conference paper
Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation
Conference paper
Cardinality Estimation of Subgraph Search Queries with Direction Learner
Conference paper
CoChain: High Concurrency Blockchain Sharding via Consensus on Consensus
Conference paper
CoSaR: Combating Label Noise Using Collaborative Sample Selection and Adversarial Regularization
Conference paper
DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining
Conference paper
Distance Maximization and Defences on Deep Hashing Based Image Retrieval
Conference paper
Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks
Conference paper
Following the Data, Not the Function: Rethinking Function Orchestration in Serverless Computing
Conference paper
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition
Conference paper
Golgi: Performance-Aware, Resource-Efficient Function Scheduling for Serverless Computing
Conference paper
Graph Self-supervised Learning with Augmentation-aware Contrastive Learning
Conference paper
Lion: Adversarial Distillation of Proprietary Large Language Models
Conference paper
Conference paper
A home energy management system incorporating data-driven uncertainty-aware user preference
Article
A Single-to-Multi Network for Latency-Free Non-Intrusive Load Monitoring
Article
Data predictive control of nonlinear process feature dynamics through latent variable behaviours
Article
Editorial: Advances in Mobile, Edge and Cloud Computing
Article
Enabling Cost-Effective, SLO-Aware Machine Learning Inference Serving on Public Cloud
Article
GCP: Graph Encoder with Content-Planning for Sentence Generation from Knowledge Bases
Article
Incentivizing WiFi-Based Multilateration Location Verification
Article
Article
Towards Dependency-Aware Cache Management for Data Analytics Applications
Article
Article
Costly Price Adjustment and Automated Pricing: The Case of Airbnb: An Abstract
Book chapter
Jenga: Orchestrating Smart Contracts in Sharding-Based Blockchain for Efficient Processing
Conference paper
MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction
Conference paper
MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters
Conference paper
Neural Subgraph Counting with Wasserstein Estimator
Conference paper
Owl: Performance-Aware Scheduling for Resource-Efficient Function-as-a-Service Cloud
Conference paper
Pisces: Efficient Federated Learning via Guided Asynchronous Training
Conference paper
POSTER: An LLVM-based Open-Source Compiler for NVIDIA GPUs
Conference paper
Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching
Conference paper
Conference paper
Workload Consolidation in Alibaba Clusters: The Good, the Bad, and the Ugly
Conference paper
A Quantitative Survey of Communication Optimizations in Distributed Deep Learning
Article
Guest Editorial: Interplay between Machine Learning and Networking Systems
Article
Toward Privacy-Preserving Task Assignment for Fully Distributed Spatial Crowdsourcing
Article
A smart adversarial attack on deep hashing based image retrieval
Conference paper
Citadel: Protecting data privacy and model confidentiality for collaborative learning
Conference paper
Communication-efficient federated learning with adaptive parameter freezing
Conference paper
CrystalPerf: Learning to characterize the performance of dataflow computation through code analysis
Conference paper
George: Learning to place long-lived containers in large clusters with operation constraints
Conference paper
Gillis: Serving large neural networks in serverless functions with automatic model partitioning
Conference paper
High-Dimensional Similarity Query Processing for Data Science
Conference paper
KS-GNN: Keywords Search over Incomplete Graphs via Graph Neural Network
Conference paper
Morphling: Fast, near-optimal auto-configuration for cloud-native model serving
Conference paper
Conference paper
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations
Conference paper
Simplifying low-level GPU programming with GAS
Conference paper
Achieving Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition
Article
BatchCrypt: Efficient homomorphic encryption for cross-silo federated learning
Conference paper
Demystifying Tensor Cores to Optimize Half-Precision Matrix Multiply
Conference paper
Metis: Learning to schedule long-running applications in shared container clusters at scale
Conference paper
Not All Explorations Are Equal: Harnessing Heterogeneous Profiling Cost for Efficient MLaaS Training
Conference paper
Optimizing batched winograd convolution on GPUs
Conference paper
RepBun: Load-Balanced, Shuffle-Free Cluster Caching for Structured Data
Conference paper
Semi-dynamic load balancing: Efficient distributed learning in non-dedicated environments
Conference paper
Abnormal Client Behavior Detection in Federated Learning
Conference paper
Characterizing and Synthesizing Task Dependencies of Data-Parallel Jobs in Alibaba Cloud
Conference paper
CMFL: Mitigating communication overhead for federated learning
Conference paper
LACS: Load-aware cache sharing with isolation guarantee
Conference paper
Mark: Exploiting cloud services for cost-effective, slo-aware machine learning inference serving
Conference paper
Round-Robin Synchronization: Mitigating Communication Bottlenecks in Parameter Servers
Conference paper
Towards framework-independent, non-intrusive performance characterization for dataflow computation
Conference paper
Task Selection and Scheduling for Food Delivery: A Game-Theoretic Approach
Article
Unraveling the RTT-fairness Problem for BBR: A Queueing Model
Article
Continuum: A platform for cost-aware, low-latency continual learning
Conference paper
Fair coflow scheduling without prior knowledge
Conference paper
Fast distributed deep learning via worker-adaptive batch sizing
Conference paper
OpuS: Fair and efficient cache sharing for in-memory data analytics
Conference paper
Performance-Aware Fair Scheduling: Exploiting Demand Elasticity of Data Analytics Jobs
Conference paper
SP-cache: Load-balanced, redundancy-free cluster caching with selective partition
Conference paper
Stay fresh: Speculative synchronization for fast distributed machine learning
Conference paper
Utopia: Near-optimal Coflow Scheduling with Isolation Guarantee
Conference paper
LERC: Coordinated cache management for data-parallel systems
Article
Towards Online Checkpointing Mechanism for Cloud Transient Servers
Article
Cluster fair queueing: Speeding up data-parallel jobs with delay guarantees
Conference paper
Coflex: Navigating the fairness-efficiency tradeoff for coflow scheduling
Conference paper
LRC: Dependency-aware cache management for data analytics clusters
Conference paper
Speculative Slot Reservation: Enforcing Service Isolation for Dependent Data-Parallel Computations
Conference paper
Friends or foes: Revisiting strategy-proofness in cloud network sharing
Conference paper
Multi-resource Fair Sharing for Datacenter Jobs with Placement Constraints
Conference paper
Dynamic cloud instance acquisition via IaaS cloud brokerage
Article
Multi-Resource Fair Allocation in Heterogeneous Cloud Computing Systems
Article
Optimal Online Multi-Instance Acquisition in IaaS Clouds
Article
Designing truthful spectrum double auctions with local markets
Article
Local cooperation architecture for self-healing femtocell networks
Article
Dominant resource fairness in cloud computing systems with heterogeneous servers
Conference paper
Low complexity multi-resource fair queueing with bounded delay
Conference paper
On the fairness-Efficiency tradeoff for packet processing with multiple resources
Conference paper
Multi-resource generalized processor sharing for packet processing
Article
Revenue maximization with dynamic auctions in IaaS cloud markets
Article
Dynamic cloud resource reservation via cloud brokerage
Conference paper
Multi-resource round robin: A low complexity packet scheduler with dominant resource fairness
Conference paper
On fairness-efficiency tradeoffs for multi-resource packet processing
Conference paper
To Reserve or Not to Reserve: Optimal Online Multi-Instance Acquisition in IaaS Clouds
Conference paper
Towards optimal capacity segmentation with hybrid cloud pricing
Conference paper
District: Embracing local markets in truthful spectrum double auctions
Conference paper
ChatCAD: An MLLM-Guided Framework for Zero-shot CAD Drawing Restoration
FedPHE: A Secure and Efficient Federated Learning via Packed Homomorphic Encryption
Guest Editorial Special Issue on Federated Learning for Big Data Applications
Parameter estimation of structural dynamics with neural operators enabled surrogate modeling
Pheromone: Restructuring Serverless Computing With Data-Centric Function Orchestration
SP-Chain: Boosting Intrashard and Cross-Shard Security and Performance in Blockchain Sharding
BRIDGING AND MODELING CORRELATIONS IN PAIRWISE DATA FOR DIRECT PREFERENCE OPTIMIZATION
Effective and General Distance Computation for Approximate Nearest Neighbor Search
GPU-Disaggregated Serving for Deep Learning Recommendation Models at Scale
KATZ: Efficient Workflow Serving for Diffusion Models with Many Adapters
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing
Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity
Accelerating Distributed Learning in Non-Dedicated Environments
Deep Learning for Approximate Nearest Neighbour Search: A Survey and Future Directions
DifFormer: Multi-Resolutional Differencing Transformer With Dynamic Ranging for Time Series Analysis
GIFT: Toward Accurate and Efficient Federated Learning With Gradient-Instructed Frequency Tuning
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration
LMSFC: A Novel Multidimensional Index Based on Learned Monotonic Space Filling Curves
Scalable K-FAC Training for Deep Neural Networks with Distributed Preconditioning
Towards Efficient Synchronous Federated Training: A Survey on System Optimization Strategies
Unbiased quasi-hyperbolic nesterov-gradient momentum-based optimizers for accelerating convergence
Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent
Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation
Cardinality Estimation of Subgraph Search Queries with Direction Learner
CoChain: High Concurrency Blockchain Sharding via Consensus on Consensus
CoSaR: Combating Label Noise Using Collaborative Sample Selection and Adversarial Regularization
DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining
Distance Maximization and Defences on Deep Hashing Based Image Retrieval
Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks
Following the Data, Not the Function: Rethinking Function Orchestration in Serverless Computing
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition
Golgi: Performance-Aware, Resource-Efficient Function Scheduling for Serverless Computing
Graph Self-supervised Learning with Augmentation-aware Contrastive Learning
Lion: Adversarial Distillation of Proprietary Large Language Models
A home energy management system incorporating data-driven uncertainty-aware user preference
A Single-to-Multi Network for Latency-Free Non-Intrusive Load Monitoring
Data predictive control of nonlinear process feature dynamics through latent variable behaviours
Enabling Cost-Effective, SLO-Aware Machine Learning Inference Serving on Public Cloud
GCP: Graph Encoder with Content-Planning for Sentence Generation from Knowledge Bases
Incentivizing WiFi-Based Multilateration Location Verification
Towards Dependency-Aware Cache Management for Data Analytics Applications
Jenga: Orchestrating Smart Contracts in Sharding-Based Blockchain for Efficient Processing
MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction
MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters
Owl: Performance-Aware Scheduling for Resource-Efficient Function-as-a-Service Cloud
Pisces: Efficient Federated Learning via Guided Asynchronous Training
Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching
Workload Consolidation in Alibaba Clusters: The Good, the Bad, and the Ugly
A Quantitative Survey of Communication Optimizations in Distributed Deep Learning
Guest Editorial: Interplay between Machine Learning and Networking Systems
Toward Privacy-Preserving Task Assignment for Fully Distributed Spatial Crowdsourcing
A smart adversarial attack on deep hashing based image retrieval
Citadel: Protecting data privacy and model confidentiality for collaborative learning
Communication-efficient federated learning with adaptive parameter freezing
CrystalPerf: Learning to characterize the performance of dataflow computation through code analysis
George: Learning to place long-lived containers in large clusters with operation constraints
Gillis: Serving large neural networks in serverless functions with automatic model partitioning
High-Dimensional Similarity Query Processing for Data Science
KS-GNN: Keywords Search over Incomplete Graphs via Graph Neural Network
Morphling: Fast, near-optimal auto-configuration for cloud-native model serving
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations
BatchCrypt: Efficient homomorphic encryption for cross-silo federated learning
Demystifying Tensor Cores to Optimize Half-Precision Matrix Multiply
Metis: Learning to schedule long-running applications in shared container clusters at scale
Not All Explorations Are Equal: Harnessing Heterogeneous Profiling Cost for Efficient MLaaS Training
RepBun: Load-Balanced, Shuffle-Free Cluster Caching for Structured Data
Semi-dynamic load balancing: Efficient distributed learning in non-dedicated environments
Characterizing and Synthesizing Task Dependencies of Data-Parallel Jobs in Alibaba Cloud
CMFL: Mitigating communication overhead for federated learning
Mark: Exploiting cloud services for cost-effective, slo-aware machine learning inference serving
Round-Robin Synchronization: Mitigating Communication Bottlenecks in Parameter Servers
Towards framework-independent, non-intrusive performance characterization for dataflow computation
Task Selection and Scheduling for Food Delivery: A Game-Theoretic Approach
Unraveling the RTT-fairness Problem for BBR: A Queueing Model
Continuum: A platform for cost-aware, low-latency continual learning
Fast distributed deep learning via worker-adaptive batch sizing
OpuS: Fair and efficient cache sharing for in-memory data analytics
Performance-Aware Fair Scheduling: Exploiting Demand Elasticity of Data Analytics Jobs
SP-cache: Load-balanced, redundancy-free cluster caching with selective partition
Stay fresh: Speculative synchronization for fast distributed machine learning
Utopia: Near-optimal Coflow Scheduling with Isolation Guarantee
LERC: Coordinated cache management for data-parallel systems
Towards Online Checkpointing Mechanism for Cloud Transient Servers
Cluster fair queueing: Speeding up data-parallel jobs with delay guarantees
Coflex: Navigating the fairness-efficiency tradeoff for coflow scheduling
LRC: Dependency-aware cache management for data analytics clusters
Speculative Slot Reservation: Enforcing Service Isolation for Dependent Data-Parallel Computations
Multi-resource generalized processor sharing for packet processing
Revenue maximization with dynamic auctions in IaaS cloud markets
Towards optimal capacity segmentation with hybrid cloud pricing
District: Embracing local markets in truthful spectrum double auctions
Achieving Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition
Article
BatchCrypt: Efficient homomorphic encryption for cross-silo federated learning
Conference paper
Demystifying Tensor Cores to Optimize Half-Precision Matrix Multiply
Conference paper
Metis: Learning to schedule long-running applications in shared container clusters at scale
Conference paper
Not All Explorations Are Equal: Harnessing Heterogeneous Profiling Cost for Efficient MLaaS Training
Conference paper
Optimizing batched winograd convolution on GPUs
Conference paper
RepBun: Load-Balanced, Shuffle-Free Cluster Caching for Structured Data
Conference paper
Semi-dynamic load balancing: Efficient distributed learning in non-dedicated environments
Conference paper
Abnormal Client Behavior Detection in Federated Learning
Conference paper
Characterizing and Synthesizing Task Dependencies of Data-Parallel Jobs in Alibaba Cloud
Conference paper
CMFL: Mitigating communication overhead for federated learning
Conference paper
LACS: Load-aware cache sharing with isolation guarantee
Conference paper
Mark: Exploiting cloud services for cost-effective, slo-aware machine learning inference serving
Conference paper
Round-Robin Synchronization: Mitigating Communication Bottlenecks in Parameter Servers
Conference paper
Towards framework-independent, non-intrusive performance characterization for dataflow computation
Conference paper
Task Selection and Scheduling for Food Delivery: A Game-Theoretic Approach
Article
Unraveling the RTT-fairness Problem for BBR: A Queueing Model
Article
Continuum: A platform for cost-aware, low-latency continual learning
Conference paper
Fair coflow scheduling without prior knowledge
Conference paper
Fast distributed deep learning via worker-adaptive batch sizing
Conference paper
OpuS: Fair and efficient cache sharing for in-memory data analytics
Conference paper
Performance-Aware Fair Scheduling: Exploiting Demand Elasticity of Data Analytics Jobs
Conference paper
SP-cache: Load-balanced, redundancy-free cluster caching with selective partition
Conference paper
Stay fresh: Speculative synchronization for fast distributed machine learning
Conference paper
Utopia: Near-optimal Coflow Scheduling with Isolation Guarantee
Conference paper
LERC: Coordinated cache management for data-parallel systems
Article
Towards Online Checkpointing Mechanism for Cloud Transient Servers
Article
Cluster fair queueing: Speeding up data-parallel jobs with delay guarantees
Conference paper
Coflex: Navigating the fairness-efficiency tradeoff for coflow scheduling
Conference paper
LRC: Dependency-aware cache management for data analytics clusters
Conference paper
Speculative Slot Reservation: Enforcing Service Isolation for Dependent Data-Parallel Computations
Conference paper
Friends or foes: Revisiting strategy-proofness in cloud network sharing
Conference paper
Multi-resource Fair Sharing for Datacenter Jobs with Placement Constraints
Conference paper
Dynamic cloud instance acquisition via IaaS cloud brokerage
Article
Multi-Resource Fair Allocation in Heterogeneous Cloud Computing Systems
Article
Optimal Online Multi-Instance Acquisition in IaaS Clouds
Article
Designing truthful spectrum double auctions with local markets
Article
Local cooperation architecture for self-healing femtocell networks
Article
Dominant resource fairness in cloud computing systems with heterogeneous servers
Conference paper
Low complexity multi-resource fair queueing with bounded delay
Conference paper
On the fairness-Efficiency tradeoff for packet processing with multiple resources
Conference paper
Multi-resource generalized processor sharing for packet processing
Article
Revenue maximization with dynamic auctions in IaaS cloud markets
Article
Dynamic cloud resource reservation via cloud brokerage
Conference paper
Multi-resource round robin: A low complexity packet scheduler with dominant resource fairness
Conference paper
On fairness-efficiency tradeoffs for multi-resource packet processing
Conference paper
To Reserve or Not to Reserve: Optimal Online Multi-Instance Acquisition in IaaS Clouds
Conference paper
Towards optimal capacity segmentation with hybrid cloud pricing
Conference paper
District: Embracing local markets in truthful spectrum double auctions
Conference paper
| COMP3511 | Operating Systems |
| COMP4971A | Independent Work |
| COMP4651 | Cloud Computing and Big Data Systems |
| COMP4981H | Final Year Thesis |
| CPEG4902 | Computer Engineering Final Year Thesis in COMP |
| CSIT5970 | Advanced Cloud Computing |
| COMP4971A | Independent Work |
| No Teaching Assignments |
| No Teaching Assignments |
| No Teaching Assignments |
FENG, Tianyu
Computer Science and Engineering
LU, Hanfeng
Computer Science and Engineering
CAO, Lunxi
Computer Science and Engineering
CHANG, Chaokun
Computer Science and Engineering
JIANG, Xiaoxiao
Computer Science and Engineering
WU, Tianyuan
Computer Science and Engineering
YAO, Sheng
Computer Science and Engineering
ZHOU, Yukun
Computer Science and Engineering
YANG, Yuchen
Computer Science and Engineering
CHEN, Dong
Computer Science and Engineering
YE, Peng
(co-supervision)
Computer Science and Engineering
YU, Haoxuan
Computer Science and Engineering
ZHAO, Yuheng
Computer Science and Engineering
LI, Suyi
Computer Science and Engineering
AN, Dakai
Computer Science and Engineering
AN, Jiayu
(co-supervision)
Computer Science and Engineering
SUN, Heyang
(co-supervision)
Computer Science and Engineering
YANG, Lingyun
Computer Science and Engineering( Completed in 2025 )
JIANG, Zhifeng
Computer Science and Engineering( Completed in 2024 )
WANG, Zhili
(co-supervision)
Individualized Interdisciplinary Program (Data Science and Analytics)( Completed in 2024 )
YU, Minchen
Computer Science and Engineering( Completed in 2023 )
LI, Mingzhe
Computer Science and Engineering( Completed in 2022 )
TIAN, Huangshi
Computer Science and Engineering( Completed in 2022 )
WENG, Qizhen
Computer Science and Engineering( Completed in 2022 )
YAN, Da
Computer Science and Engineering( Completed in 2022 )
ZHENG, Yunchuan
Computer Science and Engineering( Completed in 2023 )
Update your browser to view this website correctly. Update your browser now