Faculty Profiles - RAO Anyi | The Hong Kong University of Science and Technology

Assistant Professor
Division of Arts and Machine Creativity
Department of Computer Science and Engineering
Division of Emerging Interdisciplinary Areas

Associate Director of Media Intelligence Research Center

Research Interest

AI for creativity

Computer vision

Human-computer interaction

Computer graphics

Film studies

Publications

2026 2

Simulating the Real World: A Unified Survey of Multimodal Generative Models

IEEE Transactions on Pattern Analysis and Machine Intelligence, p. 1-20, article number 11509284
Hu, Yuqi; Wang, Longguang; Liu, Xian; Chen, Ling Hao; Guo, Yuwei; Shi, Yukai; Liu, Ce; Rao, Anyi; Wang, Zeyu; Xiong, Hui
Article

Composing Concepts from Images and Videos via Concept-prompt Binding

Paper presented at IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026), Colorado, United States
Kong, Xianghao; Zhang, Zeyu; Guo, Yuwei; Zhao, Zhuoran; Zhang, Songchun; Rao, Anyi
Conference paper

2025 8

AI for Creative Visual Content Generation, Editing and Understanding

Proceedings - SIGGRAPH 2025 Frontiers / edited by Spencer Stephen N. Association for Computing Machinery, Inc, 2025, p. 1-2, article number 17
Patashnik, Or; Parmar, Gaurav; Rao, Anyi; Kara, Ozgur; Caba Heilbron, Fabian; Cohen-Or, Daniel; Matthew Rehg, James; Zhu, Jun Yan
Conference paper

CineVision: An Interactive Pre-visualization Storyboard System for Director-Cinematographer Collaboration

UIST 2025 - Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology / edited by Bianchi Andrea; Glassman Elena L.; Mackay Wendy E.; Zhao Shengdong; Oakley Ian; Kim Jeeeun. New York: Association for Computing Machinery, Inc, 2025, p. 1-18, article number 18
Wei, Zheng; Wu, Hongtao; Zhang, Lvmin; Xu, Xian; Zheng, Yanfeng; Hui, Pan; Agrawala, Maneesh; Qu, Huamin; Rao, Anyi
Conference paper

Generative AI for Film Creation: A Survey of Recent Advances

Proceedings - 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2025, IEEE Computer Society, 2025, p. 6257-6269, article number 11147450
Zhang, Ruihan; Yu, Borou; Min, Jiajian; Xin, Yetong; Wei, Zheng; Shi, Juncheng Nemo; Huang, Mingzhen; Kong, Xianghao; Xin, Nix Liu; Jiang, Shanshan; Bahuguna, Praagya; Chan, Mark; Hora, Khushi; Yang, Lijian; Liang, Yongqi; Bian, Runhe; Liu, Yunlei; Valencia, Isabela Campillo; Tredinick, Patricia Morales; Kozlov, Ilia; Jiang, Sijia; Huang, Peiwen; Chen, Na; Liu, Xuanxuan; Rao, Anyi
Conference paper

Generative Models for Visual Content Editing and Creation

Proceedings - SIGGRAPH Asia 2025 Courses, SA Courses 2025 / edited by Spencer Stephen N.; Komura Taku; Skouras Melina. Association for Computing Machinery, Inc, 2025, p. 1-3, article number 5
Wei, Zheng; Xu, Xian; Liu, Yuqing; Han, Grace; Rao, Anyi
Conference paper

Keyframe-Guided Creative Video Inpainting

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 13009-13020, article number 11093671
Guo, Yuwei; Yang, Ceyuan; Rao, Anyi; Meng, Chenlin; Bar-Tal, Omer; Ding, Shuangrui; Agrawala, Maneesh; Lin, Dahua; Dai, Bo
Conference paper

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper presented at International Conference on Computer Vision (ICCV 2025), Honolulu, United States
Zhou, Yujie; Bu, Jiazi; Lin, Pengyang; Zhang, Pan; Wu, Tong; Huang, Qidong; Li, Jinsong; Dong, Xiaoyi; Zang, Yuhang; Cao, Yuhang; Rao, Anyi; Wang, Jiaqi; Niu, Li
Conference paper

Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport

13th International Conference on Learning Representations, ICLR 2025, International Conference on Learning Representations, ICLR, 2025, p. 84422-84439
Zhang, Lvmin; Rao, Anyi; Agrawala, Maneesh
Conference paper

SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models

Computer Vision – ECCV 2024 - 18th European Conference, Proceedings / edited by Leonardis Aleš; Ricci Elisa; Roth Stefan; Russakovsky Olga; Sattler Torsten; Varol Gül. Springer Science and Business Media Deutschland GmbH, 2025, p. 330-348
Guo, Yuwei; Yang, Ceyuan; Rao, Anyi; Agrawala, Maneesh; Lin, Dahua; Dai, Bo
Conference paper

2024 5

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Paper presented at 12th International Conference on Learning Representations, ICLR 2024, Hybrid, Vienna, Austria
Guo, Yuwei; Yang, Ceyuan; Rao, Anyi; Liang, Zhengyang; Wang, Yaohui; Qiao, Yu; Agrawala, Maneesh; Lin, Dahua; Dai, Bo
Conference paper

Cinematic Behavior Transfer via NeRF-based Differentiable Filming

Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, IEEE Computer Society, 2024, p. 6723-6732
Jiang, Xuekun; Rao, Anyi; Wang, Jingbo; Lin, Dahua; Dai, Bo
Conference paper

Cinematic Behavior Transfer via NeRF-based Differentiable Filming

Paper presented at The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
Dai, Bo; Jiang, Xuekun; Lin, Dahua; Rao, Anyi; Wang, Jingbo
Conference paper

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

Proceedings of Machine Learning Research, v. 235, p. 44960-44990
Shi, Dachuan; Tao, Chaofan; Rao, Anyi; Yang, Zhendong; Yuan, Chun; Wang, Jiaqi
Conference paper

ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database

UIST 2024 - Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, Association for Computing Machinery, Inc, 2024, article number 21
Rao, Anyi; Chou, Jean Peïc; Agrawala, Maneesh
Conference paper

2023 7

A Coarse-to-Fine Framework for Automatic Video Unscreen

IEEE Transactions on Multimedia, v. 25, p. 2723-2733
Rao, Anyi; Xu, Linning; Li, Zhizhong; Huang, Qingqiu; Kuang, Zhanghui; Zhang, Wayne; Lin, Dahua
Article

Adding Conditional Control to Text-to-Image Diffusion Models

Proceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023, Institute of Electrical and Electronics Engineers Inc., 2023, p. 3813-3824
Zhang, Lvmin; Rao, Anyi; Agrawala, Maneesh
Conference paper

Automated Conversion of Music Videos into Lyric Videos

UIST 2023 - Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, Association for Computing Machinery, Inc, 2023, article number 13
Ma, Jiaju; Rao, Anyi; Wei, Li Yi; Kazi, Rubaiat Habib; Shin, Hijung Valentina; Agrawala, Maneesh
Conference paper

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production

Proceedings - SIGGRAPH 2023 Posters / edited by Spencer Stephen N. Association for Computing Machinery, Inc, 2023, article number 5
Rao, Anyi; Jiang, Xuekun; Guo, Yuwei; Xu, Linning; Yang, Lei; Jin, Libiao; Lin, Dahua; Dai, Bo
Conference paper

HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE

Proceedings of the 32nd International Joint Conference on Artificial Intelligence, IJCAI 2023 / edited by Elkind Edith. International Joint Conferences on Artificial Intelligence, 2023, p. 4903-4911
Wei, Zikai; Rao, Anyi; Dai, Bo; Lin, Dahua
Conference paper

Self-Supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences

AAAI-23 Technical Tracks 3 / edited by Williams Brian; Chen Yiling; Neville Jennifer. AAAI Press, 2023, p. 3825-3833
Zhou, Yujie; Duan, Haodong; Rao, Anyi; Su, Bing; Wang, Jiaqi
Conference paper

Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization

MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia, Association for Computing Machinery, Inc, 2023, p. 5302-5310
Zhou, Yujie; Qiang, Wenwen; Rao, Anyi; Lin, Ning; Su, Bing; Wang, Jiaqi
Conference paper

2022 5

Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos

IEEE Transactions on Multimedia, v. 24, p. 3049-3059
Jiang, Xuekun; Jin, Libiao; Rao, Anyi; Xu, Linning; Lin, Dahua
Article

AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation

Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, IEEE Computer Society, 2022, p. 11614-11624
Liu, Xueyi; Xu, Xiaomeng; Rao, Anyi; Gan, Chuang; Yi, Li
Conference paper

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

Computer Vision – ECCV 2022 - 17th European Conference, Proceedings / edited by Avidan Shai; Brostow Gabriel; Cissé Moustapha; Farinella Giovanni Maria; Hassner Tal. Springer Science and Business Media Deutschland GmbH, 2022, p. 106-122
Xiangli, Yuanbo; Xu, Linning; Pan, Xingang; Zhao, Nanxuan; Rao, Anyi; Theobalt, Christian; Dai, Bo; Lin, Dahua
Conference paper

Shoot360: Normal View Video Creation from City Panorama Footage

Proceedings - SIGGRAPH 2022 Conference Papers / edited by Spencer Stephen N. Association for Computing Machinery, Inc, 2022, article number 13
Rao, Anyi; Xu, Linning; Lin, Dahua
Conference paper

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows

Paper presented at 2022 European Conference on Computer Vision (ECCV 2022)
Dai, Bo; Guo, Yuwei; Jiang, Xuekun; Jin, Libiao; Lin, Dahua; Rao, Anyi; Wang, Sichen; Wu, Xiaoyu
Conference paper

2021 1

BlockPlanner: City Block Generation with Vectorized Graph Representation

Proceedings - 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Institute of Electrical and Electronics Engineers Inc., 2021, p. 5057-5066
Xu, Linning; Xiangli, Yuanbo; Rao, Anyi; Zhao, Nanxuan; Dai, Bo; Liu, Ziwei; Lin, Dahua
Conference paper

2020 4

A Local-to-Global Approach to Multi-Modal Movie Scene Segmentation

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 10143-10152, article number 9157529
Rao, Anyi; Xu, Linning; Xiong, Yu; Xu, Guodong; Huang, Qingqiu; Zhou, Bolei; Lin, Dahua
Conference paper

A Unified Framework for Shot Type Classification Based on Subject Centric Lens

Computer Vision – ECCV 2020 - 16th European Conference, 2020, Proceedings / edited by Vedaldi Andrea; Bischof Horst; Brox Thomas; Frahm Jan-Michael. Springer Science and Business Media Deutschland GmbH, 2020, p. 17-34
Rao, Anyi; Wang, Jiaze; Xu, Linning; Jiang, Xuekun; Huang, Qingqiu; Zhou, Bolei; Lin, Dahua
Conference paper

MovieNet: A Holistic Dataset for Movie Understanding

Computer Vision – ECCV 2020 - 16th European Conference, 2020, Proceedings / edited by Vedaldi Andrea; Bischof Horst; Brox Thomas; Frahm Jan-Michael. Springer Science and Business Media Deutschland GmbH, 2020, p. 709-727
Huang, Qingqiu; Xiong, Yu; Rao, Anyi; Wang, Jiaze; Lin, Dahua
Conference paper

Online Multi-modal Person Search in Videos

Computer Vision – ECCV 2020 - 16th European Conference, Proceedings / edited by Vedaldi Andrea; Bischof Horst; Brox Thomas; Frahm Jan-Michael. Springer Science and Business Media Deutschland GmbH, 2020, p. 174-190
Xia, Jiangyue; Rao, Anyi; Huang, Qingqiu; Xu, Linning; Wen, Jiangtao; Lin, Dahua
Conference paper

2018 1

HotFlip: White-Box Adversarial Examples for NLP

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, v. 2, July 2018, p. 31-36
Ebrahimi, Javid; Rao, Anyi; Lowd, Daniel; Dou, Dejing
Conference paper

Teaching Assignment

AMCC5150	Visual Computing for Visual Arts and Creativity
AMCC5250	Filmmaking with AI Innovations
AMCC6950A	Special Projects in Arts and Machine Creativity

AMCC5000	Creative Convergence: Foundations of Arts and Machine Creativity
AMCC5140	AI for Visual Arts and Creativity
AMCC6950A	Special Projects in Arts and Machine Creativity
EMIA6950F	Independent Study

EMIA6500K	Visual Computing for Visual Content Creation

Research Postgraduate (RPG) Supervision

From January 2023 to December 2026 (As of 29 July 2026)

Current RPGs

Projects

From January 2024 to December 2026
