DEng in Pattern Recognition and Intelligent Systems
University of Chinese Academy of Sciences, 2015
AttnZero: Efficient Attention Discovery for Vision Transformers
Conference paper
Auto-GAS: Automated Proxy Discovery for Training-Free Generative Architecture Search
Conference paper
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Conference paper
CHATEVAL: Towards better LLM-based evaluators through multi-agent debate
Conference paper
ChatMusician: Understanding and Generating Music Intrinsically with LLM
Conference paper
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Conference paper
DetKDS: Knowledge Distillation Search for Object Detectors
Conference paper
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
Conference paper
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Conference paper
FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection
Conference paper
RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation
Conference paper
VIDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Conference paper
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Conference paper
Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech
Article
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Conference paper
Conference paper
Generated Therapeutic Music Based on the ISO Principle
Conference paper
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Conference paper
Marble: Music audio representation benchmark for universal evaluation
Conference paper
MoMusic: A Motion-Driven Human-AI Collaborative Music Composition and Performing System
Conference paper
Conference paper
Causal System Identification based Compensation for Reverberation-Robust DOA Estimation
Conference paper
Neural Kalman filtering for speech enhancement
Conference paper
Speech Enhancement Based on Modulation-Domain Parametric Multichannel Kalman Filtering
Article
Conference paper
Sound event localization and detection based on multiple DOA beamforming and multi-task learning
Conference paper
The JD AI speaker verification system for the FFSVC 2020 challenge
Conference paper
Noise covariance matrix estimation for rotating microphone arrays
Article
Direct-path signal cross-correlation estimation for sound source localization in reverberation
Conference paper
Estimation of the Noise Covariance Matrix for Rotating Sensor Arrays
Conference paper
Modulation-Domain Multichannel Kalman Filtering for Speech Enhancement
Article
Binaural mask-informed speech enhancement for hearing aids with head tracking
Conference paper
Modulation-domain parametric multichannel Kalman filtering for speech enhancement
Conference paper
Multichannel Kalman Filtering for Speech Enhancement
Conference paper
Conference paper
Multilingual I-vector based statistical modeling for music genre classification
Conference paper
Conference paper
Conference paper
Conference paper
Semi-supervised learning of bottleneck feature for music genre classification
Conference paper
Under-modelled blind system identification for time delay estimation in reverberant environments
Conference paper
Article
Joint optimization of recurrent networks exploiting source auto-regression for source separation
Conference paper
Two-stage multi-target joint learning for monaural speech separation
Conference paper
Article
Conference paper
Weighted spatial bispectrum correlation matrix for DOA estimation in the presence of interferences
Conference paper
The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio
Article
Conference paper
Direction of arrival estimation based on subband weighting for noisy conditions
Conference paper
EMIA4110 | Practical Machine Learning |
IIMP6090 | Postgraduate Seminar |
EMIA4110 | Practical Machine Learning |
IIMP6090 | Postgraduate Seminar |
IIMP6090 | Postgraduate Seminar |
No Teaching Assignments |
No Teaching Assignments |
No Teaching Assignments |
CHAN, Chi-min
(co-supervision)
Individualized Interdisciplinary Program
CHENG, Sitong
Individualized Interdisciplinary Program
JIA, Xianzhang
(co-supervision)
Individualized Interdisciplinary Program
JIANG, Chunyang
(co-supervision)
Individualized Interdisciplinary Program
JIN, Yizhu
Individualized Interdisciplinary Program
LIU, Yulong
(co-supervision)
Computer Science and Engineering
LU, Yiwen
(co-supervision)
Individualized Interdisciplinary Program
YU, Zhouliang
(co-supervision)
Individualized Interdisciplinary Program
ZHU, Chuanbo
(co-supervision)
Individualized Interdisciplinary Program
CHEN, Jianyi
Individualized Interdisciplinary Program
CHEN, Xi
(co-supervision)
Individualized Interdisciplinary Program
LI, Lujun
(co-supervision)
Individualized Interdisciplinary Program
TIAN, Zeyue
(co-supervision)
Individualized Interdisciplinary Program
YE, Zhen
Individualized Interdisciplinary Program
YUAN, Ruibin
(co-supervision)
Individualized Interdisciplinary Program
ZHOU, Ziya
(co-supervision)
Individualized Interdisciplinary Program