PhD in Control Science and Engineering
Tsinghua University, 2022
Conference paper
Deployment Efficient Reward-Free Exploration with Linear Function Approximation
Conference paper
Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs
| COMP4211 | Machine Learning |