Department of Computer Science and Engineering
Chinese University of Hong Kong
Sha Tin, New Territories
Office: Room 1019
Ho Sin-Hang Engineering Building
Publications (my favorite selection) and Google Scholar
Yufei Tao's research aims to develop "small-and-sweet" algorithms: (i) small: easy to implement for deployment in practice, and (ii) sweet: having non-trivial theoretical guarantees.
He has contributed such algorithms in multiple subfields of computer science: clustering, classification, data structures, computational geometry (a.k.a. spatial databases), relational databases, data streams, graph processing, to name a few. In recent years, he has been particularly interested in algorithms dealing with massive datasets that do not fit in the memory of a single machine: such as algorithms in the external memory model (a.k.a. I/O-efficient algorithms), massively parallel models, and models of data streams. He enjoys very much working on problems that arise at the cross-intersection of databases, machine learning, and theoretical computer science.
Short Bio Major Awards
PODS Best Paper Award 2018
Google Faculty Research Award 2016
ACM Distinguished Scientist (Awarded 2016)
SIGMOD Best Paper Award 2015
SIGMOD Best Paper Award 2013
Hong Kong Young Scientist Award 2002
Keynote Speaker of ICDT 2016.
Selected Program Chairmanships (full list)
PC chair of PODS 2020.
PC co-chair of ICDE 2014.
Selected Program Committee Memberships (full list)
SIGMOD: 2007-2009, 2012, 2015, 2017 (group leader), 2018, 2019.
VLDB: 2005, 2009, 2010, 2012-2015, 2017, 2018.
PODS: 2014, 2016, 2017, 2019, 2020 (PC chair).
ICDT: 2015, 2018.
ICDE: 2005, 2007-2010, 2011 (area chair), 2012, 2013, 2014 (PC co-chair), 2016 (area chair), 2017, 2019.
ACM Transactions on Database Systems (TODS) (2008-2015).
IEEE Transactions on Knowledge and Data Engineering (TKDE) (2012-2014).
CSCI2100/ESTR2102 Data Structures.
CMSC5724 Data Mining and Knowledge Discovery.
Past Courses at CUHK
ENGG1410 Linear Algebra and Vector Calculus.
BMEG3120 Database and Security for Biomedical Engineering.
CSCI5010 Computational Geometry.
CSCI5020 External Memory Data Structures.
Past Courses at KAIST
WST501 Fundamentals of Searching Web-Scale Datasets.
WST540 Web Search and Text Analysis.
Past Courses at UQ
COMP3506/7505 Algorithms and Data Structures.
INFS4205/7205 Advanced Techniques for High Dimensional Data.
I believe in supervising a very small number of PhD students simultaneously. The number has never exceeded 2 in my career. I will consider taking a 3rd PhD only if this student has an exceptional background. Applications can be sent in by email. Each application must include a detailed transcript (of the applicant's undergraduate study) and a CV that lists the applicant's awards (since high school) and publications. Applicants with background in math or theoretical algorithms are especially welcome.
Shangqi Lu (PhD student since 2018)
Yu Wang (PhD student admitted in 2017, on leave since Sep 2018)
Dr. Junhao Gan (PhD 2017, now Lecturer at the Uni of Melbourne) Winner of the Australasian Distinguished Doctoral Dissertation Award (John Makepeace Bennett Award) 2018.
Dr. Xiaocheng Hu (PhD 2015, now at Google Moutain View)
Dr. Cheng Sheng (PhD 2012, now at Google Switzerland)
Prof. Xiaokui Xiao (PhD 2008, now Associate Professor at the National Uni of Singapore) Winner of the Hong Kong Young Scientist Award 2009 Winner of the ACM-HK Prof. Francis Chin Research Award 2009
I also had the pleasure of working with master students Jiexing Li, Ling Ding, Xiaobing Wu, and Sze Man Yuen.