Announcements
  • Sept 4, 2017: Lectures will start on Sept 5 and Labs will start on Sept 12. The lectures this week and the first lab will give you important information for you to decide whether you should enrol in this course, so do not miss them.
  • Sept 4, 2017: Course schedule and lecture notes may be updated every week according to the progress of teaching.

  • Instructor
    Prof. James CHENG
    Email: jcheng at cse.cuhk.edu.hk
    Office: SHB 107
    Office hour: Tuesday 3:30pm - 5:30pm

    Tutors
    Tatiana Jin
    Email: tjin at cse.cuhk.edu.hk
    Office: SHB 116
    Office hour: Wednesday 12:30pm - 2:30pm
    Yunjian Zhao
    Email: yjzhao at cse.cuhk.edu.hk
    Office: SHB 116
    Office hour: Wednesday 4:30pm - 6:30pm

    Lecture/Lab Time and Place
    Time Venue
    Lecture Tuesday 13:30 - 14:15 William M W Mong Engg Bldg (ERB) 703
    Lecture Wednesday 14:30 - 16:15 Basic Medical Sciences Building - BMS 2
    Lab Tuesday 14:30 - 15:15 Ho Sin-Hang Eng Bldg (SHB) 904


    Course Schedule
    Topic Title Tentative Schedule Notes
    Introduction Week 1 intro, intro2
    Distributed Data Analytics Systems Week 2 - 5 Hadoop, HaLoop, FlumeJava, Dryad, Spark, Naiad, Husky
    Prerequisite: Relational Database Systems Self Reading RelationalDB 1, RelationalDB 2, RelationalDB 3, RelationalDB 4
    Prerequisite: Distributed Database Systems Self Reading Database System Architectures, Parallel Databases, Distributed Databases
    NoSQL Week 5 - 8 Intro2NoSQL/NewSQL, Bigtable, Pnuts, Dynamo, NoSQL tutorial by Felix Gessert
    NewSQL Week 9 - 10 H-Store, Schism, Horticulture, Calvin
    Distributed Graph Processing Systems Week 11 - 12 Pregel, GraphLab, PowerGraph, GraphChi, Pregel+, Blogel, Quegel
    Distributed Stream Processing Systems Week 12 - 13 S4, Storm, Summingbird, Sparkstreaming, MillWheel
    Column Stores Week 12 - 13 C-Store
    Extra --- Paxos


    Course Project
    Please refer to the course project webpage for details.



    Reading Lists
    Big Data Analysis Systems
    NoSQL
    NewSQL
    Graph Computing Systems
    Distributed Stream Processing Systems
    Other Data Storage and Database Systems

    Reference Materials