Course Professor: Kin Hong
Wong
Web based support : https://blackboard.cuhk.edu.hk
Final Exam: Date: Wed. 11
Dec. 2024.
Venue:
WMY 406 (if your surname starts with A-L.),
WMY 407 (if your
surname starts with M-Z.). See classroom_info and cuhk-campus-map.
The exam will last for 2 hours, it is a closed-book exam and no cheat sheet allowed. However, the CMSC5707_formulas.docx (draft will be updated later) will be printed and attached to your exam question paper. We will provide each student an answer book, supplementary sheets and scrap paper for calculation. Materials in the appendices of lecture notes will not be included in the exam. A non-programmable calculator can be used. We accept calculators that are (or similar to that) in the list
See AVSU
and cuhk-campus-map
Titles: Teaching materials will be updated regularly |
Week/date (tentative) |
Topics |
Additional information |
wav sounds/ Matlab code |
Assignments |
Overview |
|||||
Ch1: Introduction to audio signal processing | 1(4/9) | Time/frequency signals,Spectrogram, Mel Scale
Cepstral coeff., MFCC |
octave_guide | tz1, tz2, sor1, violin3x A4_oboe , trumpet , sor1, A4_violin ,A5_flute num1, num5 |
|
Ch1 Continues |
2 (11/9) |
Vector quantization and K-means, Dynamic programming for speech recognition |
|
||
No lecture (Holiday) |
(18/9) |
Mid-Autumn festival |
|||
Ch2 :
ensemble methods. |
3(25/9) |
Classifiers, boosting |
|
Assignment1( released at https://blackboard.cuhk.edu.hk ) | |
Ch3
: Face detection |
4(2/10) |
Face features, Attentional,cascade, |
|
|
|
Ch 4
: Neural Networks |
5(9/10) 4 |
Basic neural network and training | |||
Ch
5 : Convolution Neural Network CNN |
6(16/10) |
Convolution Neural Network CNN | Tensorflow
tutorials 5707_tf_keras.pptx https://colab.research.google.com/ |
Assignment2( To be released at https://blackboard.cuhk.edu.hk ) | |
Ch 6 : AutoEncoder | 7(23/10) |
Classical and Variational AutoEncoder |
|
||
Ch 7
: RNN_LSTM |
8(30/10) |
Recurrent Neural Net, Long short-Term Memory |
|
||
Ch 8 : Word representation and
seq2seq |
9(6/11) |
Word embedding, Machine translation, |
Assignment3 (To be released at https://blackboard.cuhk.edu.hk ) Tutorial_LSTM_music_genre_classification.docx |
||
Ch 9 :
Transformer |
10(13/11) | The transformer model |
|||
Ch10 : Decision tree | 11(20/11) |
Decision tree using Gini and information gain
by entropy |
https://scikit-learn.org/stable/ | ||
Ch 11 : Fuzzy | 12(27/11) |
Fuzzy inference systems |
|||
No lecture | 13 (4 /12) |
self revision |
|||
Final Exam, time/venue: see above | |||||
Important
information