Teaching staff

  • Prof. Qi DOU (Instructor)

    Dept. of Computer Science & Engineering
    Email: qidou@cuhk.edu.hk
    Office: Room 911, 9/F, SHB, CUHK
    Consultation hours: Mon 14:00-16:00

  • Mr. Dongchen HE (TA)

    Dept. of Computer Science & Engineering
    Email: dche@link.cuhk.edu.hk
    Office: Room 904, 9/F, SHB, CUHK
    Consultation hours: Wed 14:00-16:00

  • Mr. Yanruisheng SHAO (TA)

    Dept. of Computer Science & Engineering
    Email: 1155231342@link.cuhk.edu.hk
    Office: Room 904, 9/F, SHB, CUHK
    Consultation hours: Wed 14:00-16:00

  • Mr. Wing Yat Alpha CHEUNG (TA)

    Dept. of Biomedical Engineering
    Email: 1155133865@link.cuhk.edu.hk
    Office: Room 1107, 11/F, ERB, CUHK
    Consultation hours: Fri 14:30-16:30

Useful links

News

  • Welcome to BMEG3102 Spring2025!
  • First lecture will start on 8 Jan 2025.
  • Piazza homepage is set up!

Course information

The goal of this course is to introduce the basic concepts in bioinformatics. Topics to be covered include introduction to bioinformatics, reviews of molecular biology and genetics, biomedical data types and databases, sequence alignment and searching methods, mutation models and molecular phylogenetics, molecular structures, clinical bioinformatics for genetic diseases diagnosis. On the theoretical side, students are expected to learn the relevant knowledge in computer science, biology and mathematics from lectures and tutorials. On the practical side, students are given assignments and tutorials to get hands-on experience in locating, studying and using developed tools to apply the learned concepts in performing standard analytical tasks on biomedical data.


Class time and venue

Lectures:

Wednesdays      10:30AM - 11:15AM (MMW 702)
Thursdays          11:30AM - 01:15PM (MMW 702)

Tutorials:

Wednesdays      11:30AM - 12:15PM (MMW 702)

Piazza

For discussions, questions and sharings:
link: https://piazza.com/cuhk.edu.hk/spring2025/bmeg3102


Assessments:

Assignments:                              45%
Topic presentation & report:        15%
Final examination:                       35%
Class participation:                       5%


Tentative class schedule

Week Time Topic Assignment

1 Jan 8 - 9 Introduction --
2 Jan 15 - 16 Sequence alignment and searching (I) --
3 Jan 22 - 23 Sequence alignment and searching (I) --
4 Jan 29 - 30 Lunar new year holiday --
5 Feb 5 - 6 Sequence alignment and searching (II) Asg 1
6 Feb 12 - 13 Sequence alignment and searching (II) --
7 Feb 19 - 20 Mutation models and molecular phylogenetics (I) --
8 Feb 26 - 27 Mutation models and molecular phylogenetics (II) Asg2
9 Mar 05 - 06 Reading week --
10 Mar 12 - 13 Motifs and domains --
11 Mar 19 - 20 High-throughput data processing and analysis (I) --
12 Mar 26 - 27 High-throughput data processing and analysis (II) Asg 3
13 Apr 2-3 Functional annotations --
14 Apr 9 - 10 Molecular structures Asg 4
15 Apr 16 - 17 Topic presentation --


Presentation topics

1. Protein folding prediction with AI
Highly accurate protein structure prediction with AlphaFold
Improved protein structure prediction using potentials from deep learning
After AlphaFold: protein-folding contest seeks next big breakthrough
Accurate structure prediction of biomolecular interactions with AlphaFold 3
2. Drug Design
Leveraging molecular structure and bioactivity with chemical language models for de novo drug design
Accurate prediction of molecular properties and drug targets using a self-supervised image representation learning framework
Comprehensive assessment of deep generative architectures for de novo drug design
3. Functional genomics data privacy
Data Sanitization to Reduce Private Information Leakage from Functional Genomics
Functional genomics data: privacy risk assessment and technological mitigation
Sociotechnical safeguards for genomic data privacy
4. Protein function prediction
Structure-based protein function prediction using graph convolutional networks
DeepGraphGO: graph neural network for large-scale, multispecies protein function prediction
DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks
5. Cancer diagnosis
The evolutionary history of 2,658 cancers
Toward best practice in cancer mutation detection with whole genome and whole-exome sequencing
Tumor fractions deciphered from circulating cell-free DNA methylation for cancer early diagnosis
6. Discover the genetic patterns of complex diseases
Repeat DNA expands our understanding of autism spectrum disorder
Patterns of de novo tandem repeat mutations and their role in autism
Molecular mechanisms underlying nucleotide repeat expansion disorders
7. Spatially resolved transcriptomics
Method of the Year: spatially resolved transcriptomics
Spatially resolved transcriptomics adds a new dimension to genomics
Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention autoencoder
8. Genome-Based drug repurposing
Opportunities for drug repositioning from phenome-wide association studies
Single-cell-led drug repurposing for Alzheimer’s disease
Integrating 3D genomic and epigenomic data to enhance target gene discovery and drug repurposing in transcriptome-wide association studies
9. Transcription factor binding sites prediction
Deep neural networks identify sequence context features predictive of transcription factor binding
Recurrent neural network for predicting transcription factor binding sites
Enhancing the interpretability of transcription factor binding site prediction using attention mechanism
10. Methods to assess off-target CRISPR edits
Inactivation of porcine endogenous retrovirus in pigs using CRISPR-Cas9
CIRCLE-seq: a highly sensitive in vitro screen for genome-wide CRISPR–Cas9 nuclease off-targets
Mapping the genomic landscape of CRISPR–Cas9 cleavage
11. Sequence assembly
Chromosome-scale, haplotype-resolved assembly of human genomes
Sequence assembly demystified
New advances in sequence assembly
12. Multimodal Single-Cell Data Integration
Computational principles and challenges in single-cell data integration
BABEL enables cross-modality translation between multiomic profiles at single-cell resolution
Integrated analysis of multimodal single-cell data

Reference materials

- Bioinformatics Micro-modules, YouTube channel, by Prof. Kevin Yip.
- Fundamental Concepts of Bioinformatics, by Dan E. Krane, Michael L. Raymer and Benjamin Cummings, Pearson Education, 2003.
- Algorithms in Bioinformatics: A Practical Introduction, by Wing-Kin Sung, Chapman & Hall, 2009. [with free online materials]


© BMEG3102 | Spring2025 | The Chinese University of Hong Kong