6.4.2 BLAST Algorithm
7. Multiple Sequence Alignment and Phylogenetics
7.1 Significance of multiple sequence alignment
7.2 Progressive Alignment (ClustalW)
7.3 Basics of phylogeny: Characters, traits, nodes, branches, lineages
7.4 Molecular clock and modeling sequence evolution
7.5 Distances and clustering algorithm: UPGMA and Neighbor Joining (NJ)
7.6 From sequence alignments to trees: Parsimony methods
7.7 Probability based approach: Maximum likelihood methods
III Next Generation Sequencing (NGS) and cancer genomics
Hours: 8
1. NGS and Short reads mapping
Introduction to Genomic Technologies
From Sanger sequencing to NGS
Principles of NGS: Massive parallel sequencing
Features of NGS data: Short reads
Uses Trie structure (Trie and Suffix Array) to search a reference genome
Burrows–Wheeler transform(BWT)
2. Variant calling and output
Genetic variants: structure variants, SNV, CNV
SAM format for mapped reads
Approaches for variants calling
VCF format for saving called variants
3. Cancer genomics and single cell cancer genomics
Calling variants in cancer genomics
Single cell cancer genomes
Tumor microevolution
IV Transcriptomic and epigenomic analysis
Hours: 10
1.Gene expression profiling and RNA-seq
What’s the advantage of RNA-seq compared with microarray?
What factors should we consider for RNA-seq data normalization?
What’s the advantage of single cell sequencing over bulk cells?
2. Single cell RNA-seq
Cellular heterogeneity
Single cell RNA-seq technologies
Distinct cell populations
Pseudo-time inference
3. Epigenome and data anlysis
Definition of epigenetics?
How to detect genome-wide DNA methylation?
How to detect genome-
wide nucleosome positioning and chromatin
accessibility?
How to identify genome-wide TF binging sites? How to do the peak calling?