This chapter mainly introduces the theoretical model of linear regression and various techniques for
solving optimization problems.
3.1 Liner Regression (2 hours)
This section mainly introduces the basic concepts of linear regression, and focuses on the basic methods of
linear regression and nonlinear regression, as well as the analysis of variance and significance of regression
equations.
3.2 Decision and Optimization (2 hours)
This chapter mainly explains the definition, classification and various techniques for solving optimization
problems.
Chapter 4 :Classification: Basic Concepts, Decision Trees, and Model Evaluation (5 hours)
4.1 Preliminaries (1 hours)
This section mainly describes the basic concepts of classification and introduces the general approach to
solving classification problems.
4.3 Decision Tree Induction (2 hours)
This part mainly explains the working principle of the decision tree classification method and the method of
establishing the decision tree.
4.4 Model Over fitting (1 hours)
This section focuses on the model over-fitting problem and how to deal with over-fitting in decision tree
induction.
4.5 Evaluating the Performance of a Classifier (1 hours)
This section mainly introduces some commonly methods for evaluating the performance of a classifier, such
as retention methods, random subsampling, and cross-validation.
Chapter 5 :Classification: Alternative Techniques (5 hours)
5.1 Rule-Based Classifier (1 hours)
This section mainly explains the working principle and characteristics of the rule-based classifier and
introduces the method of rule extraction.
5.2 Nearest-Neighbour classifiers (1 hours)
This section mainly explains the algorithm of the nearest neighbour classification and the characteristics of
the nearest neighbour classifier.
5.3 Bayesian Classifiers (1 hours)
This section mainly explains Bayesian and the application of Bayesian in classification.
5.4 Artificial Neural Network (ANN) (1 hours)
This section mainly explains the structure and characteristics of artificial neural networks.
5.5 Support Vector Machine (SVM) (1 hours)
This section mainly explains the basic idea of support vector machine, and explains how to train SVM on
linear separable data and nonlinear separable data.
Chapter 6 :Association Analysis: Basic Concepts and Algorithms (5 hours)
6.1 Problem Definition (1 hours)
This section mainly explains the formal description of the mining rules of association rules and the strategies
commonly used in association rules mining algorithms.
6.2 The Generation of Frequent Itemset and Rule (2 hours)
This section mainly explains the effective techniques for generating frequent itemsets and rules in the
association rule mining algorithm, focusing on the frequent itemsets and rules generation of the Apriori
algorithm.
6.4 FP-Growth Algorithm (1 hours)
This section mainly introduces other algorithms that generate frequent itemsets—the FP growth algorithm.
6.5 Evaluation of Association Patterns (1 hours)
This section focuses on methods for assessing the quality of associated models.
Chapter 7 :Cluster Analysis: Basic Concepts and Algorithms (4 hours)
7.1 K-means, Agglomerative Hierarchical Clustering, DBSCAN (3 hours)
This section mainly introduces the K-means, condensed hierarchical clustering algorithm, the basic
principles of the DBSCAN algorithm, and the advantages and disadvantages of each algorithm.
7.2 Cluster Evaluation (1 hours)
This section mainly introduces methods for evaluating clusters generated by clustering algorithms.
Chapter 8: Applications (3 hours)
This chapter mainly introduces the application of data mining algorithms in practical cases.