1. Business Problems and Data Science Solutions: From Business Problems to Data
Mining Tasks, The Data Mining Process, Other Analytics Techniques and
Technologies; CRISP - DM
2. Introduction to data mining: Basic concepts of data mining, Different types of data
repositories, Data mining functionalities, Concept of interesting patterns, Data mining
tasks, Current trends, Major issues and ethics in data mining
3. Introduction to mining using Python: Introduction to python, Introduction to
various packages as tool, features of related packages
4. Introduction to Statistical Programming Methods with Python: Basic Probability
and Statistics with python and hands on.
5. Extracting Data Sets: Importing data using different formats of files, Storing and
retrieving data, Graphical representation of data, Hands on practice problems on data
import/export.
6. Data Preprocessing: Data cleaning, Data integration and transformation, Data
reduction, Discretization and concept hierarchy generation
7. Association and Correlation Analysis: Basic concepts of frequent pattern and
association rule, frequent itemset generation with Apriori algorithm and FP Growth
algorithm, Rule generation, Applications of Association rules
8. Machine Learning: Overview of machine learning (ML), Supervised learning,-
Classification, Regression; Unsupervised learning – Clustering , Introduction to Neural
Networks.
9. Classification: Introduction to classification, Introduction to Classification methods,
Basic concepts of binary classification, Bayes theorem and Naive Bayes classifier,
Association based classification, Rule based classifiers, Nearest neighbor classifiers,
Decision Trees, Random Forest,
10. Prediction and Classification using python: Applying model for prediction,
Bayesian Classification on new imported data, Bayesian Classification on existed
dummy data set, Decision Tree classification on dummy data sets, Practice problems
on classification methods, Applications of classification for web mining
11. Clustering Algorithms and Cluster Analysis: Measures of similarity, K means
partitioning method, k medoids method, CLARANS method, Agglomerative and
divisive clustering hierarchical method, etc.
12. Clustering methods using python: Introduction to clustering, Introduction to
Clustering algorithms, differentiate clustering and classification, K-means clustering,
Hierarchical clustering algorithm,
13. Analysis & Visualizations using python: Data Visualization - Basic Charts,
Multidimensional Visualization, Specialized Visualizations, Intro to PowerBI or Tableau
(any one shall be explained)
14. Web Dashboards with python: App development;