Machine Learning

Course Code: bspkaml

Duration: 21 hours

Prerequisites:

Python Knowledge

Overview:

This course combines theory with practical work, using specific examples throughout.

Course Outline:

Introduction

This section gives a general introduction to machine learning: when to use it, what to consider, and what it all means, including the pros and cons. Topics covered: data types (structured, unstructured, static, streamed); data validity and volume; data-driven vs. user-driven analytics; statistical models vs. machine learning models; challenges of unsupervised learning; the bias-variance trade-off; iteration and evaluation; cross-validation approaches; supervised, unsupervised, and reinforcement learning.

MAJOR TOPICS

1. Understanding naive Bayes

Basic concepts of Bayesian methods
Probability
Joint probability
Conditional probability with Bayes' theorem
The naive Bayes algorithm
The naive Bayes classification
The Laplace estimator
Using numeric features with naive Bayes
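
As an illustration of the topics above, the following is a minimal sketch of naive Bayes classification with the Laplace estimator, assuming purely categorical features. The function names (train_naive_bayes, predict_proba) and the dictionary-based model layout are hypothetical choices made for this example, not part of the course material.

```python
from collections import Counter, defaultdict


def train_naive_bayes(rows, labels, laplace=1):
    """Count class and feature-value frequencies (categorical features only)."""
    class_counts = Counter(labels)
    feature_counts = defaultdict(Counter)  # (feature index, class) -> value counts
    values = defaultdict(set)              # feature index -> distinct values seen
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            feature_counts[(i, label)][value] += 1
            values[i].add(value)
    return {
        "class_counts": class_counts,
        "feature_counts": feature_counts,
        "values": values,
        "laplace": laplace,
    }


def predict_proba(model, row):
    """Return normalized posterior probabilities for each class."""
    class_counts = model["class_counts"]
    total = sum(class_counts.values())
    scores = {}
    for label, n_label in class_counts.items():
        score = n_label / total  # prior probability of the class
        for i, value in enumerate(row):
            count = model["feature_counts"][(i, label)][value]
            n_values = len(model["values"][i])
            # Laplace estimator: add a small count so no likelihood is zero.
            score *= (count + model["laplace"]) / (n_label + model["laplace"] * n_values)
        scores[label] = score
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}
```

Without the Laplace term, a feature value never seen with a class would force that class's posterior to zero regardless of the other features.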

2. Understanding decision trees

Divide and conquer
The C5.0 decision tree algorithm
Choosing the best split
Pruning the decision tree
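
To make "choosing the best split" concrete, here is a small sketch that scores a candidate split by information gain. It assumes entropy is the purity measure; the function names are illustrative, and this is not a reimplementation of C5.0 itself.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(labels, feature_values):
    """Entropy reduction obtained by partitioning on a categorical feature."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the partitions produced by the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder
```

A divide-and-conquer tree learner would evaluate every candidate feature this way, split on the one with the highest gain, and recurse on each partition.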

3. Understanding neural networks

From biological to artificial neurons
Activation functions
Network topology
The number of layers
The direction of information travel
The number of nodes in each layer
Training neural networks with backpropagation
Deep Learning

4. Understanding Support Vector Machines

Classification with hyperplanes
Finding the maximum margin
The case of linearly separable data
The case of non-linearly separable data
Using kernels for non-linear spaces

5. Understanding clustering

Clustering as a machine learning task
The k-means algorithm for clustering
Using distance to assign and update clusters
Choosing the appropriate number of clusters
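
The assign-and-update loop behind k-means can be sketched in a few lines. This is a minimal version under stated assumptions: Euclidean distance, caller-supplied initial centroids (so the result is deterministic), and a hypothetical function name kmeans. It requires Python 3.8 or later for math.dist.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points and updating centroids."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda j: math.dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j, members in enumerate(clusters):
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                new_centroids.append(centroids[j])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # converged: assignments can no longer change
        centroids = new_centroids
    return centroids, clusters
```

Running this for several values of k and comparing the within-cluster distances is one common way to approach the question of how many clusters to use.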

6. Measuring performance for classification

Working with classification prediction data
A closer look at confusion matrices
Using confusion matrices to measure performance
Beyond accuracy – other measures of performance
The kappa statistic
Sensitivity and specificity
Precision and recall
The F-measure
Visualizing performance tradeoffs
ROC curves
Estimating future performance
The holdout method
Cross-validation
Bootstrap sampling
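
The measures listed above can all be derived from the four cells of a two-class confusion matrix. The sketch below assumes a binary problem and non-zero denominators; the function name and the returned dictionary keys are chosen for this example only.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute common performance measures from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)  # true positive rate, identical to recall
    specificity = tn / (tn + fp)  # true negative rate
    precision = tp / (tp + fp)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    # Kappa adjusts accuracy for the agreement expected by chance alone.
    expected = ((tp + fp) / total) * ((tp + fn) / total) \
        + ((fn + tn) / total) * ((fp + tn) / total)
    kappa = (accuracy - expected) / (1 - expected)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "recall": sensitivity,
        "f_measure": f_measure,
        "kappa": kappa,
    }
```

On imbalanced data, accuracy can look high while kappa and the F-measure stay low, which is why the outline looks beyond accuracy.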

7. Tuning stock models for better performance

Using caret for automated parameter tuning
Creating a simple tuned model
Customizing the tuning process
Improving model performance with meta-learning
Understanding ensembles
Bagging
Boosting
Random forests
Training random forests
Evaluating random forest performance

MINOR TOPICS

8. Understanding classification using the nearest neighbors

The kNN algorithm
Calculating distance
Choosing an appropriate k
Preparing data for use with kNN
Why is the kNN algorithm lazy?
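
A minimal kNN classifier shows why the algorithm is called lazy: there is no training step, and all the work happens at prediction time. The sketch assumes numeric features that are already on comparable scales, Euclidean distance, and majority voting; knn_predict is a hypothetical name. It requires Python 3.8 or later for math.dist.

```python
import math
from collections import Counter


def knn_predict(train_rows, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training rows."""
    # Sort the stored examples by distance to the query and keep the k closest.
    neighbors = sorted(zip(train_rows, train_labels),
                       key=lambda pair: math.dist(pair[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]
```

Because distance drives the vote, features measured on larger scales dominate unless the data is rescaled first, which is the usual reason for preparing data before using kNN.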

9. Understanding classification rules

Separate and conquer
The One Rule algorithm
The RIPPER algorithm
Rules from decision trees

10. Understanding regression

Simple linear regression
Ordinary least squares estimation
Correlations
Multiple linear regression
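
For simple linear regression, the ordinary least squares estimates have a closed form: the slope is the covariance of x and y divided by the variance of x. The sketch below assumes a single predictor with non-constant x and y; the function name ols_fit is illustrative.

```python
import math


def ols_fit(x, y):
    """Return (slope, intercept, correlation) for a single predictor."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products around the means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    correlation = sxy / math.sqrt(sxx * syy)  # Pearson's r
    return slope, intercept, correlation
```

Multiple linear regression generalizes the same least squares idea to several predictors, usually solved with matrix algebra rather than these scalar formulas.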

11. Understanding regression trees and model trees

Adding regression to trees

12. Understanding association rules

The Apriori algorithm for association rule learning
Measuring rule interest – support and confidence
Building a set of rules with the Apriori principle
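
Support and confidence, the two rule-interest measures named above, are simple ratios over a list of transactions. This sketch assumes each transaction is a set of items; the function names are chosen for the example, and it does not implement the Apriori search itself.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= set(t) for t in transactions) / len(transactions)


def confidence(transactions, lhs, rhs):
    """Support of lhs and rhs together, relative to the support of lhs."""
    return support(transactions, set(lhs) | set(rhs)) / support(transactions, lhs)
```

The Apriori principle prunes the search by noting that an itemset can only be frequent if all of its subsets are frequent, so low-support items never need to be combined.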

Extras

Spark/PySpark/MLlib and Multi-armed bandits
