Course Code: bespokedatascience
Duration: 84 hours
Course Outline:

Python Programming (Audience Programmers)

  •  Python programming concepts such as data & file operations in Python,
  • Object-oriented concepts in Python
  • Various Python libraries such as Pandas, Numpy, Matplotlib, Seaborn and so on
  • Real Time Scenarios where python can be used for data science
  • Exploratory Data Analysis

 Data Engineering (Audience Data Stewards, ETL and BI Engineers)

  • Basic Language Requirements
  • In-Depth Database Knowledge
  • Data Warehousing/Big Data Skills
  • Hadoop and MapReduce
  • Hive, Mongo DB and PIG
  • Apache Spark
  •  ELK
  • Courses with a mixture of the above frameworks
  • Kafka
  • Talend 

Advance AI and ML Training (Audience to be decided depending on the performance of nominated people for previous training)

  • Intro to Machine Learning and AI
  • Linear Regression, Model Evaluation, Logistic Regression
  • Ensemble Technique Decision Trees, Random Forest, Ensemble Methods - Bagging, Boosting and Stacking
  • Feature Engineering, Pipelining, Sampling, Model Performance Measures
  • K-Means Clustering, Hierarchical Clustering
  • Live Use Cases