Course Code:
bespokedatascience
Duration:
84 hours
Course Outline:
Python Programming (Audience Programmers)
- Python programming concepts such as data & file operations in Python,
- Object-oriented concepts in Python
- Various Python libraries such as Pandas, Numpy, Matplotlib, Seaborn and so on
- Real Time Scenarios where python can be used for data science
- Exploratory Data Analysis
Data Engineering (Audience Data Stewards, ETL and BI Engineers)
- Basic Language Requirements
- In-Depth Database Knowledge
- Data Warehousing/Big Data Skills
- Hadoop and MapReduce
- Hive, Mongo DB and PIG
- Apache Spark
- ELK
- Courses with a mixture of the above frameworks
- Kafka
- Talend
Advance AI and ML Training (Audience to be decided depending on the performance of nominated people for previous training)
- Intro to Machine Learning and AI
- Linear Regression, Model Evaluation, Logistic Regression
- Ensemble Technique Decision Trees, Random Forest, Ensemble Methods - Bagging, Boosting and Stacking
- Feature Engineering, Pipelining, Sampling, Model Performance Measures
- K-Means Clustering, Hierarchical Clustering
- Live Use Cases