Course Code:
bspkpyth
Duration:
42 hours
Prerequisites:
- An understanding of ___.
- Experience with ___.
- ___ programming experience.
Audience
- ___
- ___
- ___
Course Outline:
Programming in Python
- Running Python
- Algorithmic thinking
- Getting help
- Debugging
Features of the Python language
- Variables, expressions and statements
- Functions
- Conditional Execution and recursion
Data Structures
- Strings
- Lists
- Dictionaries
- Tuples
Object-oriented programming
- Types
- Mutability
- Attributes
- Classes
- Methods
Importing data
- Pandas and numpy
- Reading in data from files and the web
- Reading in financial data from online sources
Data wrangling
- Numpy arrays
- Common matrix operations
- Optimizing performance by avoiding loops
- Sorting
- Filtering
- Aggregating
- Working with time series data
- Imputation of missing data
Exploratory data analysis
- Cross-tabs
- Graphics in matplotlib and seaborn
Model development and evaluation
- Scikit-learn for modelling
- Regression models
- Classification techniques (logistic regression, random forests)
- Statsmodel
- Principal component analysis
Case study: US housing data
- End-to-end analysis of a real dataset
- Read data
- Impute missing values
- Set up a data analytics pipeline
- Make suitable visualizations
- Compare different predictive models