Python: Automate the Boring Stuff - dedicated  ( pythonautomationded | 14 hours )

Prerequisites:
  • No programming experience is required.
  • Basic computer skills: surfing websites, running programs, saving and opening documents, etc.
Overview:

This instructor-led training is based on the popular book, "Automate the Boring Stuff with Python", by Al Sweigart. It is aimed at beginners and covers essential Python programming concepts through practical, hands-on exercises and discussions. The focus is on learning to write code to dramatically increase office productivity.

By the end of this training, participants will know how to program in Python and apply this new skill for:

  • Automating tasks by writing simple Python programs.
  • Writing programs that can do text pattern recognition with "regular expressions".
  • Programmatically generating and updating Excel spreadsheets.
  • Parsing PDFs and Word documents.
  • Crawling web sites and pulling information from online sources.
  • Writing programs that send out email notifications.
  • Use Python's debugging tools to quickly resolve bugs.
  • Programmatically controlling the mouse and keyboard to click and type for you.

Audience

  • Non-programmers wishing to learn programming with Python
  • Professionals and company teams wishing to optimize their office productivity
  • Managers wishing to automate tedious processes and workflows

Format of the course

  • Part lecture, part discussion, exercises and heavy hands-on practice
Course Outline:

Introduction to programming in Python:

  1. language overview
  2. basic operations
  3. data types
  4. conditional statements, loops and data stuctures
  5. modules and packages
  6. functions and classes
  7. exceptions
  8. best practices

Introduction to various approaches for Extract-Transform-Load operations – overview of selected tools:

  1. Python standard library for ETL operations

Python “non-standard” libraries for ETL:

  1. Blaze
  2. Dask
  3. Bonobo
  4. Real-time Streaming ETL with Structured Streaming in Apache Spark
  5. Introduction to Pandas - Python data analysis library

Focusing on Bonobo - Python ETL Framework

  1. Building custom ETL frameworks using Bonobo

Integrating Bonobo with:

  1. SQLAlchemy - accessing PostgreSQL and MSSQL databases
  2. Pandas
  3. Building automated pipelines
  4. Visualising Bonobo pipelines and collected data
  5. Business cases