Course Code: dask
Duration: 14 hours
Prerequisites:
  • Experience with data analysis
  • Python programming experience

Audience

  • Data scientists
  • Software engineers
Overview:

Dask is a flexible and high-performance Python library for parallel computing. It scales and accelerates big data processing with other Python-based data science libraries, such as Pandas, Numpy, and Scikit-Learn.

This instructor-led, live training (online or onsite) is aimed at data scientists and software engineers who wish to use Dask with the Python ecosystem to build, scale, and analyze large datasets.

By the end of this training, participants will be able to:

  • Set up the environment to start building big data processing with Dask and Python.
  • Explore the features, libraries, tools, and APIs available in Dask.
  • Understand how Dask accelerates parallel computing in Python.
  • Learn how to scale the Python ecosystem (Numpy, SciPy, and Pandas) using Dask.
  • Optimize the Dask environment to maintain high performance in handling large datasets.

Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • To request a customized training for this course, please contact us to arrange.
Course Outline:

Introduction

  • Overview of Dask features and advantages
  • Parallel computing in Python

Getting Started

  • Installing Dask
  • Dask libraries, components, and APIs
  • Best practices and tips

Scaling NumPy, SciPy, and Pandas

  • Dask arrays examples and use cases
  • Chunks and blocked algorithms
  • Overlapping computations
  • SciPy stats and LinearOperator
  • Numpy slicing and assignment
  • DataFrames and Pandas

Dask Internals and Graphical UI

  • Supported interfaces
  • Scheduler and diagnostics
  • Analyzing performance
  • Graph computation

Optimizing and Deploying Dask

  • Setting up adaptive deployments
  • Connecting to remote data
  • Debugging parallel programs
  • Deploying Dask clusters
  • Working with GPUs
  • Deploying Dask on cloud environments

Troubleshooting

Summary and Next Steps

Sites Published:

United Arab Emirates - Scaling Data Analysis with Python and Dask

Qatar - Scaling Data Analysis with Python and Dask

Egypt - Scaling Data Analysis with Python and Dask

Saudi Arabia - Scaling Data Analysis with Python and Dask

South Africa - Scaling Data Analysis with Python and Dask

Brasil - Scaling Data Analysis with Python and Dask

Canada - Scaling Data Analysis with Python and Dask

中国 - Scaling Data Analysis with Python and Dask

香港 - Scaling Data Analysis with Python and Dask

澳門 - Scaling Data Analysis with Python and Dask

台灣 - Scaling Data Analysis with Python and Dask

USA - Scaling Data Analysis with Python and Dask

Österreich - Scaling Data Analysis with Python and Dask

Schweiz - Scaling Data Analysis with Python and Dask

Deutschland - Scaling Data Analysis with Python and Dask

Czech Republic - Scaling Data Analysis with Python and Dask

Denmark - Scaling Data Analysis with Python and Dask

Estonia - Scaling Data Analysis with Python and Dask

Finland - Scaling Data Analysis with Python and Dask

Greece - Scaling Data Analysis with Python and Dask

Magyarország - Scaling Data Analysis with Python and Dask

Ireland - Scaling Data Analysis with Python and Dask

Luxembourg - Scaling Data Analysis with Python and Dask

Latvia - Scaling Data Analysis with Python and Dask

España - Scaling Data Analysis with Python and Dask

Italia - Scaling Data Analysis with Python and Dask

Lithuania - Scaling Data Analysis with Python and Dask

Nederland - Scaling Data Analysis with Python and Dask

Norway - Scaling Data Analysis with Python and Dask

Portugal - Scaling Data Analysis with Python and Dask

România - Scaling Data Analysis with Python and Dask

Sverige - Scaling Data Analysis with Python and Dask

Türkiye - Scaling Data Analysis with Python and Dask

Malta - Scaling Data Analysis with Python and Dask

Belgique - Scaling Data Analysis with Python and Dask

France - Scaling Data Analysis with Python and Dask

日本 - Scaling Data Analysis with Python and Dask

Australia - Scaling Data Analysis with Python and Dask

Malaysia - Scaling Data Analysis with Python and Dask

New Zealand - Scaling Data Analysis with Python and Dask

Philippines - Scaling Data Analysis with Python and Dask

Singapore - Scaling Data Analysis with Python and Dask

Thailand - Scaling Data Analysis with Python and Dask

Vietnam - Scaling Data Analysis with Python and Dask

India - Scaling Data Analysis with Python and Dask

Argentina - Scaling Data Analysis with Python and Dask

Chile - Scaling Data Analysis with Python and Dask

Costa Rica - Scaling Data Analysis with Python and Dask

Ecuador - Scaling Data Analysis with Python and Dask

Guatemala - Scaling Data Analysis with Python and Dask

Colombia - Scaling Data Analysis with Python and Dask

México - Scaling Data Analysis with Python and Dask

Panama - Scaling Data Analysis with Python and Dask

Peru - Scaling Data Analysis with Python and Dask

Uruguay - Scaling Data Analysis with Python and Dask

Venezuela - Scaling Data Analysis with Python and Dask

Polska - Scaling Data Analysis with Python and Dask

United Kingdom - Scaling Data Analysis with Python and Dask

South Korea - Scaling Data Analysis with Python and Dask

Pakistan - Scaling Data Analysis with Python and Dask

Sri Lanka - Scaling Data Analysis with Python and Dask

Bulgaria - Scaling Data Analysis with Python and Dask

Bolivia - Scaling Data Analysis with Python and Dask

Indonesia - Scaling Data Analysis with Python and Dask

Kazakhstan - Scaling Data Analysis with Python and Dask

Moldova - Scaling Data Analysis with Python and Dask

Morocco - Scaling Data Analysis with Python and Dask

Tunisia - Scaling Data Analysis with Python and Dask

Kuwait - Scaling Data Analysis with Python and Dask

Oman - Scaling Data Analysis with Python and Dask

Slovakia - Scaling Data Analysis with Python and Dask

Kenya - Scaling Data Analysis with Python and Dask

Nigeria - Scaling Data Analysis with Python and Dask

Botswana - Scaling Data Analysis with Python and Dask

Slovenia - Scaling Data Analysis with Python and Dask

Croatia - Scaling Data Analysis with Python and Dask

Serbia - Scaling Data Analysis with Python and Dask

Bhutan - Scaling Data Analysis with Python and Dask

Nepal - Scaling Data Analysis with Python and Dask

Uzbekistan - Scaling Data Analysis with Python and Dask