Course Code: smackstack
Duration: 14 hours
Prerequisites:
  • An understanding of data processing systems

Audience

  • Data Scientists
Overview:

SMACK is a collection of data platform softwares, namely Apache Spark, Apache Mesos, Apache Akka, Apache Cassandra, and Apache Kafka. Using the SMACK stack, users can create and scale data processing platforms.

This instructor-led, live training (online or onsite) is aimed at data scientists who wish to use the SMACK stack to build data processing platforms for big data solutions.

By the end of this training, participants will be able to:

  • Implement a data pipeline architecture for processing big data.
  • Develop a cluster infrastructure with Apache Mesos and Docker.
  • Analyze data with Spark and Scala.
  • Manage unstructured data with Apache Cassandra.

Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • To request a customized training for this course, please contact us to arrange.
Course Outline:

Introduction

SMACK Stack Overview

  • What is Apache Spark? Apache Spark features
  • What is Apache Mesos? Apache Mesos features
  • What is Apache Akka? Apache Akka features
  • What is Apache Cassandra? Apache Cassandra features
  • What is Apache Kafka? Apache Kafka features

Scala Language

  • Scala syntax and structure
  • Scala control flow

Preparing the Development Environment

  • Installing and configuring the SMACK stack
  • Installing and configuring Docker

Apache Akka

  • Using actors

Apache Cassandra

  • Creating a database for read operations
  • Working with backups and recovery

Connectors

  • Creating a stream
  • Building an Akka application
  • Storing data with Cassandra
  • Reviewing connectors

Apache Kafka

  • Working with clusters
  • Creating, publishing, and consuming messages

Apache Mesos

  • Allocating resources
  • Running clusters
  • Working with Apache Aurora and Docker
  • Running services and jobs
  • Deploying Spark, Cassandra, and Kafka on Mesos

Apache Spark

  • Managing data flows
  • Working with RDDs and dataframes
  • Performing data analysis

Troubleshooting

  • Handling failure of services and errors

Summary and Conclusion

Sites Published:

United Arab Emirates - SMACK Stack for Data Science

Qatar - SMACK Stack for Data Science

Egypt - SMACK Stack for Data Science

Saudi Arabia - SMACK Stack for Data Science

South Africa - SMACK Stack for Data Science

Brasil - SMACK Stack for Data Science

Canada - SMACK Stack for Data Science

中国 - SMACK Stack for Data Science

香港 - SMACK Stack for Data Science

澳門 - SMACK Stack for Data Science

台灣 - SMACK Stack for Data Science

USA - SMACK Stack for Data Science

Österreich - SMACK Stack for Data Science

Schweiz - SMACK Stack for Data Science

Deutschland - SMACK Stack for Data Science

Czech Republic - SMACK Stack for Data Science

Denmark - SMACK Stack for Data Science

Estonia - SMACK Stack for Data Science

Finland - SMACK Stack for Data Science

Greece - SMACK Stack for Data Science

Magyarország - SMACK Stack for Data Science

Ireland - SMACK Stack for Data Science

Luxembourg - SMACK Stack for Data Science

Latvia - SMACK Stack for Data Science

España - SMACK Stack for Data Science

Italia - SMACK Stack for Data Science

Lithuania - SMACK Stack for Data Science

Nederland - SMACK Stack for Data Science

Norway - SMACK Stack for Data Science

Portugal - SMACK Stack for Data Science

România - SMACK Stack for Data Science

Sverige - SMACK Stack for Data Science

Türkiye - SMACK Stack for Data Science

Malta - SMACK Stack for Data Science

Belgique - SMACK Stack for Data Science

France - SMACK Stack for Data Science

日本 - SMACK Stack for Data Science

Australia - SMACK Stack for Data Science

Malaysia - SMACK Stack for Data Science

New Zealand - SMACK Stack for Data Science

Philippines - SMACK Stack for Data Science

Singapore - SMACK Stack for Data Science

Thailand - SMACK Stack for Data Science

Vietnam - SMACK Stack for Data Science

India - SMACK Stack for Data Science

Argentina - SMACK Stack for Data Science

Chile - SMACK Stack for Data Science

Costa Rica - SMACK Stack for Data Science

Ecuador - SMACK Stack for Data Science

Guatemala - SMACK Stack for Data Science

Colombia - SMACK Stack for Data Science

México - SMACK Stack for Data Science

Panama - SMACK Stack for Data Science

Peru - SMACK Stack for Data Science

Uruguay - SMACK Stack for Data Science

Venezuela - SMACK Stack for Data Science

Polska - SMACK Stack for Data Science

United Kingdom - SMACK Stack for Data Science

South Korea - SMACK Stack for Data Science

Pakistan - SMACK Stack for Data Science

Sri Lanka - SMACK Stack for Data Science

Bulgaria - SMACK Stack for Data Science

Bolivia - SMACK Stack for Data Science

Indonesia - SMACK Stack for Data Science

Kazakhstan - SMACK Stack for Data Science

Moldova - SMACK Stack for Data Science

Morocco - SMACK Stack for Data Science

Tunisia - SMACK Stack for Data Science

Kuwait - SMACK Stack for Data Science

Oman - SMACK Stack for Data Science

Slovakia - SMACK Stack for Data Science

Kenya - SMACK Stack for Data Science

Nigeria - SMACK Stack for Data Science

Botswana - SMACK Stack for Data Science

Slovenia - SMACK Stack for Data Science

Croatia - SMACK Stack for Data Science

Serbia - SMACK Stack for Data Science

Bhutan - SMACK Stack for Data Science

Nepal - SMACK Stack for Data Science

Uzbekistan - SMACK Stack for Data Science