Course Code: pythonwebscraping
Duration: 7 hours
Prerequisites:
  • Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include more introductory Python exercises.

Audience

  • Developers
Overview:

Web Scraping is a technique for extracting data from a website then saving it to local file or database.

This instructor-led, live training (online or onsite) is aimed at developers who wish to use Python to automate the process of crawling many websites to extract data for processing and analysis.

By the end of this training, participants will be able to:

  • Install and configure Python and all relevant packages.
  • Retrieve and parse data stored across many websites.
  • Understand how websites work and how their HTML is structured.
  • Construct spiders to crawl the web at scale.
  • Use Selenium to crawl AJAX-driven web pages.

Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • This course assumes knowledge of programming.
  • To request a customized training for this course, please contact us to arrange.
Course Outline:

Introduction

Setting up the Development Environment

Python Primer: Data Structures, Conditionals, File Handling, etc.

Python Packages for Web Scraping: Scrapy and BeautifulSoup

How a Website Works

How HTML is Structured

Making a Web Request

Scraping an HTML Page

Working with XPath and CSS

Filtering Data Using Regular Expressions

Creating a Web Crawler

Crawling AJAX and JavaScript Pages with Selenium.

Web Scraping Best Practices

Troubleshooting

Summary and Conclusion

Sites Published:

United Arab Emirates - Web Scraping with Python

Qatar - Web Scraping with Python

Egypt - Web Scraping with Python

Saudi Arabia - Web Scraping with Python

South Africa - Web Scraping with Python

Brasil - Web Scraping with Python

Canada - Web Scraping with Python

中国 - Web Scraping with Python

香港 - Web Scraping with Python

澳門 - Web Scraping with Python

台灣 - Web Scraping with Python

USA - Web Scraping with Python

Österreich - Web Scraping with Python

Schweiz - Web Scraping with Python

Deutschland - Web Scraping with Python

Czech Republic - Web Scraping with Python

Denmark - Web Scraping with Python

Estonia - Web Scraping with Python

Finland - Web Scraping with Python

Greece - Web Scraping with Python

Magyarország - Web Scraping with Python

Ireland - Web Scraping with Python

Luxembourg - Web Scraping with Python

Latvia - Web Scraping with Python

España - Web Scraping with Python

Italia - Web Scraping with Python

Lithuania - Web Scraping with Python

Nederland - Web Scraping with Python

Norway - Web Scraping with Python

Portugal - Web Scraping with Python

România - Web Scraping with Python

Sverige - Web Scraping with Python

Türkiye - Web Scraping with Python

Malta - Web Scraping with Python

Belgique - Web Scraping with Python

France - Web Scraping with Python

日本 - Web Scraping with Python

Australia - Web Scraping with Python

Malaysia - Web Scraping with Python

New Zealand - Web Scraping with Python

Philippines - Web Scraping with Python

Singapore - Web Scraping with Python

Thailand - Web Scraping with Python

Vietnam - Web Scraping with Python

India - Web Scraping with Python

Argentina - Web Scraping with Python

Chile - Web Scraping with Python

Costa Rica - Web Scraping with Python

Ecuador - Web Scraping with Python

Guatemala - Web Scraping with Python

Colombia - Web Scraping with Python

México - Web Scraping with Python

Panama - Web Scraping with Python

Peru - Web Scraping with Python

Uruguay - Web Scraping with Python

Venezuela - Web Scraping with Python

Polska - Web Scraping with Python

United Kingdom - Web Scraping with Python

South Korea - Web Scraping with Python

Pakistan - Web Scraping with Python

Sri Lanka - Web Scraping with Python

Bulgaria - Web Scraping with Python

Bolivia - Web Scraping with Python

Indonesia - Web Scraping with Python

Kazakhstan - Web Scraping with Python

Moldova - Web Scraping with Python

Morocco - Web Scraping with Python

Tunisia - Web Scraping with Python

Kuwait - Web Scraping with Python

Oman - Web Scraping with Python

Slovakia - Web Scraping with Python

Kenya - Web Scraping with Python

Nigeria - Web Scraping with Python

Botswana - Web Scraping with Python

Slovenia - Web Scraping with Python

Croatia - Web Scraping with Python

Serbia - Web Scraping with Python

Bhutan - Web Scraping with Python

Nepal - Web Scraping with Python

Uzbekistan - Web Scraping with Python