Web scraping with Playwright

Our Python Playwright course equips learners with the skills to efficiently extract data from websites. By mastering Playwright's browser automation capabilities, participants will be able to navigate complex web pages, handle dynamic content, and extract valuable information for analysis or integration into other applications. This course is ideal for those seeking to automate data collection tasks and gain insights from the vast amount of online information.

Last Updated Oct 2024

  • Understand the fundamentals of web scraping using Python Playwright
  • Master the process of launching browsers, navigating web pages, and interacting with elements
  • Efficiently extract data from HTML elements, including text, attributes, and structured data
  • Handle dynamic content and AJAX requests using Playwright's built-in mechanisms
  • Implement strategies to overcome common challenges like captchas and bot detection

Dates: 01 Dec 2024 to 10 Jan 2025

Time: Every Tuesday and Thursday, 7pm to 8pm

Mode: 01 Dec 2024 to 10 Jan 2025

Lesson 1: Understanding Web Pages
  • Learn about HTML, CSS, and JavaScript and how they interact to form a website
  • Basics of HTML and CSS code
  • Inspecting HTML elements using browser developer tools
  • Identifying target elements for scraping
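As a rough illustration of what "identifying target elements" means in practice, the sketch below pulls text out of elements by their CSS class. It uses Python's standard-library `html.parser` rather than Playwright, and the HTML snippet, the class names (`name`, `price`), and the helper `collect_text_by_class` are all made up for this example.

```python
from html.parser import HTMLParser

# Hypothetical markup, similar to what browser developer tools would show.
SAMPLE_HTML = """
<ul id="products">
  <li class="item"><span class="name">Widget</span> <span class="price">$5</span></li>
  <li class="item"><span class="name">Gadget</span> <span class="price">$8</span></li>
</ul>
"""


class ClassTextCollector(HTMLParser):
    """Collect the text of every element that carries a given CSS class.

    Simplification: capturing stops at the first closing tag, so this only
    works for target elements that contain plain text and no nested tags.
    """

    def __init__(self, target_class):
        super().__init__()
        self.target_class = target_class
        self._capturing = False
        self.texts = []

    def handle_starttag(self, tag, attrs):
        # The class attribute may hold several space-separated class names.
        classes = (dict(attrs).get("class") or "").split()
        if self.target_class in classes:
            self._capturing = True
            self.texts.append("")

    def handle_endtag(self, tag):
        self._capturing = False

    def handle_data(self, data):
        if self._capturing:
            self.texts[-1] += data


def collect_text_by_class(html, target_class):
    """Return the stripped text of each element with the given class."""
    parser = ClassTextCollector(target_class)
    parser.feed(html)
    return [text.strip() for text in parser.texts]
```

Calling `collect_text_by_class(SAMPLE_HTML, "price")` targets the two price spans; the same class name would serve as the selector `.price` in a scraping tool.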
Lesson 1.5: Practices
Lesson 2: Introduction to Playwright
  • Usage of Playwright, installation and setup
  • Comparing Playwright with other scraping packages
  • Learning about Chromium and headless mode
  • Launching a browser instance and navigating to a web page
  • Locating elements using selectors
  • Interacting with elements (clicking, typing, submitting forms)
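The topics in this lesson can be sketched with Playwright's synchronous Python API, assuming the package is installed (`pip install playwright`, then `playwright install chromium`). The URL, the two selectors, and the function name `submit_search` are placeholders chosen for illustration, not part of the course material.

```python
def submit_search(page, url, query):
    """Navigate to a page, type into a search box, and submit the form.

    `page` is expected to be a Playwright Page. The selectors below are
    hypothetical and must be adapted to the real site being scraped.
    """
    page.goto(url)
    page.locator("input[name='q']").fill(query)    # typing into a field
    page.locator("button[type='submit']").click()  # clicking to submit
    return page.title()


def main():
    # Imported here so the helper above can be read and tested
    # without Playwright being installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        # headless=True runs Chromium without opening a visible window.
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        try:
            # Placeholder URL: replace with a page that has a search form.
            print(submit_search(page, "https://example.com/search", "playwright"))
        finally:
            browser.close()
```

Running `main()` launches the browser; switching to `headless=False` can help while debugging, since each step then becomes visible on screen.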
Lesson 2.5: Practices
Lesson 3: Extracting data and handling dynamic content
  • Extracting data from HTML elements, including text, attributes, and structured data
  • Handling dynamic content and AJAX requests using Playwright's built-in mechanisms
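For the lesson theme of extracting data and handling dynamic content, a minimal sketch might look like the following. It assumes a page that renders a list of `li.item` elements via JavaScript, each containing exactly one link; those selectors and the function name `scrape_items` are hypothetical.

```python
def scrape_items(page, url):
    """Return the link text and href of each list item on a dynamic page.

    `page` is expected to be a Playwright Page (sync API). The selectors
    are assumptions about the target site's markup.
    """
    page.goto(url)
    # Dynamic content: wait until at least one item has been rendered
    # by the page's scripts before reading anything.
    page.wait_for_selector("li.item")

    rows = page.locator("li.item")
    results = []
    for i in range(rows.count()):
        link = rows.nth(i).locator("a")  # assumes one link per item
        results.append(
            {
                "title": link.inner_text().strip(),    # visible text
                "href": link.get_attribute("href"),    # attribute value
            }
        )
    return results
```

The `page` argument would typically come from `browser.new_page()` after launching a browser through `sync_playwright()`. Returning a list of dictionaries keeps the result easy to write out as CSV or JSON afterwards.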
Lesson 3.5: Practices
Lesson 4: Dealing with Captchas and Bot Detection
  • Understanding common captcha types
  • Approaches to handling captchas in Playwright scripts
  • Implementing workarounds for bot detection mechanisms
Lesson 4.5: Practices
Lesson 5: Capstone project

Prerequisites:
  • Basic knowledge of Python

Price: $500 nett

Classes: 10

Assignment: Yes

Language: English

Duration: 5 weeks

Students: 4 to 5

Level: Intermediate