Web Scraping with Python Training Course
Web scraping is a method used to extract data from websites and save it to a local file or database.
This instructor-led, live training (available online or onsite) is designed for developers who want to use Python to automate the process of crawling multiple websites to extract data for processing and analysis.
By the end of this training, participants will be able to:
- Install and configure Python along with all necessary packages.
- Retrieve and parse data stored across numerous websites.
- Understand how websites function and how their HTML is structured.
- Build spiders to crawl the web at scale.
- Use Selenium to crawl AJAX-driven web pages.
Course Format
- Interactive lectures and discussions.
- Numerous exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- This course assumes prior knowledge of programming.
- To request a customized training for this course, please contact us to make arrangements.
Course Outline
Introduction
Setting up the Development Environment
Python Primer: Data Structures, Conditionals, File Handling, etc.
Python Packages for Web Scraping: Scrapy and BeautifulSoup
How a Website Works
How HTML is Structured
Making a Web Request
Scraping an HTML Page
Working with XPath and CSS
Filtering Data Using Regular Expressions
Creating a Web Crawler
Crawling AJAX and JavaScript Pages with Selenium
Web Scraping Best Practices
Troubleshooting
Summary and Conclusion
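As a rough illustration of the "Scraping an HTML Page" and "Filtering Data Using Regular Expressions" topics in the outline above, the sketch below pulls links and prices out of a small HTML snippet. It is only a minimal example under stated assumptions: it uses Python's standard library (html.parser and re) rather than the Scrapy and BeautifulSoup packages named in the outline, and the sample markup, the LinkCollector class, and the extract_links and extract_prices helpers are hypothetical names invented for this example, not course material.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample page; a real scraper would download this markup first.
SAMPLE_HTML = """
<html><body>
  <h1>Price list</h1>
  <ul>
    <li><a href="/items/1">Widget</a> costs $4.50</li>
    <li><a href="/items/2">Gadget</a> costs $12.00</li>
    <li><a href="https://example.com/about">About us</a></li>
  </ul>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collect (href, link text) pairs from anchor tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the anchor currently being read
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an anchor with an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return every (href, text) pair found in the given HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


def extract_prices(html):
    """Return dollar amounts such as $4.50 as floats, using a regular expression."""
    return [float(match) for match in re.findall(r"\$(\d+\.\d{2})", html)]


if __name__ == "__main__":
    print(extract_links(SAMPLE_HTML))
    print(extract_prices(SAMPLE_HTML))
```

In practice the HTML would come from a web request instead of a string constant, and a library such as BeautifulSoup or Scrapy would typically replace the hand-written parser class.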
Requirements
- Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include more introductory Python exercises.
Audience
- Developers
Open Training Courses require 5+ participants.
Testimonials
Many different examples and topics have been covered, from basic investigation to login management and dynamic page management.
Daniele Tagliaferro - Creditsafe Italia Srl
Course - Web Scraping with Python
Related Courses
Advanced Python: Best Practices and Design Patterns
28 Hours
This intensive, practical course explores advanced Python techniques, engineering best practices, and widely adopted design patterns to help you build maintainable, testable, and high-performance Python applications. The curriculum highlights modern tooling, type hinting, concurrency models, architectural patterns, and deployment-ready workflows.
This instructor-led, live training (available online or onsite) is designed for intermediate to advanced Python developers aiming to adopt professional practices and patterns for production-grade Python systems.
Upon completion of this training, participants will be equipped to:
- Apply Python typing, dataclasses, and type-checking to enhance code reliability.
- Utilize design patterns and architectural principles to structure robust applications.
- Correctly implement concurrency and parallelism using asyncio and multiprocessing.
- Develop well-tested code through pytest, property-based testing, and CI pipelines.
- Profile, optimize, and harden Python applications for production environments.
- Package, distribute, and deploy Python projects using modern tools and containers.
Course Format
- Interactive lectures and concise demonstrations.
- Hands-on labs and coding exercises every day.
- A capstone mini-project that integrates patterns, testing, and deployment.
Course Customization Options
- To request customized training or focus areas (data, web, or infrastructure), please contact us to arrange it.
Agentic AI Engineering with Python — Build Autonomous Agents
21 Hours
This course provides practical engineering techniques for designing, building, testing, and deploying agentic (autonomous) systems using Python. It covers the agent loop, tool integrations, memory and state management, orchestration patterns, safety controls, and production considerations.
This instructor-led, live training (available online or onsite) is aimed at intermediate-level to advanced-level ML engineers, AI developers, and software engineers who wish to build robust, production-ready autonomous agents using Python.
By the end of this training, participants will be able to:
- Design and implement the agent loop and decision-making workflows.
- Integrate external tools and APIs to extend agent capabilities.
- Implement short-term and long-term memory architectures for agents.
- Coordinate multi-step orchestrations and agent composability.
- Apply safety, access control, and observability best practices for deployed agents.
Format of the Course
- Interactive lecture and discussion.
- Hands-on labs building agents with Python and popular SDKs.
- Project-based exercises that produce deployable prototypes.
Course Customization Options
- To request a customized training for this course, please contact us to make arrangements.
Introduction to Data Science and AI using Python
35 Hours
This five-day course provides a comprehensive introduction to Data Science and Artificial Intelligence (AI).
Learning is reinforced through practical examples and exercises conducted using Python.
Artificial Intelligence with Python (Intermediate Level)
35 Hours
Artificial Intelligence with Python involves creating intelligent systems by leveraging Python’s comprehensive ecosystem of AI and machine learning libraries.
This instructor-led, live training (available online or onsite) is designed for intermediate-level Python programmers who aim to design, implement, and deploy AI solutions using Python.
By the end of this training, participants will be able to:
- Implement AI algorithms using Python’s core AI libraries.
- Work with supervised, unsupervised, and reinforcement learning models.
- Integrate AI solutions into existing applications and workflows.
- Evaluate model performance and optimize for accuracy and efficiency.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to make arrangements.
Algorithmic Trading with Python and R
14 Hours
This instructor-led live training in Brazil (online or onsite) targets business analysts who wish to automate trading using algorithmic trading, Python, and R.
Upon completion of this training, participants will be able to:
- Utilize algorithms to rapidly buy and sell securities at specialized increments.
- Decrease costs associated with trading by leveraging algorithmic trading techniques.
- Automatically monitor stock prices and execute trades.
Applied AI from Scratch in Python
28 Hours
Spanning four days, this course provides an introduction to Artificial Intelligence and its practical application using Python. Upon completing the course, participants have the option to add an extra day to work on a dedicated AI project.
AWS Cloud9 and Python: A Practical Guide
14 Hours
This instructor-led, live training in Brazil (online or onsite) is designed for intermediate-level Python developers seeking to enhance their Python development experience with AWS Cloud9.
Upon completion of this training, participants will be able to:
- Set up and configure AWS Cloud9 for Python development.
- Navigate the AWS Cloud9 IDE interface and utilize its features.
- Write, debug, and deploy Python applications within AWS Cloud9.
- Collaborate effectively with other developers using the AWS Cloud9 platform.
- Integrate AWS Cloud9 with other AWS services for advanced deployment scenarios.
Building Chatbots in Python
21 Hours
Chatbots are computer programs that automatically simulate human responses via chat interfaces. Chatbots help organizations maximize their operational efficiency by providing easier and faster options for user interactions.
In this instructor-led, live training, participants will learn how to build chatbots in Python.
By the end of this training, participants will be able to:
- Understand the fundamentals of building chatbots
- Build, test, deploy, and troubleshoot various chatbots using Python
Audience
- Developers
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- To request a customized training for this course, please contact us to make arrangements.
GPU Programming with CUDA and Python
14 Hours
This instructor-led, live training in Brazil (online or onsite) is designed for intermediate-level developers who wish to utilize CUDA to build Python applications that run in parallel on NVIDIA GPUs.
By the end of this training, participants will be able to:
- Use the Numba compiler to accelerate Python applications running on NVIDIA GPUs.
- Create, compile and launch custom CUDA kernels.
- Manage GPU memory.
- Convert a CPU-based application into a GPU-accelerated application.
Bespoke Applied Artificial Intelligence and LLM Engineering with Python
35 Hours
Course Overview
This practical training program is tailored for data engineering professionals aiming to develop hands-on expertise in artificial intelligence, Python, and large language models. Emphasizing real-world applications, the curriculum covers model utilization, prompt engineering, and the creation of AI-driven solutions. Participants will engage in step-by-step exercises that progress from foundational concepts to the development of deployable AI workflows.
Training Format
• In-person classroom instruction
• Instructor-led sessions featuring guided practice
• Interactive discussions and analysis of real-world case studies
• Daily practical exercises
Course Objectives
• Comprehend fundamental AI and machine learning concepts applicable to contemporary solutions
• Enhance Python proficiency for AI development and data processing
• Gain insight into the mechanics of large language models and effective usage strategies
• Craft and refine prompts to ensure consistent and accurate outputs
• Develop complete AI solutions utilizing APIs and frameworks
• Seamlessly integrate AI capabilities into data engineering pipelines
Scaling Data Analysis with Python and Dask
14 Hours
This instructor-led live training in Brazil (online or on-site) is designed for data scientists and software engineers who aim to utilize Dask with the Python ecosystem to construct, scale, and analyze large datasets.
By the conclusion of this training, participants will be able to:
- Configure the environment to start building big data processing with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Learn how to scale the Python ecosystem (NumPy, SciPy, and Pandas) using Dask.
- Optimize the Dask environment to maintain high performance in handling large datasets.
Data Analysis with Python, Pandas and Numpy
14 Hours
This instructor-led, live training in Brazil (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code.
FARM (FastAPI, React, and MongoDB) Full Stack Development
14 Hours
This instructor-led live training, offered online or onsite, is designed for developers seeking to utilize the FARM stack (FastAPI, React, and MongoDB) to build dynamic, high-performance, and scalable web applications.
By the conclusion of this training, participants will be able to:
- Set up the requisite development environment for FastAPI, React, and MongoDB.
- Understand the fundamental concepts, features, and benefits of the FARM stack.
- Learn to create REST APIs using FastAPI.
- Acquire skills in designing interactive applications with React.
- Develop, test, and deploy applications (both frontend and backend) using the FARM stack.
Developing APIs with Python and FastAPI
14 Hours
This instructor-led live training in Brazil (online or onsite) is designed for developers who want to use FastAPI with Python to build, test, and deploy RESTful APIs more quickly and easily.
By the end of this training, participants will be able to:
- Set up the necessary development environment to develop APIs with Python and FastAPI.
- Create APIs more quickly and easily using the FastAPI library.
- Learn how to create data models and schemas based on Pydantic and OpenAPI.
- Connect APIs to a database using SQLAlchemy.
- Implement security and authentication in APIs using the FastAPI tools.
- Build container images and deploy web APIs to a cloud server.
Accelerating Python Pandas Workflows with Modin
14 Hours
This instructor-led, live training in Brazil (online or onsite) is aimed at data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
By the end of this training, participants will be able to:
- Set up the necessary environment to start developing Pandas workflows at scale with Modin.
- Understand the features, architecture, and advantages of Modin.
- Know the differences between Modin, Dask, and Ray.
- Perform Pandas operations faster with Modin.
- Implement the entire Pandas API and functions.