Skip to content

Commit 1811a9d

Browse files
authored
Updates to lesson (carpentries-incubator#11)
# Updated example on ep2, which was outdated as Carpentries changed its website in 2025 # Improved text on all episodes, checking for grammar and typos # Made a new line for each sentence that ends in a period, to improve version control of content
1 parent a201221 commit 1811a9d

7 files changed

Lines changed: 597 additions & 372 deletions

File tree

episodes/a-real-website.md

Lines changed: 307 additions & 244 deletions
Large diffs are not rendered by default.

episodes/dynamic-websites.md

Lines changed: 130 additions & 45 deletions
Large diffs are not rendered by default.

episodes/fig/inspect_div_class.gif

1.61 MB
Loading

episodes/fig/inspect_workshop.png

327 KB
Loading

episodes/fig/view_page_source.png

-200 KB
Loading

episodes/hello-scraping.md

Lines changed: 151 additions & 81 deletions
Large diffs are not rendered by default.

learners/setup.md

Lines changed: 9 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,16 @@
22
title: Setup
33
---
44

5-
In this workshop you will learn how to extract data from websites, what you'd call web scraping, using Python. In Episode 1 we begin by reviewing the structure of websites in HTML and how to retrieve information from it using your browser and the `BeautifulSoup` package. In Episode 2 we'll dive deep on how to get the HTML behind any website using the `requests` package and how to parse and find information with `BeautifulSoup`. At the end,you’ll learn about the differences between static and dynamic webpages, and how to scrape the latter with the `Selenium` package.
5+
In this workshop, you’ll learn how to extract data from websites using Python — a process known as web scraping.
66

7-
This workshop is designed for participants who already have a basic understanding of Python programming. In particular, it's best to know how to:
7+
Episode 1 begins with an introduction to how websites are structured using HTML.
8+
You’ll learn how to explore this structure using your browser and how to extract information from it using the `BeautifulSoup` package.
9+
10+
In Episode 2, you’ll learn how to retrieve the HTML of a webpage using the `requests` package and continue practicing how to parse and extract specific content with `BeautifulSoup`.
11+
12+
Toward the end of the workshop, in Episode 3, we’ll explore the difference between static and dynamic webpages, and how to scrape dynamic content using `Selenium`.
13+
14+
This workshop is intended for learners who already have a basic understanding of Python. In particular, you should be comfortable with:
815

916
- Install and import packages and modules
1017
- Use lists and dictionaries

0 commit comments

Comments
 (0)