Event box

Web Scraping with Python's Beautiful Soup

Web Scraping with Python's Beautiful Soup Online

This workshop will introduce attendees to techniques for scraping information from the web using Python’s Beautiful Soup (bs4) toolkit. We will begin with a basic overview of the “anatomy” or structure of a webpage. Students will then learn how to write a script for extracting textual data from websites like Reddit and organizing it into spreadsheets. The second half of the workshop will explore how to use Python's Pandas library to clean and analyze your data. In addition to technical skills, students are encouraged to engage with critical questions like: What is web scraping for and what can we, as researchers, learn from publicly available data? What are the potential ethical and legal challenges of data harvesting, and how do we do it responsibly?

Details: This virtual workshop will be recorded. The recording will be posted to the Sherman Centre's Online Learning Catalogue.

Preliminary Work/Prerequisites: A beginner knowledge of Python is necessary for this workshop. Complete the Sherman Centre's asynchronous "Introduction to Python" learning module or attend the session "Introduction to Python" on Wednesday February 14, 2024.

Facilitator Bio: Chelsea Miya is a Postdoctoral Fellow with the Sherman Centre for Digital Scholarship at McMaster University. Her research and teaching interests include critical code studies, nineteenth-century American literature, and the digital humanities. She has held research positions with the SpokenWeb Network, the Kule Research Institute (Kias), and the Canadian Writing Research Collaboratory (CWRC). She co-edited the anthology Right Research: Modelling Sustainable Research Practices in the Anthropocene (Open Book Publishers 2021), and her article “Student-Driven Digital Learning: A Call to Action” appears in People, Practice, Power: Digital Humanities outside the Center (MIT Press 2021).

 

Date:
Thursday, March 14, 2024
Time:
1:30pm - 3:00pm
Time Zone:
Eastern Time - US & Canada (change)
Online:
This is an online event. Event URL will be sent via registration email.
Audience:
  Everyone  
Categories:
  DASH     SCDS Sponsored Events     Workshops  
Registration has closed.

More information on Sherman Centre Events can be found on the SCDS Events page.

CODE OF CONDUCT

The Sherman Centre and the McMaster University Library are committed to fostering a supportive and inclusive environment for its presenters and participants. As a participant in this session, you agree to support and help cultivate an experience that is collaborative, respectful, and inclusive, as well as free of harassment, discrimination, and oppression. We reserve the right to remove participants who exhibit harassing, malicious or persistently disruptive behaviour. Please refer to our code of conduct webpage for more information.

Event Organizer

Chelsea Miya
Profile photo of Jason Brodeur
Jason Brodeur

More events like this...