Perhaps the data that you need to complete your analysis isn't available from an API. Perhaps you're writing a story about a company that's trying to hide information that's supposed to be public. Or perhaps you want to augment your existing customer database to better understand your customers. In all of these cases, web scraping can help.
Web scraping is a method of extracting information from websites. It's used to transform unstructured web data, typically in HTML format, into structured data that can be stored & analyzed. Python provides a number of powerful tools to quickly create custom web scrapers.
What You Will Learn:
In this course, you learn how to write Python scripts to programmatically retrieve & store (web scrape) data from websites like Reddit. In the process, you learn about basic data structuring & Chrome Developer Tools for investigating a website's HTML format. As well, you can learn to integrate information from an external API to enrich your scraped data.
How to Prepare:
You can't learn EVERYTHING in ~2 hours. But you can learn enough to get excited & comfortable to keep working & learning on your own! The course is for absolute beginners.
A web browser to see what you're working on as others see it (Recommend Google Chrome: [chrome.google.com] (http://chrome.google.com))
You use an online text editor for this workshop. You can sign up here: https://repl.it/