In this section, you'll begin to investigate content from the web. You'll start by learning a bit of HTML and CSS, the foundations for the web, and from there you'll take a look at how to scrape information from the web in order to systematically create and build datasets that may not be otherwise available to you.
While many companies provide access to information via APIs, sometimes you have to scrape the information that you need for your analysis from web pages designed to be read by people.
HTML stands for HyperText Markup Language - the "language of the web". You'll start by learning HTML syntax and practice exploring HTML documents. After that, you'll look into the process for handling new HTML elements that you might not have encountered before.
CSS or Cascading Style Sheets is how you make web pages look snazzy. You'll see more about how proper web development workflows separate content from presentation.
Finally, you'll practice using Beautiful Soup - a package for scraping websites.
You will often find that the information you want to retrieve isn't available via an API. When that's the case, it's incredibly important to be proficient with web scraping so that you can retrieve the information you need for your analysis.