6.1. Introduction

This chapter explores how to analyze data sets in order to summarize their main characteristics. We start by examining a data set in more depth, using Pandas and Altair to explore and visualize the data in different formats, such as histograms and maps. As the chapter goes on, you will see that data is not always available in the form we might wish. Often we need to find other means of obtaining the data, or we need to reformat it in some way. We will learn how to extract data from websites using web scraping. Finally, you will use the Pandas pivot table to summarize the data.
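As a small preview of the kind of summary this chapter builds toward, the sketch below describes a column and then builds a pivot table with Pandas. The data frame and its column names (`region`, `income`, `score`) are made up for illustration and are not the columns of the chapter's data files.

```python
import pandas as pd

# Hypothetical toy data, invented for this preview only.
df = pd.DataFrame({
    "region": ["Europe", "Europe", "Asia", "Asia"],
    "income": ["High", "Low", "High", "Low"],
    "score": [80.0, 60.0, 70.0, 50.0],
})

# Summary statistics (count, mean, std, min, quartiles, max) for one column.
summary = df["score"].describe()

# Pivot table: one row per region, one column per income level,
# each cell holding the mean score for that combination.
pivot = df.pivot_table(values="score", index="region",
                       columns="income", aggfunc="mean")

print(summary)
print(pivot)
```

Each cell of the pivot table answers a question such as "what is the average score for high-income countries in Europe?", which is the sort of summary that is hard to see by reading the raw rows.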

6.1.1. Learning Goals

  • Visualize, analyze, and describe data in various formats

  • Extract data from different sources

  • Summarize the data of a large data set

6.1.2. Learning Objectives

  • Use Pandas to analyze and describe data

  • Visualize data with histograms and scatter plots

  • Graph data on a map using a web API

  • Extract, clean, and save data from web documents

6.1.3. Reading List

In the first part of this module, we continue with some data that should be familiar to you, but we work with it in Pandas instead of a spreadsheet. In the second part, we focus on several different kinds of textual analysis. To get started, download the world countries, protecting minority investors, and starting a business data files. We will work with these files throughout this chapter. Keep in mind that this starting a business file is different from the one used in chapter 2.
