5.4. Filtering the Data

Let’s start by only looking at films that cost over a million dollars to make.

Create a variable called budget_df that contains all columns for the movies whose budget was over a million dollars.

budget_df = []
budget_df.shape

With this more manageable list of 7000+ movies, I’d like to have a way to look up the budget of a particular movie.

Create a Series object called budget_lookup such that you are able to use a call to budget_lookup['Dead Presidents'] to find the budget of that movie.

budget_lookup = []
budget_lookup['Dead Presidents']

I have figured out that the first (alphabetically) movie whose title starts with an “A” is “A Bag of Hammers” and the last movie that starts with a “B” is “Byzantium”.

budget_lookup[budget_lookup.index.str.startswith('A')].sort_index()[[0]]
title
A Bag of Hammers    2000000
dtype: int64
budget_lookup[budget_lookup.index.str.startswith('B')].sort_index()[[-1]]
title
Byzantium    10000000
dtype: int64

Use that knowledge to create a series that contains budget informations for all the movies that start with an “A” or a “B”. Hint: No need to use startswith like I did above, just use the movie titles to do a slice.

budget_lookup_as_and_bs = []
budget_lookup_as_and_bs.shape

Lesson Feedback

    During this lesson I was primarily in my...
  • 1. Comfort Zone
  • 2. Learning Zone
  • 3. Panic Zone
    Completing this lesson took...
  • 1. Very little time
  • 2. A reasonable amount of time
  • 3. More time than is reasonable
    Based on my own interests and needs, the things taught in this lesson...
  • 1. Don't seem worth learning
  • 2. May be worth learning
  • 3. Are definitely worth learning
    For me to master the things taught in this lesson feels...
  • 1. Definitely within reach
  • 2. Within reach if I try my hardest
  • 3. Out of reach no matter how hard I try
You have attempted of activities on this page