Tutorial 4 - Pandas Advanced
how we can get more information about the data with the function describe ?
.describe(include."all")
what i need to do in order to know if the value is Nan or not ?
.isna
what are the 4 things that importnt to do in our initial data exploration ?
.shape .head .info .describe
what the function dropna() does ?
The dropna() method drops all rows in which we have a missing value,
what is the function agg()?
method is a function in Python's pandas library that is used to perform aggregation operations on DataFrame or Series objects.
how we can change the defult sperater in a file reading ?
sep = "new separator"
what the function groupby() is doing?
diding the data into group by the input that you give the function
why the function dropna() can be dengoures ?
in our case it's risky, since the entire 'video_release_date' has missing values, so if we will use the dropna() method it will drop the entire dataset. To avoid this we can specify the columns on which we want to use the dropna() method using the subset parameter:
What are the tree things we need to check about our data before we do anything with her ?
Does the file includes an index column? Does the file includes a first row of column names or we have to create them manually? Does the file uses the default separator (',')?
how can i reshape this data ? temperature_data_dict = { 'Day' : ['Sunday', 'Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday'], 'Haifa': [30, 29, 28, 29, 30, 31, 32], 'Tel-Aviv': [28, 27, 27, 28, 29, 30, 31], 'Beersheva': [32, 32, 31, 31, 31, 33, 34] }
df = pd.DataFrame(temperature_data_dict)
How to import pandas
import pandas as pd