All our Data Science projects include bite-sized activities to test your knowledge and practice in an environment with constant feedback.
All our activities include solutions with explanations on how they work and why we chose them.
Perform the calculation and store the results in the variable col_missing_values.
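The calculation itself is not spelled out here. Assuming it means counting the missing values in each column, a minimal sketch could look like the following. The sample data is hypothetical; only the names df and col_missing_values come from the activity.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; the real df comes from the project dataset.
df = pd.DataFrame({
    "Payment Type": ["Cash", None, "Credit Card"],
    "Total Distance": [5.0, np.nan, np.nan],
})

# Assumption: "the calculation" is the number of missing values per column.
# isna() marks each missing cell as True, and sum() counts them column-wise.
col_missing_values = df.isna().sum()
print(col_missing_values)
```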
Drop this column permanently, as it cannot be used for any purpose.
Perform this drop permanently in df.
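One possible way to make the drop permanent is to reassign the result back to df. The column name "Unused Column" below is a placeholder, since the activity does not name the column here.

```python
import pandas as pd

# Hypothetical sample data; "Unused Column" is a placeholder name.
df = pd.DataFrame({"Trip ID": [1, 2], "Unused Column": [None, None]})

# drop() returns a new DataFrame by default, so the result is assigned
# back to df to make the change permanent.
df = df.drop(columns=["Unused Column"])
print(df.columns.tolist())
```

Passing inplace=True to drop() should have the same effect; reassignment is used here only because it makes the change explicit.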
Make sure to apply this change to the original df.
Make sure to apply this change to the original df.
Credit Card in the column Payment Type. Make sure to apply this change to the original df.
edu_trips_with_GO.
Outliers are defined as any value 3 or more standard deviations above or below the mean.
Identify the outliers and drop them.
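The 3-standard-deviation rule can be sketched as follows. The column name "Fare" and the sample values are hypothetical, and the sample standard deviation (the pandas default) is assumed.

```python
import pandas as pd

# Hypothetical data: 19 typical values and one extreme value.
df = pd.DataFrame({"Fare": [10.0] * 19 + [1000.0]})

mean = df["Fare"].mean()
std = df["Fare"].std()  # sample standard deviation (ddof=1), the pandas default

# True for rows 3 or more standard deviations away from the mean,
# in either direction.
is_outlier = (df["Fare"] - mean).abs() >= 3 * std

# Keep only the rows that are not outliers.
df = df[~is_outlier]
print(len(df))
```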
Important Note: Make sure to correctly solve the previous activities before solving this activity.
Invalid values are defined as any value other than Credit Card or Cash.
Select the valid values and store them in the column Payment_Type_Fixed, with invalid values set to NaN. Then select the rows with invalid values and store the results in the variable df_invalid_payment_type.
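A minimal sketch of this step, assuming the source column is named Payment Type as mentioned earlier. The sample values are hypothetical.

```python
import pandas as pd

# Hypothetical sample data with two valid and two invalid entries.
df = pd.DataFrame({"Payment Type": ["Cash", "Credit Card", "Card??", "cash "]})

valid_values = ["Credit Card", "Cash"]
is_valid = df["Payment Type"].isin(valid_values)

# where() keeps values where the condition is True and replaces the rest with NaN.
df["Payment_Type_Fixed"] = df["Payment Type"].where(is_valid)

# Rows whose fixed value is NaN are the invalid ones.
df_invalid_payment_type = df[df["Payment_Type_Fixed"].isna()]
print(df_invalid_payment_type)
```

Note that rows where Payment Type was already missing also end up in df_invalid_payment_type under this approach.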
Invalid values are defined as any value other than PKR.
Select the invalid values and drop them from the original df.
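One way to sketch this selection and drop is shown below. The column name "Currency" is an assumption, as the activity text does not name the column, and the sample values are hypothetical.

```python
import pandas as pd

# Hypothetical sample data; "Currency" is a placeholder column name.
df = pd.DataFrame({"Currency": ["PKR", "USD", "PKR", None]})

# Anything that is not exactly "PKR" (including missing values) is invalid.
is_invalid = ~df["Currency"].isin(["PKR"])

# Drop the invalid rows by index and reassign so the original df is updated.
df = df.drop(df[is_invalid].index)
print(df)
```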
Invalid values are defined as any value that is not an integer.
Select the valid values and store them in the column Total_Distance_Fixed, with invalid values set to NaN. Then select the rows with invalid values and store the results in the variable df_invalid_distance.
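A possible sketch of the integer check follows. It assumes the source column is named "Total Distance" and holds values as text, and it treats a value as valid only if it parses as a whole number. Both the column name and the sample values are hypothetical.

```python
import pandas as pd

# Hypothetical sample data; "Total Distance" is an assumed column name.
df = pd.DataFrame({"Total Distance": ["12", "7.5", "abc", "30", None]})

# Convert to numbers; anything that cannot be parsed becomes NaN.
numeric = pd.to_numeric(df["Total Distance"], errors="coerce")

# A value is valid only if it parsed and has no fractional part.
is_integer = numeric.notna() & (numeric % 1 == 0)

# Keep valid values, replace the rest with NaN.
df["Total_Distance_Fixed"] = numeric.where(is_integer)

# Rows whose fixed value is NaN are the invalid ones.
df_invalid_distance = df[df["Total_Distance_Fixed"].isna()]
print(df_invalid_distance)
```

Because NaN is a floating-point value, Total_Distance_Fixed is stored as float in this sketch.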