basic medium

TOC

basic medium#

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns
from vega_datasets import data

# if vega_datasets is not installed, you can install it from the
# notebook with
# %pip install vega_datasets
# restart the kernel after installation

let’s load the cars dataset#

cars = data.cars()

Explore the shape and first few lines of the dataset#

# your code

Compute the mean and std for numeric columns#

# your code

Put the name of the columns labels in lower case#

# your code

# check it
cars.head(2)

	Name	Miles_per_Gallon	Cylinders	Displacement	Horsepower	Weight_in_lbs	Acceleration	Year	Origin
0	chevrolet chevelle malibu	18.0	8	307.0	130.0	3504	12.0	1970-01-01	USA
1	buick skylark 320	15.0	8	350.0	165.0	3693	11.5	1970-01-01	USA

Create a column “consommation (l/km)”, and remove the column miles_per_gallon#

Tip: miles_per_gallon/235.2 = litre_per_100km

# your code

Create a columns “poids (kg)” and remove the column weight_in_lbs#

Tip: 1lb = 0.454 kg

# your code

Count the number of different origin#

# your code

Check the memory usage of the origin column, convert the column ‘origin’ to category, check the new memory usage*#

# your code