HSS 611: Programming for HSS
Sep 18, 2025
What is NumPy good for?
ndarray
(n-dimensional array) from NumPy is faster and more efficient than Python’s native lists for a variety of reasons
ndarray
, Series
, and DataFrame
ndarray
), and Pandas’ key data structures (Series
and DataFrame
) are built on it
Broad use cases of NumPy and Pandas
Generating ndarray
np.array()
functionndarray
Generating ndarray
Do not get confused with lists
Other ways to generate ndarray
np.zeros()
, np.ones()
Other ways to generate ndarray
np.arange()
(similar to range()
)
Shape, dimension, and dtype
.ndim
: dimension.shape
: shape.dtype
: data type
Casting with astype()
method
Casting with astype()
method
Precision and memory consumption
Precision and memory consumption
Vectorization
Loop, comprehension, and vectorized array arithmetic
Loop, comprehension, and vectorized array arithmetic
Loop, comprehension, and vectorized array arithmetic
Loop, comprehension, and vectorized array arithmetic
Indexing and slicing
Indexing and slicing
Indexing and slicing
Series
One-dimensional array-like object
Series
index
Series
index
Series
index
Series
name
, values
, etc.
Series
Indexing and slicing
Series
Indexing and slicing
Series
Indexing and slicing
[]
-based indexing treat integers as labels if the index contains integersSeries
Indexing and slicing
[]
-based indexing treat integers as labels if the index contains integersSeries
Using NumPy functions or NumPy-like operations
Series
Commonly used methods
Series
Commonly used methods
Series
Commonly used methods
Series
Commonly used methods
Series
Commonly used methods
Series
Commonly used methods
Series
Commonly used methods
Series
Commonly used methods
DataFrame
Represents a rectangular table of data
DataFrame
Represents a rectangular table of data
DataFrame
Many ways to construct a DataFrame
DataFrame
Indexing and slicing
loc
indexer works with labels (integer or string)loc
DataFrame
Indexing and slicing
loc
indexer works with labels (integer or string)loc
DataFrame
Indexing and slicing
DataFrame
Indexing and slicing
loc
DataFrame
Indexing and slicing
iloc
indexer works with positions (0, 1, 2, etc.)
DataFrame
Indexing and slicing
DataFrame
Indexing and slicing
iloc
, like thisDataFrame
Indexing and slicing
DataFrame
Indexing and slicing
DataFrame
Indexing and slicing
DataFrame
Sorting
DataFrame
Sorting
DataFrame
Sorting
DataFrame
Sorting
DataFrame
Sorting
DataFrame
Sorting
DataFrame
Sorting
DataFrame
Handling missing data
isna()
or isnull()
DataFrame
Handling missing data
dropna()
DataFrame
Handling missing data
dropna()
DataFrame
Handling missing data
dropna()
DataFrame
Removing duplicates
duplicated()
return Boolean valuesDataFrame
Removing duplicates
duplicated()
return Boolean valuesDataFrame
Removing duplicates
drop_duplicates()
remove duplicatesDataFrame
Removing duplicates
keep
argument False, it deletes all duplicatesDataFrame
Removing duplicates
drop_duplicates()
remove duplicatesDataFrame
Removing duplicates
drop_duplicates()
remove duplicates