In the world of data science, data do not always arrive in the form we would like.
One such case is when data are either top coded or bottom coded.
For example, you may be interested in household income. You run to the American Community Survey only to find the following.
- Households earning less than $10,000 are lumped into the lowest income category (i.e. bottom coded).
- Households earning more than $1,000,000 are lumped into the top income category (i.e. top coded).
In both cases the exact incomes are unobserved for exogenous reasons: the survey records only that they fall below or above the cutoff.
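To make the idea concrete, here is a minimal sketch of what top and bottom coding does to an income variable. The two thresholds come from the example above; everything else is an assumption for illustration: the incomes are simulated from a log-normal distribution with made-up parameters (they are not ACS data), and `censor_income` is a hypothetical helper, not part of any survey tooling.

```python
import numpy as np

BOTTOM_CODE = 10_000     # lower threshold from the example above
TOP_CODE = 1_000_000     # upper threshold from the example above


def censor_income(income, lower=BOTTOM_CODE, upper=TOP_CODE):
    """Hypothetical helper: apply bottom and top coding to raw incomes.

    Returns the coded values plus two boolean flags marking which
    observations were bottom coded and which were top coded.
    """
    income = np.asarray(income, dtype=float)
    bottom_coded = income < lower
    top_coded = income > upper
    # Values outside [lower, upper] are replaced by the nearest threshold.
    coded = np.clip(income, lower, upper)
    return coded, bottom_coded, top_coded


# Simulated "true" incomes; the distribution and its parameters are assumptions.
rng = np.random.default_rng(0)
true_income = rng.lognormal(mean=11.0, sigma=1.2, size=10_000)

coded, bottom, top = censor_income(true_income)

print(f"share bottom coded: {bottom.mean():.3f}")
print(f"share top coded:    {top.mean():.3f}")
print(f"mean of true income:  {true_income.mean():,.0f}")
print(f"mean of coded income: {coded.mean():,.0f}")
```

Comparing the two means shows why the coding matters: summary statistics computed from the coded variable can differ from those of the underlying incomes, because all the information beyond each threshold has been collapsed to a single value.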