The data is from the US National Health and Nutrition Examination Study, or NHANES that records anthropometric and health-related indicators from participants. This specific subset of the NHANES dataset is of the 2009-2010 and 2011-2012 sample years.The dataset has various type of interesting variables inclusive of demographic variables, physical measurements and health metrics. Our main objective is to understand the distribution of weight among the population and determine the relevent factors associated with increase in weight.
The histogram and density graphs describes the distribution of overall weight which is bi modal and the majority of weight is clustered between 50kg and 100 kg.
The boxplot describes the weight distribution by race. Black race has the highest observation of weight and the highest median as compared to other race categories.
All race categories show a very positive relationship between weight and height, that is as the height increases the weight also increases.