Overview

Row

Dashboard guide

This dashboard analyzes the Climate Change Dataset with attention to data cleaning, missing-value treatment, and visual best practices.

Because climate variables are seasonal, missing numeric values were imputed using the mean within each month (for example, missing July temperature values were filled using the average of other July values). If a month had no available value for a variable, the overall mean for that variable was used as a backup.

The design follows common data visualization principles emphasized in course readings: use position and length for accurate comparison, reduce chartjunk, prefer readable labels, and add interpretation directly next to the visuals.

Summary

Metric Value
Rows in dataset 53
Date range Jan 2020 to May 2024
Variables with missing values 14
Individual values imputed 89

References to course best practices

This dashboard applies several visualization ideas commonly emphasized in data visualization courses: Tufte's focus on clarity and high data-ink ratio, Cleveland and McGill's evidence that position and aligned scales support accurate comparison, and Wilke/Cairo-style guidance on restrained color, thoughtful annotation, and explanatory titles.

That is why the dashboard relies mainly on line charts, scatterplots, and ordered bar charts with limited decoration and direct interpretive text.

Row

Total observations

53

Variables imputed

14

Values imputed

89

Row

Carbon dioxide concentration over time

Cleaning and Missing Data

Row

Missing values before imputation

Missing values after imputation

Row

Missing-data handling notes

The dataset contains missing values and a few non-standard placeholders such as "Unknown" and 99999.
To handle these responsibly:

  • text placeholders were converted to missing values first
  • impossible placeholder values like 99999 were also treated as missing
  • missing numeric values were imputed using the mean within month
  • if a month-specific mean was unavailable, the overall mean was used as a fallback

Using monthly means is more appropriate than a single grand mean because climate variables are seasonal. For example, replacing a missing winter temperature with the overall annual mean would distort the seasonal pattern.

Missing-value summary table

Variable Missing_Before Missing_After
Min_Temp (°C) 10 0
Humidity (%) 9 0
Max_Temp (°C) 8 0
Sea_Surface_Temp (°C) 8 0
Avg_Temp (°C) 7 0
Precipitation (mm) 6 0
Solar_Irradiance (W/m²) 6 0
CO2_Concentration (ppm) 6 0
ENSO_Index 6 0
Cloud_Cover (%) 5 0
Urbanization_Index 5 0
Particulate_Matter (µg/m³) 5 0
Wind_Speed (m/s) 4 0
Vegetation_Index 4 0

Appendix

Row

Recent records after cleaning

Year Month Avg_Temp (°C) Precipitation (mm) CO2_Concentration (ppm) Sea_Surface_Temp (°C) ENSO_Index
2024 Jan 6.48 44.01 436.92 23.06 -0.13
2024 Feb 14.76 77.79 437.25 15.72 -0.17
2024 Mar 12.92 128.96 435.32 15.92 0.98
2024 Apr 27.83 107.55 410.97 27.49 0.74
2024 May 8.11 25.14 408.49 27.84 0.87
2023 Jan 4.68 12.19 405.61 29.99 1.00
2023 Feb 28.28 194.19 440.14 12.60 -0.31
2023 Mar 23.07 56.04 436.69 25.25 0.09
2023 Apr 8.19 18.13 400.49 19.93 -0.53
2023 May -4.97 67.50 426.50 27.22 -0.08
2023 Jun 6.70 171.48 421.51 19.06 -0.57
2023 Jul 29.04 85.02 418.40 23.23 -0.63
2023 Aug 11.27 58.02 439.55 16.19 0.13
2023 Sep 3.11 26.64 437.47 24.87 -0.75
2023 Oct 13.05 183.98 429.94 13.68 1.00

Notes for interpretation

This dashboard emphasizes readable comparisons instead of decorative complexity. That choice reflects visualization best practices discussed in many introductory readings: use clear scales, direct titles, and simple encodings that let the data stand out. If you want to adapt this further for your class, you can also add a short final paragraph naming the exact readings assigned in your semester.