This dashboard analyzes the Climate Change Dataset including data cleaning, missing-value treatment, and visual best practices.
Because climate variables are seasonal, missing numeric values were imputed using the mean within each month (for example, missing July temperature values were filled using the average of other July values). If a month had no available value for a variable, the overall mean for that variable was used as a backup.
The design follows common data visualization principles emphasized in course readings: use position and length for accurate comparison, reduce chartjunk, prefer readable labels, and add interpretation directly next to the visuals.
| Metric | Value |
|---|---|
| Rows in dataset | 53 |
| Date range | Jan 2020 to May 2024 |
| Variables with missing values | 14 |
| Individual values imputed | 89 |
This dashboard applies several visualization ideas commonly emphasized in data visualization courses: Tufte’s focus on clarity and a high data-ink ratio, and Wexler’s emphasis on clear communication through well-structured layouts, readable labels, and audience-friendly explanations.
That is why the dashboard relies mainly on line charts, scatterplots, and ordered bar charts with limited decoration and direct interpretive text.
The dataset contains missing values and a few non-standard
placeholders such as "Unknown" and
99999.
To handle these responsibly:
99999 were also
treated as missingUsing monthly means is more appropriate than a single grand mean because climate variables are seasonal. For example, replacing a missing winter temperature with the overall annual mean would distort the seasonal pattern.
| Variable | Missing_Before | Missing_After |
|---|---|---|
| Min_Temp (°C) | 10 | 0 |
| Humidity (%) | 9 | 0 |
| Max_Temp (°C) | 8 | 0 |
| Sea_Surface_Temp (°C) | 8 | 0 |
| Avg_Temp (°C) | 7 | 0 |
| Precipitation (mm) | 6 | 0 |
| Solar_Irradiance (W/m²) | 6 | 0 |
| CO2_Concentration (ppm) | 6 | 0 |
| ENSO_Index | 6 | 0 |
| Cloud_Cover (%) | 5 | 0 |
| Urbanization_Index | 5 | 0 |
| Particulate_Matter (µg/m³) | 5 | 0 |
| Wind_Speed (m/s) | 4 | 0 |
| Vegetation_Index | 4 | 0 |
Several patterns stand out. The temperature lines suggest strong seasonal swings, while the CO2 series shows a general increase across the time span. The monthly summary reinforces that climate variables are seasonal, which supports the choice to impute within month rather than across the entire dataset. The scatterplot suggests a positive relationship between sea-surface temperature and air temperature, and the correlation heatmap helps identify broader associations among temperature, humidity, radiation, and atmospheric indicators.
| Year | Month | Avg_Temp (°C) | Precipitation (mm) | CO2_Concentration (ppm) | Sea_Surface_Temp (°C) | ENSO_Index |
|---|---|---|---|---|---|---|
| 2024 | Jan | 6.48 | 44.01 | 436.92 | 23.06 | -0.13 |
| 2024 | Feb | 14.76 | 77.79 | 437.25 | 15.72 | -0.17 |
| 2024 | Mar | 12.92 | 128.96 | 435.32 | 15.92 | 0.98 |
| 2024 | Apr | 27.83 | 107.55 | 410.97 | 27.49 | 0.74 |
| 2024 | May | 8.11 | 25.14 | 408.49 | 27.84 | 0.87 |
| 2023 | Jan | 4.68 | 12.19 | 405.61 | 29.99 | 1.00 |
| 2023 | Feb | 28.28 | 194.19 | 440.14 | 12.60 | -0.31 |
| 2023 | Mar | 23.07 | 56.04 | 436.69 | 25.25 | 0.09 |
| 2023 | Apr | 8.19 | 18.13 | 400.49 | 19.93 | -0.53 |
| 2023 | May | -4.97 | 67.50 | 426.50 | 27.22 | -0.08 |
| 2023 | Jun | 6.70 | 171.48 | 421.51 | 19.06 | -0.57 |
| 2023 | Jul | 29.04 | 85.02 | 418.40 | 23.23 | -0.63 |
| 2023 | Aug | 11.27 | 58.02 | 439.55 | 16.19 | 0.13 |
| 2023 | Sep | 3.11 | 26.64 | 437.47 | 24.87 | -0.75 |
| 2023 | Oct | 13.05 | 183.98 | 429.94 | 13.68 | 1.00 |
This dashboard was designed with ideas from Tufte and Wexler in mind. From Tufte, the main goal was to maximize clarity by focusing attention on the data rather than decoration. That is why the charts avoid unnecessary fancy effects and heavy backgrounds. The line charts, bar charts, and heatmap rely on simple position, length, and color differences so the viewer can compare values quickly and accurately.
From Wexler, the dashboard applies practical communication choices that make graphics easier to read for a broad audience. Titles are written to explain the point of the plot, labels are spelled out clearly, and related visuals are grouped with short interpretation text so the dashboard does not feel like a collection of disconnected figures. The missing-value section tells the viewer how the data were cleaned before analysis, which improves transparency and credibility.