UQ July 2018

Overview

Overview

  • Data
  • Exploration
  • Multiple Linear Regression
  • Testing Assumptions
  • Outliers

You will find Chapters 5 and Chapter 8 in An Introduction to R for Spatial Analysis and Mapping by Chris Brunsdon and Lex Comber useful.

Data

  • you will use georgia dataset
  • in GISTools and the GWmodel packages w
  • Some package installation may be needed
  • only needs to be done once

Once we are sure all the packages are installed, you need to load them into the current session:

The Data Variables

The data set contains a number of variables for the counties in Georgia from the 1990 census including the percentage of the population in each County that

  • is Rural (PctRural)
  • have a college degree (PctBach)
  • are elderly (PctEld)
  • that are foreign born (PctFB)
  • that are classed as being in poverty (PctPov)
  • that are black (PctBlack)

and the median income of the county (MedInc) (in 1000s of dollars)

The Data Itself

MedInc PctRural PctBach PctEld PctFB PctPov PctBlack
32.152 75.6 8.2 11.43 0.64 19.9 20.76
27.657 100.0 6.4 11.77 1.58 26.0 26.86
29.342 61.7 6.6 11.11 0.27 24.1 15.42
29.610 100.0 9.4 13.17 0.11 24.8 51.67
36.414 42.7 13.3 8.64 1.43 17.5 42.39
41.783 100.0 6.4 11.37 0.34 15.1 3.49

Initial Explorations

Initial Explorations

Visually, it seems that there may be some colinearity between PctPov,PctBlack and PctEld.

Alternative View