Lecture 14 Statistics: The Next Generation

Eamonn Mallon
12/10/2020

BS2004: Contemporary Techniques in Biological Data Analysis

  • Optional 15 credit second year course
  • 11 x (Online lectures, Discussion sessions, Practicals, Help sessions)
  • Second semester

Do I not know all the stats?

  • What we taught you is a good basis
  • If your experiments are simple, they will be fine
  • multiway ANOVAs (with interactions), nested ANOVAs, ANCOVAs etc. etc
  • You could learn these piecemeal as required or

BS2004: Contemporary Techniques in Biological Data Analysis

  • Model formulae
  • General and generalised linear models

Model formulae

  • 50 male squirrels' weight, 50 female squirrels' weight
  • Does the weight of the squirrel depend on its sex?
  • Model formula: WEIGHT=SEX
  • In R:
WEIGHT~SEX
  • ~ means “depends on” (Dependent variable LHS, Independent variables RHS)

General and generalised linear models

  • t-tests, ANOVA, ANCOVA, and regressions are types of General linear models
  • The difference between general and generalised linear models is simply how error is handled
    • General linear models assume errors are independent and follow a normal distribution
    • Generalized linear models can use a wide range of distributions
    • i.e. your data doesn't have to be normal (bye bye non-parametric tests)
  • lm is the R command for General linear models
  • glm is the R command for Generalised linear models

See you next year

cube