Program
- This is a course of the Master of Biostatistics
Some important facts
- It is assumed that you have some experience with statistical concepts, such as, independence, estimators, multiple linear regression model, hypothesis tests.
- In this course we will write programs in the language R. But, if you code in another language, for instance, Python or Matlab, it is also fine.
- Each session have a duration of two hours.
General Objective
To build, evaluate and compare predictive statistical learning models.
Topics
In this course we will cover the following topics:
Multiple Linear Regression
The model
Estimating the parameters
Asymptotic properties of the estimators
Testing hypotheses about the parametes
Confidence intervals about the parameters
Prediction interval for a new response value
Model Checking
K-Nearest Neighbors
Methods of Validation
Best Subset of Predictors
Ridge and Lasso Regression
Decision Regression Trees
Logistic Regression
Discriminant Analysis: Linear and Quadratic
Decision Clasification Trees
Ensemble Models: Bagging y Boosting
Naive Bayes Classifier
Support Vector Machines
Neuronal Nets
Grading
This course will be grade based on an applied project. The weigths of each stage of the project are:
- 29/02 Formulation of the project 20%
- 28/03 Preliminary report 20%
- 23/05 Final paper 50%
- 30/05 Oral exposition 10%
The Final paper must have a minimum of eigth pages and a maximum of twelve pages.
Bibliography
- T. Hastie, R. Tibshirani, J. Friedman (2009).The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Second Edition, Springer Series in Statistics.
- Casella G. and Berger, (2000). Statistical Inference, Second Edition.
- Gareth, J., Witten, D., Hastie, T., and Tibshirani R. (2017). An introduction to Statistical Learning with applications in R, Springer.
- R Core Team, R: A language and environment for statistical computing, R Foundation for Statistical Computing, 2019.
- M. Kuhn, K. Johnson (2013). Applied Predictive Modeling. Springer.
- D. Olson, D. Wu (2020). Predictive Data Mining Models. Second Edition. Springer Verlag.
- C. Bishop (2006). Pattern Recognition and Machine Learning. Springer Verlag.