- Analytics toolkits:
- Business Intelligence
- Data Analysis
- Data Science
- Statistical programming
- Transparancy
- Reproducibility
- Practical considerations
- Take a break
August 2, 2018
One possible interpretation of analytics roles and toolkits
Because a hammer is a hammer, but we all know some hammers just feel better than others.
Why statistical programming is valuable
transparent adj
Related to transparency, reproducibility is important for 2 reasons:
There are also extremely practical reasons to use R.
What is a predictive model?
predictive adj
model noun
So simply – a predictive model is a system that tries to predict an outcome.
Interviewer: What is your biggest strength?
Me: I'm an expert in machine learning.
Interviewer: What's 6+10?
Me: Zero.
Interviewer: That's not even close. It's 16.
Me: Okay, it's 16.
Interviewer: What is 10+20?
Me: It's 16.
Machine learning is just one kind of predictive modeling, but in the analytics world that the two terms are often used interchangeably. Let's talk through the basic process.Let's assume we have a perfectly clean and complete data set.
We we separate the data set into 2-3 chunks:
- Testing data
- Validation data (sometimes optional)
What are some things to consider when building a predictive model?
We'll discuss, and I'll present a far-from-exhaustive list.
We have a model. Now what?
What makes a model "good"?
Machine Learning flow chart
There are no hard-and-fast rules. Some things to think about: