Max K. Goff
21 November 2015
Fewer than 1% of Yelp users have ever achieved elite status
Project Paper:
Githup project assets: https://github.com/maxgoff/YelpDataScienceProject.git
The Yelp Academic Data Sets were ingested, flattened, tidied, combined, and modified, resulting in a test set to facilitate predictive modeling:
| FieldName | RType | VariableType |
|---|---|---|
| isElite | factor | Response |
| Review Count | numeric | Independent |
| Votes Count | integer | Independent |
| Fans | integer | Independent |
| Ave Review Len | integer | Independent |
| Average Stars | numeric | Independent |
| Flesch Kincaid | numeric | Independent |
| Friends Count | numeric | Independent |
| Days Yelping | integer | Independent |
| Compliments | integer | Independent |
| Activity | Performance Level |
|---|---|
| Write a review | 5 or more per month |
| Characters per review | at least 1095 |
| Target reading level | 5th grade |
| Vote on other reviews | at least 16.2 per month |
| Get a compliment | at least 2.5 per month |
| Make a friend | at least 1 per month |
| Get a fan | at least 1 per quarter |
13 Classification Models tested, selected: caret::xgbTree