2026-03-26
This project will be taken data from the Tempe Pre-Calls for the SunDevil Days event from the Administration Services Department. The goal is to identify the pattern in outreach records and results after every call, student common majors, locations, and event-related trends.
Main questions:
vm = voicemail, conf = confirm)Tempe Pre-Calls_ Texts - SDD Mon 3_30.csv (I am a student worker for Admission Services, I used the old data to do this project)03-30-26) and event location (Tempe)Status is constant (Registered) and Type is constant (First Time Freshman), so they are useful for context but not for variation-based analysisNotes, Mailing.City, Mailing.State.Province, Plan, Primary.Interest..Campaign.Name, Location, and Stagelibrary(dplyr)
library(ggplot2)
library(plotly)
tempe
<- read.csv("Tempe Pre-Calls_ Texts - SDD Mon 3_30.csv", stringsAsFactors = FALSE)
names(tempe)
<- make.names(names(tempe))
analysis_df
<- tempe %>%
mutate(
notes_lower = tolower(trimws(Notes)),
call_result_group = case_when(
grepl("^vm", notes_lower) ~ "Voicemail",
grepl("^conf", notes_lower) ~ "Confirmed",
TRUE ~ "Other"
)
)
vm is appearing to be the dominant note by a large amount of our pre-call results.conf which means confirmation is the second most result.Location, Stage, and grouped call result.| Relationship | Statistic | DF | P.value | Significant |
|---|---|---|---|---|
| Stage vs Call Result | 9.49 | 10 | 0.486 | No |
| Location vs Call Result | 9.09 | 6 | 0.168 | No |
| Status vs Call Result | NA | NA | NA | N/A |
Stage vs Call Result has p-value 0.486.Location vs Call Result has p-value 0.168.Status vs Call Result was not testable because every record is Registered.vm is the most common call result, followed by conf. Resulting in Voicemail and Confirm.Status cannot explain call-result differences because every record is Registered.