Instructions:
Use the data from the class survey (“math2203survey.csv”) to answer the questions below. The survey items are listed first (variable name, followed by the question or prompt). Use this information and your understanding of statistics in your answers. Make sure you answer ALL of the questions.

Survey Items:

Duration - Duration of survey (in seconds)
C1 - How many of your classes are fully in person?
C2 - How many of your classes are mixed synchronous online (Zoom) and in person?
C3 - How many of your classes are mixed asynchronous online (recorded videos) and in person?
C4 - How many of your classes are fully online synchronous (Zoom) class sessions?
C5 - How many of your classes are fully online asynchronous (recorded videos)?
format_learn - In which course format do you feel you learn the most?
format_general - Which course format do you prefer in general?
format_stats - Which course format would you prefer for this course?
exp_grade - What grade do you expect to get in this course?
credit_hours - How many credit hours are you enrolled in this semester?
work - Do you have a job?
work_hours - (If you work) How many hours do you work each week on average?
gender - What is your gender?
age - How old are you?
like_most - What do you like most about this course?
like_least - What do you like the least about this course?
change - If you could change one thing about this course, what would it be?

1 Characterization of study - 15 pts

2 Analysis of Data - 15 pts

  1. Load the data into R.

  2. How many students took this survey?
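
As a rough sketch of these two steps, the example below reads a CSV into a data frame and counts its rows. It assumes each row is one respondent. The temporary file and its four rows are hypothetical stand-ins so the code runs without the real data; the names toy_path, survey, and n_students are illustrative only.

```r
# Hypothetical stand-in for math2203survey.csv: a tiny CSV written to a
# temporary file so the example runs without the real data.
toy_path <- tempfile(fileext = ".csv")
writeLines(c(
  "Duration,C1,C2,gender",
  "312,3,1,Female",
  "45,4,0,Male",
  "288,2,2,Female"
), toy_path)

# With the real file in the working directory, this call would instead be
# read.csv("math2203survey.csv").
survey <- read.csv(toy_path)

# Assuming one row per respondent, the row count is the number of students.
n_students <- nrow(survey)
print(n_students)
```

Calling str(survey) or head(survey) right after loading is a quick way to check that the columns were read with the expected names and types.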

2.1 Course Format

  1. Find the mean and median of the C1, C2, C3, C4, C5 variables. Use this information to determine which type of class format is the most common for students this semester.
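
One possible way to compute these summaries in a single pass is sketched below. The data frame holds made-up values, not the survey results, and na.rm = TRUE is an assumption about how blank answers should be handled. Comparing the means is only one reasonable reading of "most common".

```r
# Hypothetical toy data; the real values come from math2203survey.csv.
survey <- data.frame(
  C1 = c(3, 4, 2, 5, NA),
  C2 = c(1, 0, 2, 0, 1),
  C3 = c(0, 0, 1, 0, 0),
  C4 = c(0, 1, 0, 0, 0),
  C5 = c(1, 0, 0, 0, 2)
)
class_vars <- c("C1", "C2", "C3", "C4", "C5")

# na.rm = TRUE skips missing answers instead of returning NA.
means   <- sapply(survey[class_vars], mean,   na.rm = TRUE)
medians <- sapply(survey[class_vars], median, na.rm = TRUE)

# One row per statistic, one column per class format.
summary_table <- rbind(mean = means, median = medians)
print(round(summary_table, 2))

# The format with the largest mean number of classes per student.
most_common <- names(which.max(means))
print(most_common)
```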

2.2 Format Preferences

  1. Make a table of each of the format_learn, format_general, and format_stats variables, and use each table to make a barplot of that variable. Be sure to include axis labels and plot titles.
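
A minimal sketch for one of the three variables follows; the same pattern would apply to the other two. The responses and category labels are hypothetical, since the actual answer options are not listed here.

```r
# Hypothetical toy responses; the category labels are assumptions.
format_general <- c("Fully in person", "Fully online", "Mixed",
                    "Fully in person", "Fully in person", "Mixed")

# table() counts how many times each category occurs.
counts <- table(format_general)
print(counts)

# barplot() accepts the table directly; main, xlab, and ylab set the
# plot title and the axis labels.
barplot(counts,
        main = "Preferred course format in general",
        xlab = "Course format",
        ylab = "Number of students")
```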

2.3 Gender differences

  1. Generate a two-way table of the gender and format_general variables.

  2. Given that someone is female, what is the probability that she prefers a fully in person class?

  3. Given that someone is male, what is the probability that he prefers a fully in person class?

  4. Does there appear to be a difference in student course preferences by gender?
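
The conditional probabilities above have the form P(in person | female) = (number of females who prefer in person) / (number of females), which is a row proportion of the two-way table when gender is the row variable. The sketch below illustrates this with made-up responses; the gender and format labels are assumptions about how the answers are coded.

```r
# Hypothetical toy responses, not the survey data.
survey <- data.frame(
  gender = c("Female", "Female", "Female", "Female",
             "Male", "Male", "Male", "Male"),
  format_general = c("Fully in person", "Fully in person",
                     "Fully in person", "Mixed",
                     "Fully in person", "Mixed", "Mixed", "Fully online")
)

# Rows are gender, columns are preferred format.
two_way <- table(gender = survey$gender, format = survey$format_general)
print(two_way)

# margin = 1 divides each row by its row total, giving P(format | gender).
cond <- prop.table(two_way, margin = 1)
print(round(cond, 2))

p_female <- cond["Female", "Fully in person"]  # 3 of 4 females
p_male   <- cond["Male", "Fully in person"]    # 1 of 4 males
```

Comparing the rows of cond side by side is one way to judge whether preferences differ by gender, keeping in mind that small group sizes make such comparisons noisy.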

2.4 Work factors

  1. Make a two-way table of the work and format_general variables.

  2. Use this table to find, among students who prefer fully in person classes, the proportion who:
    1. do not work
    2. work off campus
    3. work on campus

  3. Given that someone does not work, what is the probability that they prefer a fully in person class?

  4. Given that someone works off campus, what is the probability that they prefer a fully in person class?

  5. Does working appear to impact student course preferences?
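
Items 2 through 4 condition in opposite directions: item 2 fixes the preferred format and asks about work status, while items 3 and 4 fix work status and ask about the preferred format. With work as the row variable, these correspond to column proportions and row proportions of the same table. The sketch below uses hypothetical data, and the labels "No", "Off campus", and "On campus" are assumptions about how the work variable is coded.

```r
# Hypothetical toy responses, not the survey data.
survey <- data.frame(
  work = c("No", "No", "Off campus", "Off campus",
           "Off campus", "On campus", "On campus", "No"),
  format_general = c("Fully in person", "Mixed", "Fully in person",
                     "Fully in person", "Mixed", "Fully in person",
                     "Mixed", "Fully online")
)

work_tab <- table(work = survey$work, format = survey$format_general)
print(work_tab)

# Item 2: among students who prefer in person, the share in each work
# category. margin = 2 divides each column by its column total.
by_format <- prop.table(work_tab, margin = 2)
print(round(by_format[, "Fully in person"], 2))

# Items 3 and 4: within each work category, the share preferring each
# format. margin = 1 divides each row by its row total.
by_work <- prop.table(work_tab, margin = 1)
p_no_work <- by_work["No", "Fully in person"]          # 1 of 3
p_off     <- by_work["Off campus", "Fully in person"]  # 2 of 3
```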

2.5 Enrollment

Plot the number of credit hours enrolled vs. the number of hours worked per week (include axis labels and a title). Does there seem to be a correlation between hours worked and number of credit hours?
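
A scatterplot with a correlation coefficient is one way to approach this, sketched below on made-up values. It assumes that students who do not work have a missing (NA) work_hours value, which use = "complete.obs" drops before computing the correlation.

```r
# Hypothetical toy data; NA marks a student who left work_hours blank.
survey <- data.frame(
  credit_hours = c(15, 12, 18, 9, 15, 12),
  work_hours   = c(10, 20, NA, 30, 0, 25)
)

# Credit hours on the vertical axis, work hours on the horizontal axis.
plot(survey$work_hours, survey$credit_hours,
     main = "Credit hours vs. weekly work hours",
     xlab = "Hours worked per week",
     ylab = "Credit hours enrolled")

# Pearson correlation using only rows where both values are present.
r <- cor(survey$work_hours, survey$credit_hours, use = "complete.obs")
print(r)
```

A value of r near 0 suggests little linear relationship, while values near -1 or 1 suggest a strong one; the plot itself should be checked for patterns that a single number can hide.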

2.6 Validity of data

Find the mean, median, and standard deviation of the duration variable.

Plot a boxplot of the duration variable.

Use this information to discuss whether any of the students’ responses might be invalid (e.g., students weren’t paying attention while taking the survey).
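
The sketch below shows one way to get these summaries and to list the points a boxplot would draw as outliers. The durations are made up. The 1.5 × IQR rule used by boxplot.stats() is only a screening convention, so a flagged response is a candidate for a closer look rather than proof that it is invalid.

```r
# Hypothetical durations in seconds, not the survey data.
duration <- c(310, 295, 402, 350, 28, 330, 2900, 365)

stats <- c(mean = mean(duration),
           median = median(duration),
           sd = sd(duration))
print(round(stats, 1))

boxplot(duration,
        main = "Survey duration",
        ylab = "Duration (seconds)")

# Values lying more than 1.5 times the box length beyond the hinges,
# i.e. the points drawn individually on the boxplot.
flagged <- boxplot.stats(duration)$out
print(flagged)
```

A mean far above the median, as in this toy example, is itself a hint that a few very long durations are pulling the average up.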

2.7 Other Analysis

Feel free to add extra analysis in this section (up to 5 points extra credit for any other interesting findings).

3 Conclusion

What recommendations do you have going forward? Justify your answer using your previous summary and analysis of the data.