Submit your report as HTML knitted from R Markdown, along with the .Rmd file.
Organize your report using different levels of headers.
Include the question, code, result/graph, and explanation for each problem in your report.
Polish graphs for visual comfort.
AI is NOT allowed for this assignment.
gpa data set
The gpa data set is available through the
openintro package in R. Answer the following questions with
an appropriate graph. Summarize your findings in plain text for each
graph to answer the question.
By doing your own research, give the precise meaning of each variable.
Visualize the relationship between studyweek and
gpa. What does your graph indicate?
Visualize the relationship between out and
gpa. What does your graph indicate?
Visualize the relationship between out and
sleepnight. What does your graph indicate?
Visualize the relationship between gender and
studyweek. What does your graph indicate?
Visualize the relationship between gender and
out. What does your graph indicate?
Present a question of your own interest related to this data set. Answer your question with analysis or visualization.
loans_full_schema data set
Finish the following data visualization tasks using the full
loans_full_schema data set (55 columns) in the
openintro library. For each task, summarize
what you learn from the graph accurately and concisely.
Create a histogram of a numeric variable that you select and plot a density curve on top of the histogram. Carefully select bin numbers/sizes/boundaries to make the plot informative. What does this graph indicate?
Create a graph to study the effect of a categorical/discrete variable on the distributions of a numeric variable. What does this graph indicate?
Create a bin heatmap (2d density plot) to study the relationship between two numeric variables that you select. Summarize the findings from the graph.
Use facet_wrap to create an informative plot.
Summarize the findings from the graph.
Use facet_grid to create an informative plot.
Summarize the findings from the graph.
Present a question of your own interest related to this data set. Answer your question with analysis or visualization.
ames data set
The ames data set is available through the
openintro package in R.
Write an introductory paragraph about the data set that provides the basic information: what the data set is about, the number of samples and features, and the scope that the features cover.
Use a plot to analyze how area correlates with
price. Summarize your finding from the graph.
Use a plot to analyze how Bldg.Type correlates with
price. Explain the meaning of each label for
Bldg.Type and summarize your finding from the
graph.
Use a plot to analyze how Bldg.Type and
area together correlate with price.
Summarize your findings from the graph.
(Bonus - 5 Points) You may need to study independently to complete this
task: use a plot to study how area and
Year.Built together correlate with price.
Summarize your findings from the graph.
(Bonus - 5 Points) Present a question of your own interest related to this data set. Answer your question with analysis or visualization.