Data Analysis: focusing on the basics.Covering aspects dealing with data and less is MORE in statistics
Research methods: covering the theoretical and philosophical aspects of doing science. Making sense of science and working on writing and reading skills.
Warning: `as.tibble()` was deprecated in tibble 2.0.0.
ℹ Please use `as_tibble()` instead.
ℹ The signature and semantics have changed, see `?as_tibble`.
`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
##types of variables categorical: ordinal: categories that maintain an order / Nominal: that has no ranking order / binary; nominal variables with two categories. Numerical: Discrete; numbered values that can only take certain values/ continuous; numbered values that are measured can be any number within a particular range.
##Inductive VS Deductivism? They are opposite approaches to reasoning that differ in how they start and what they use to reach a conclusion. Inductive: Observation/ pattern/ hypothesis/ theory Deductivism : Theory/ hypothesis/ observation/ confirmation
##types of good and bad questions Bad questions: 1.is there any difference between a and b? 2.is A bigger than B? 3.Can X influence Y?
Good questions: 1.what explains the differences between A and B? 2.What makes A bigger than B? 3. How X can influence Y?
##diamonds
diamonds%>% #utilizes the diamonds dataset group_by(color,clarity)%>% #groups data by the color and clarity variables. mutate(price200=mean(price))%>% #creates new variables (average price by groups) ungroup()%>% #data no longer grouped by color and clarity mutate(random=10+price)%>% #new variable,original price+$10 select(cut,color,clarity,price,price200,random10)%>% #retain only these listed columns. arrange(color)%>% #visualize data ordered by color. group_by(cut)%>% #group data by cut mutate(dis=n_distinct(price) #counts the number of unique price values per cut. rowID=row_number())%>% #numbers each row consecutively for each cut ungroup() #final ungrouping of data.