1.1

1. Statistics is the science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions. In addition, statistics is about providing a measure of confidence in any conclusions.

3. Individual

5. Statistic, Parmeter

7. Parameter (summary of population of governors in US)

9. Statistic (summary of the sample of HS students)

11. Parameter

13. Statistic

15. Qualitative

17. Quantitative

19. Quantitative

21. Qualitative

23. Discrete

25. Continuous

27. Continuous

29. Discrete

39 Population: all US teenagers age 13-17 Sample: 1,028 US teenagers age 13-17

40 Population: all bottles of soda from Oct 15 Sample: 50 bottles of soda from Oct 15

41 Population: all plants of soybeans Sample: 100 plants of soybean

42 Population: all US households Sample: 50,000 US households

1.2

11. Experiment

13. Observational

2.1

China
50 million
350 million
The graph represents the Top 10 Internet Users by frequency and not by percentage of users within the context of that countries population, which could be misleading.

69%
55.2%
Descriptive, because it just summarizes data.

45%, 61%
55+
18-34
The older you are the more likely you are to buy something when it has been made in America

Never: .0261

Rarely: .0678

Sometimes: .1156

Most of the time: .2632

Always: .5272

52.72%
2.61%, 6.78%

d e f

my_data <- c(125, 324, 552, 1257, 2518)

groups <- c("Never", "Rarely", "Sometimes", "Most", "Always")

barplot(my_data, main = "Wearing Seatbelts", names.arg = groups)

barplot(my_data, main = "Wearing Seatbelts", names.arg = groups, col = c("red","blue","green","yellow", "black"))

rel_freq <- my_data / sum(my_data)

barplot(rel_freq, main = "Wearing Seatbelts", names.arg = groups, col = c("red","blue","green","yellow","black"))

pie(my_data, labels = groups, main = "Wearing Seatbelts")

Inferential because it uses a sample to make generalizations.

More then 1 hour: .367

Up to 1 hour: .187

A few time a week: .128

A few times a month: .079

Never: .237

23.7%

c d e

my_data <- c(377, 192, 132, 81, 243)

groups <- c("More 1", "Up to 1", "Few times week", "Few times month", "Never")

barplot(my_data, main = "Use the internet", names.arg = groups)

barplot(my_data, main = "Use the internet", names.arg = groups, col = c("red","blue","green","yellow", "black"))

rel_freq <- my_data / sum(my_data)

barplot(rel_freq, main = "Use the internet", names.arg = groups, col = c("red","blue","green","yellow","black"))

pie(my_data, labels = groups, main = "Use the internet")

They are taking results based on a sample and generalizing it.

2.2

8
2
15
4
15%
bell-shaped distribution

4
9
9%
Bell-shaped

200
10
60-69: 2 70-79: 3 80-89: 13 90-99: 42 100-109: 58 110-119: 40 120-129: 31 130-139: 8 140-149: 2 150-159: 1
100-109
150-159
5.5%
No

200
Skip this problem
0-200
Skewed right
The data is not put into the perspective of the population size or miles traveled, a fair comparison can be made by comparing percentages that are relative to population or miles traveled not simply frequency.

Skewed right, most incomes will be below the median
Bell shaped, most scores will be close to the median with balanced scores on either side.
Skewed right, most households will have a number of occupants below the median.
Skewed left, most alzheimer’s patients ages will be above the median.

Skewed right, most people will consume a number of alcoholic drinks that is below the median.
Uniform, most grades with respective ages will have the same amount of students throughout the district.
Skewed left, most hearing-aid patients ages will be above the median.
Bell shaped, most heights will be close to the median with balanced heights on either side.

HW1S_skeleton

Harrel Blatt

July 4, 2016

1.1

1.2

2.1

2.2