Homework: September 16, 2015

Reading 2.1; #7-15 odds

China
50 million users
350 million users
It would have been better to use relative frequency, because China’s population is much larger than those of the other countries.

69%
55.2 million
Inferential, because this data is based on a sample.

0.42; 0.61
55+
18-34
As age increases, so does the likelihood of buying American.

Relative frequency distribution:

Never: 0.0262

Rarely: 0.0678

Sometimes: 0.1156

Most of the time: 0.2632

Always: 0.5272

52.7%
9.4%
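The relative frequencies and the two percentages above can be checked from the survey counts. This is only a sketch: the names `counts`, `rel_freq`, `pct_always`, and `pct_never_rarely` are my own, and it assumes 52.7% is the share answering "always" and 9.4% is the combined share answering "never" or "rarely".

```r
# Survey counts for how often college students wear seat belts
counts <- c(never = 125, rarely = 324, sometimes = 552,
            "most of the time" = 1257, always = 2518)

# Relative frequency = class count / total count
rel_freq <- counts / sum(counts)
print(round(rel_freq, 4))

# Percent who always wear a seat belt
pct_always <- round(100 * rel_freq[["always"]], 1)

# Percent who never or rarely wear a seat belt
pct_never_rarely <- round(100 * sum(rel_freq[c("never", "rarely")]), 1)

print(c(always = pct_always, never_or_rarely = pct_never_rarely))
```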

# Frequency bar graph

my_data <- c(125, 324, 552, 1257, 2518)

groups <- c("never", "rarely", "sometimes", "most of the time", "always")

barplot(my_data, main = "How Often College Students Wear Seat Belts", names.arg = groups)

# Relative frequency bar graph: each count divided by the total

my_data <- c(125, 324, 552, 1257, 2518)

groups <- c("never", "rarely", "sometimes", "most of the time", "always")

rel_freq <- my_data / sum(my_data)

barplot(rel_freq, main = "How Often College Students Wear Seat Belts", names.arg = groups)

# Pie chart

my_data <- c(125, 324, 552, 1257, 2518)

groups <- c("never", "rarely", "sometimes", "most of the time", "always")

pie(my_data, labels = groups, main = "How Often College Students Wear Seat Belts")

This is an inferential statement because it draws a conclusion about a population from sample data.

Relative frequency distribution:

More than 1 hour a day: 0.3678

Up to 1 hour a day: 0.1873

A few times a week: 0.1288

A few times a month or less: 0.0790

Never: 0.2371

0.2371

# Frequency bar graph

my_data <- c(377, 192, 132, 81, 243)

groups <- c("more than 1 hour a day", "up to 1 hour a day", "a few times a week", "a few times a month or less", "never")

barplot(my_data, main = "Time Spent on the Internet", names.arg = groups)

# Relative frequency bar graph: each count divided by the total

my_data <- c(377, 192, 132, 81, 243)

groups <- c("more than 1 hour a day", "up to 1 hour a day", "a few times a week", "a few times a month or less", "never")

rel_freq <- my_data / sum(my_data)

barplot(rel_freq, main = "Time Spent on the Internet", names.arg = groups)

# Pie chart

my_data <- c(377, 192, 132, 81, 243)

groups <- c("more than 1 hour a day", "up to 1 hour a day", "a few times a week", "a few times a month or less", "never")

pie(my_data, labels = groups, main = "Time Spent on the Internet")

No level of confidence is provided along with the estimate.

Reading 2.2; #9-14

8
2
15
4
15%
Bell shaped

4
9
17%
Bell shaped

200
10
60-69, 2; 70-79, 3; 80-89, 13; 90-99, 42; 100-109, 58; 110-119, 40; 120-129, 31; 130-139, 8; 140-149, 2; 150-159, 1.
100-109
150-159
5.5%
No
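The answers in this group can be checked against the frequency table listed above. This is a rough sketch with my own variable names (`freq`, `lower`, `pct_130_plus`, and so on), and it assumes the 5.5% is the share of values in the classes from 130 upward.

```r
# Class frequencies from the table above, named by lower class limit
freq <- c("60" = 2, "70" = 3, "80" = 13, "90" = 42, "100" = 58,
          "110" = 40, "120" = 31, "130" = 8, "140" = 2, "150" = 1)

total <- sum(freq)                  # number of observations
lower <- as.numeric(names(freq))    # lower limit of each class
width <- lower[2] - lower[1]        # class width

# Classes with the highest and lowest frequency (by lower limit)
highest_class <- names(freq)[which.max(freq)]
lowest_class <- names(freq)[which.min(freq)]

# Percent of observations in the classes starting at 130 or above
pct_130_plus <- 100 * sum(freq[lower >= 130]) / total

print(c(total = total, width = width, pct_130_plus = pct_130_plus))
```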

200
0-199, 200-399, 400-599, 1000-1199, 1400-1599
0-199
Skewed right
It does not take into account that Texas’ population is much larger than Vermont’s population.

Skewed right. Most household incomes will be to the left with fewer higher incomes to the right.
Bell-shaped. Most scores will occur near the middle range, with scores tapering off equally in both directions.
Skewed right. Most households will have, say, 1 to 4 occupants, with fewer households having a higher number of occupants.
Skewed left. Most Alzheimer’s patients will fall in older-aged categories, with fewer patients being younger.

Skewed right. There are only so many drinks a person can consume.
Uniform. There should be about the same number of students in each grade, and thus in each age group.
Skewed left. Older people are far more likely to have hearing aids.
Bell shaped. Most heights will fall somewhere in the middle with some extreme lows (very short men) and highs (very tall men).

Additional problem:

hist(iris$Sepal.Length)

This histogram is roughly bell-shaped: it is neither clearly skewed nor uniform.
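One quick way to back up the reading of the histogram is to compare the mean and the median, since the two sit close together when a distribution is roughly symmetric. This is only a sketch using the built-in `iris` data; `x`, `mean_x`, and `median_x` are my own names, and the comparison is a rule of thumb rather than a formal test.

```r
# Same variable that was plotted in the histogram
x <- iris$Sepal.Length

# A mean well above the median suggests right skew,
# a mean well below the median suggests left skew
mean_x <- mean(x)
median_x <- median(x)

print(c(mean = mean_x, median = median_x))
```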

Christy Tan

September 15, 2015