Part 1: Variables, Hypothesis, Designs

Title: Offshore outsourcing: Its advantages, disadvantages, and effect on the American economy

Abstract: The United States has trained some of the world’s best computer programmers and technology experts. Despite all of this training, many businesses do not have a full understanding of information technology. As the importance of technology in the business world grows, many companies are wasting money on extensive technology projects. When problems arise, they expect that further investment will solve these issues. To prevent such problems, many companies have begun to outsource these functions in an effort to reduce costs and improve performance. The majority of these outsourced information technology and call center jobs are going to low-wage countries, such as India and China where English-speaking college graduates are being hired at substantially lower wages. The purpose of this study is to evaluate the positive and negative aspects of offshore outsourcing with a focus on the outsourcing markets in India and China, arguably the two most popular destinations for outsourcers. The cost savings associated with offshore outsourcing will be evaluated in relation to the security risks and other weakness of offshore outsourcing. In addition, an analysis of the number of jobs sent overseas versus the number of jobs created in the United States will be used to assess the effects that outsourcing is having on the American economy and job market. Finally, the value of jobs lost from the American economy will be compared to the value of jobs created. The goal of these analyses is to create a clear picture of this increasingly popular business strategy.

Answer the following questions about the abstract above:

  1. What is a potential hypothesis of the researchers? The impact of jobs lost to the American economy is less than the impact of jobs created.

  2. What is one of the independent variables? The number of jobs lost

    1. What type of variable is the independent variable? Numerical
  3. What is one of the dependent variables? The value created by outsourcing

    1. What type of variable is the dependent variable? Numerical
  4. What might cause some measurement error in this experiment? Random error with the collected samples that cannot represent the reality or the systematic error due to the inaccurate relations between independent variables and dependent variables.

  5. What type of research design is the experiment? correlational

    1. Why? Because we are comparing different group of data with no controlled condition.
  6. How might you measure the reliability of your dependent variable? Repeat the measurement

  7. Is this study ecologically valid? Yes

  8. Can this study claim cause and effect?

    1. Why/why not? Yes, because the cost saved (effect) is caused by outsourcing the jobs oversea (cause).
  9. What type of data collection did the researchers use (please note that #5 is a different question)? Observations

Part 2: Use the assessment scores dataset (03_lab.csv) to answer these questions.

The provided dataset includes the following information created to match the abstract:

Calculate the following information:

  1. Create a frequency table of the percent of outsourced jobs.
Data = read.csv("03_data.csv", header = TRUE)
Outsourced_jobs=table(Data$jobs)
hist(Data$jobs)

  1. Create histograms of the two types of cost savings. You will want to add the breaks argument to the hist() function. This argument adds more bars to the histogram, which makes it easier to answer the following questions:

hist(dataset$column, breaks = 15)

15 is a great number to pick, but it can be any number. For this assignment, try 15 to see a medium number of bars.

hist(Data$cost, breaks = 15)

hist(Data$cost2, breaks = 15)

  1. Examine these histograms to answer the following questions:

    1. Which cost savings appears the most normal? Cost2

    2. Which cost savings data is multimodal? Cost

    3. Which cost savings data looks the most skewed (and in which direction positive or negative)? Cost1. Left-skewed (negative skewed)

    4. Which cost savings data looks the most kurtotic? Cost2

  2. Calculate the z-scores for each cost savings, so they are all on the same scale.

Data$cost_z=scale(Data$cost)
Data$cost2_z=scale(Data$cost2)
  1. How many of the cost saving scores were more extreme than 95% of the data (i.e., number of z-scores at a p < .05)?
sum(abs(Data$cost_z)>1.96)
## [1] 10
sum(abs(Data$cost2_z)>1.96)
## [1] 13
a.  Cost Savings 1: 10

c.  Cost Savings 2: 13
  1. Which business had:

    1. the highest cost savings?

    2. the the lowest cost savings?

    3. Use both cost savings columns and find the ID number of the business with the lowest and highest z-score.

max(Data$cost_z)
## [1] 2.573922
max(Data$cost2_z)
## [1] 3.085539
Data$id[Data$cost_z==max(Data$cost_z)]
## [1] S100
## 200 Levels: S001 S002 S003 S004 S005 S006 S007 S008 S009 S010 S011 ... S200
Data$id[Data$cost2_z==max(Data$cost2_z)]
## [1] S097
## 200 Levels: S001 S002 S003 S004 S005 S006 S007 S008 S009 S010 S011 ... S200
min(Data$cost_z)
## [1] -2.647601
min(Data$cost2_z)
## [1] -2.92971
Data$id[Data$cost_z==min(Data$cost_z)]
## [1] S190
## 200 Levels: S001 S002 S003 S004 S005 S006 S007 S008 S009 S010 S011 ... S200
Data$id[Data$cost2_z==min(Data$cost2_z)]
## [1] S092
## 200 Levels: S001 S002 S003 S004 S005 S006 S007 S008 S009 S010 S011 ... S200