1. Research Background

Personality psychology has long examined the ways that stable behavioral tendencies influence everyday functioning, achievement, and interpersonal behavior (John, Naumann, & Soto, 2008). One of the most widely accepted frameworks in personality psychology is the Big Five personality model, which categorizes personality into five broad dimensions: extraversion, neuroticism, agreeableness, conscientiousness, and openness to experience (Goldberg, 1993). Among these traits, conscientiousness has consistently emerged as one of the strongest predictors of performance-related outcomes across educational, occupational, and behavioral contexts (Roberts et al., 2009).

Conscientiousness is generally associated with organization, persistence, self-discipline, responsibility, planning, and attention to detail. Individuals who score highly in conscientiousness are often described as dependable, efficient, and goal-oriented. Prior research has linked conscientiousness to academic achievement, workplace productivity, healthier lifestyle behaviors, and long-term goal attainment (Roberts et al., 2009). Because conscientious individuals are more likely to regulate their behavior effectively and remain focused on tasks, conscientiousness is frequently associated with greater productivity and stronger performance outcomes (Costa & McCrae, 1992).

The relationship between conscientiousness and productivity is particularly important because productivity-related behaviors influence success in many areas of life, including education, employment, and personal goal achievement. Productive individuals often demonstrate time management skills, persistence, organization, and consistency when completing tasks. Many of these behaviors overlap conceptually with characteristics commonly associated with conscientiousness.

The present study examined whether conscientiousness predicts productivity-related behavior using a large public dataset from the Open Psychometrics Big Five personality inventory (Open Psychometrics, n.d.). The use of a large archival dataset provides an opportunity to explore personality-behavior relationships using a substantial and diverse sample of participants. In addition, quantitative analysis using regression modeling allows for the examination of the strength and direction of the relationship between conscientiousness and productivity-related behavior.

The current study focuses specifically on the following research question:

To what extent does conscientiousness predict productivity-related behavior?

Based on prior personality research, it was hypothesized that individuals with higher levels of conscientiousness would report higher levels of productivity-related behavior.


2. Load Data

library(readr)

data <- read_table("PSY 290 Final/data.csv")

3. Data Preparation

Reverse Coding (1–5 scale)

data$C2 <- 6 - data$C2
data$C4 <- 6 - data$C4
data$C6 <- 6 - data$C6
data$C8 <- 6 - data$C8

Create Variables

data$conscientiousness <- rowMeans(data[, c(
  "C1","C2","C3","C4","C5",
  "C6","C7","C8","C9","C10"
)], na.rm = TRUE)

data$productivity <- rowMeans(data[, c(
  "C3","C5","C7","C9"
)], na.rm = TRUE)

4. Methods

Dataset and Participants

This study used archival data from the Open Psychometrics Big Five personality dataset, a publicly available dataset containing responses from individuals who completed an online Big Five personality inventory. The dataset included responses from 19,719 participants across multiple countries and demographic backgrounds. Variables included demographic information, personality inventory responses, and measures related to the five major personality dimensions.

Because the dataset is publicly available and fully anonymized, no direct participant recruitment or informed consent procedures were required for the current project. The use of archival data also allowed for the analysis of a large sample size that would otherwise be difficult to collect within the scope of a class project.

Measures

Conscientiousness

Conscientiousness was operationalized using ten items from the conscientiousness dimension of the Big Five inventory (C1–C10). These items measured behaviors and attitudes associated with organization, responsibility, planning, persistence, and self-discipline.

Several items within the conscientiousness scale were negatively worded. To ensure that higher scores consistently reflected higher levels of conscientiousness, reverse coding was conducted for items C2, C4, C6, and C8. Reverse coding was performed using the formula:

new score = 6 − original score

Following reverse coding, a composite conscientiousness score was created by calculating the mean score across all ten conscientiousness items.

Procedure

Data were imported into R using the readr package. Following data loading, reverse coding procedures were conducted for negatively worded conscientiousness items. Composite scores for conscientiousness and productivity-related behavior were then calculated using row means.

Descriptive statistics were computed to examine the distribution of the variables within the sample. A scatterplot was generated to visualize the relationship between conscientiousness and productivity-related behavior.

Finally, a linear regression analysis was conducted to determine whether conscientiousness significantly predicted productivity-related behavior. Linear regression is commonly used in psychological research to evaluate predictive relationships between continuous variables. Because both conscientiousness and productivity-related behavior were measured continuously, regression analysis was appropriate for examining the direction and strength of the relationship.

Statistical Analysis

The primary statistical analysis used in this study was simple linear regression. Linear regression was selected because the study aimed to examine whether one continuous predictor variable (conscientiousness) significantly predicted another continuous outcome variable (productivity-related behavior).

The regression model estimated the direction and strength of the relationship between conscientiousness and productivity-related behavior. Statistical significance was evaluated using p-values, while the amount of explained variance was assessed using the statistic.


5. Descriptive Data Analysis

Descriptive statistics were calculated to examine the distribution of conscientiousness and productivity-related behavior within the sample.

Participants generally reported moderate levels of conscientiousness and productivity-related behavior. The mean conscientiousness score was 3.35 on a five-point scale, while the mean productivity-related behavior score was 3.39. These values suggest that participants tended to perceive themselves as relatively organized, disciplined, and task-oriented.

The conscientiousness variable ranged from 1.00 to 5.00, indicating substantial variability in conscientiousness levels across participants. Similarly, productivity-related behavior ranged from 0.00 to 5.00, suggesting that some participants reported very low levels of productivity-related behavior while others reported very high levels.

The scatterplot demonstrated a strong positive relationship between conscientiousness and productivity-related behavior. As conscientiousness scores increased, productivity scores also tended to increase. The overall pattern of the data appeared relatively linear, supporting the use of linear regression analysis.

The visualization also suggested a relatively consistent relationship across the range of conscientiousness scores. Although some variability was present, participants with higher conscientiousness scores generally clustered around higher productivity scores.

These descriptive findings provided preliminary support for the hypothesis that conscientiousness would positively predict productivity-related behavior.

summary(data$conscientiousness)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   2.800   3.400   3.347   3.900   5.000
summary(data$productivity)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   0.000   2.750   3.500   3.389   4.000   5.000

6. Visualization

library(ggplot2)

ggplot(data, aes(x = conscientiousness, y = productivity)) +
  geom_point(alpha = 0.2) +
  geom_smooth(method = "lm", se = TRUE) +
  labs(
    title = "Relationship Between Conscientiousness and Productivity",
    x = "Conscientiousness",
    y = "Productivity-Related Behavior"
  ) +
  theme_minimal()
## `geom_smooth()` using formula = 'y ~ x'

Figure 1 displays the relationship between conscientiousness and productivity-related behavior. The regression line demonstrates a clear positive association between the variables, indicating that higher conscientiousness scores are generally associated with higher productivity-related behavior scores.


7. Statistical Analysis

model <- lm(productivity ~ conscientiousness, data = data)
summary(model)
## 
## Call:
## lm(formula = productivity ~ conscientiousness, data = data)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -2.48279 -0.25466  0.00222  0.25222  1.70846 
## 
## Coefficients:
##                   Estimate Std. Error t value Pr(>|t|)    
## (Intercept)       0.187807   0.013474   13.94   <2e-16 ***
## conscientiousness 0.956241   0.003933  243.14   <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.4035 on 19717 degrees of freedom
## Multiple R-squared:  0.7499, Adjusted R-squared:  0.7499 
## F-statistic: 5.912e+04 on 1 and 19717 DF,  p-value: < 2.2e-16

8. Results and Interpretation

A linear regression analysis was conducted to examine whether conscientiousness significantly predicted productivity-related behavior.

Results indicated that conscientiousness was a statistically significant positive predictor of productivity-related behavior, b = 0.96, SE = 0.004, t(19717) = 243.14, p < .001. The positive regression coefficient indicates that individuals with higher conscientiousness scores also tended to report higher levels of productivity-related behavior.

Specifically, the results suggest that for every one-unit increase in conscientiousness, productivity-related behavior increased by approximately 0.96 units. This finding supports the original hypothesis that conscientiousness would positively predict productivity-related behavior.

The model explained a substantial proportion of variance in productivity-related behavior, = .75. This indicates that approximately 75% of the variability in productivity-related behavior was explained by conscientiousness.

The extremely high R² value suggests a very strong relationship between the variables. However, this finding should be interpreted cautiously because the productivity measure was derived from items within the conscientiousness scale itself. As a result, conceptual overlap between the independent and dependent variables may have inflated the strength of the observed relationship.

Despite this limitation, the findings remain consistent with broader personality research suggesting that conscientiousness is strongly associated with performance-related outcomes (Roberts et al., 2009). Highly conscientious individuals are often characterized by self-discipline, organization, persistence, and goal-directed behavior, all of which may contribute to greater productivity.

These findings are also consistent with prior research linking conscientiousness to academic achievement, workplace performance, and behavioral self-regulation (Costa & McCrae, 1992). Individuals with higher conscientiousness scores may be more likely to establish goals, maintain schedules, complete tasks efficiently, and persist through challenges.

The large sample size used in the present study strengthens confidence in the stability of the findings. Because the dataset included nearly 20,000 participants, the results are less likely to reflect random sampling variability.

Overall, the results support the conclusion that conscientiousness is strongly associated with productivity-related behavior.


9. Conclusion

The present study examined whether conscientiousness predicts productivity-related behavior using a large public personality dataset. Results from the regression analysis demonstrated a strong and statistically significant positive relationship between conscientiousness and productivity-related behavior.

Individuals who reported higher levels of conscientiousness also tended to report higher levels of organization, planning, attention to detail, and task completion. These findings support the hypothesis that conscientiousness is strongly associated with productive and goal-directed behavior.

The study also demonstrates the usefulness of quantitative statistical methods for examining psychological constructs using large-scale archival datasets. Through the use of descriptive statistics, data visualization, and regression analysis, the present project illustrates how personality variables can be analyzed and interpreted using R and R Markdown.

Overall, the findings contribute to the broader understanding of how personality traits are associated with behavioral and performance-related outcomes.


10. Limitations

One limitation is that productivity was operationalized using a subset of conscientiousness items, which may introduce conceptual overlap. Additionally, because productivity was derived from items within the conscientiousness scale, the strength of the relationship may be inflated. Additionally, the cross-sectional nature of the dataset limits the ability to make causal conclusions. Although conscientiousness was strongly associated with productivity-related behavior, the present analysis cannot determine whether conscientiousness directly causes increased productivity.

All measures are self-reported and may be subject to response bias.


11. Future Research

Future research could use independent behavioral measures of productivity to better isolate the relationship between personality and performance. Future research may also examine whether demographic variables such as age, education, or cultural background moderate the relationship between conscientiousness and productivity-related behavior. Longitudinal studies could further examine how conscientiousness influences productivity over time.


References

Costa, P. T., & McCrae, R. R. (1992). Revised NEO personality inventory (NEO PI-R) and NEO five-factor inventory (NEO-FFI) professional manual. Psychological Assessment Resources.

Goldberg, L. R. (1993). The structure of phenotypic personality traits. American Psychologist, 48(1), 26–34.

John, O. P., Naumann, L. P., & Soto, C. J. (2008). Paradigm shift to the integrative Big Five trait taxonomy: History, measurement, and conceptual issues. In O. P. John, R. W. Robins, & L. A. Pervin (Eds.), Handbook of personality: Theory and research (3rd ed., pp. 114–158). Guilford Press.

Open Psychometrics. (n.d.). Raw data from online personality tests. https://openpsychometrics.org/_rawdata/

Roberts, B. W., Jackson, J. J., Fayard, J. V., Edmonds, G., & Meints, J. (2009). Conscientiousness. In M. R. Leary & R. H. Hoyle (Eds.), Handbook of individual differences in social behavior (pp. 369–381). Guilford Press.