Dmitriy Tsimokha
10/09/18
Scientists: David Bamman (Pennsylvania), Jacob Eisenstein (Georgia) & Tyler Schnoebelen (California)
Study of the relationship between:
Data: a novel corpus of 14,000 Twitter users
Features of this qualitative research (in the comparison with their previous quantitative work):
Gender for sociolinguists were constructed, maintained, and disrupted by linguiscit practices.
Instrumentalist paradigm that emphasizes prediction of latent attributes from text:
Assembled 19,320 English blogs (681,288 posts, 140 million words), and built a predictive model of gender that achieves 80.5 percent accuracy
A post-hoc factor analysis found that:
Trained a classifier on a dataset of posts (‘tweets’) by 1,000 authors
They found that:
Identified author gender by linking 184,000 Twitter accounts to blog profiles with gender metadata
They found that automatic prediction of author gender is more accurate that the judgments of human raters.
Informational word classes were found to be used preferentially by men Involvement and interaction are associated with women
The involvement and informational word classes were associated with these genres, and the genres were in turn associated with gender. But, within each genre, there were no significant gender differences in the frequency of the word classes.
Examined the interaction between gender and the local categories of school-oriented ‘jocks’ and anti-school ‘burnouts’:
the social meaning of linguistic variables depends crucially on the social and linguistic context in which they are deployed
Rather than describing variables like ING/IN as a direct reflection of gender or class, they can be seen as reflecting a field of meanings: educated/uneducated, effortful/easygoing, articulate/inarticulate, pretentious/unpretentious, formal/relaxed, and so on
This view has roots in Butler’s (1990: 179) casting of gender as a stylized repetition of acts, creating a relationship between (at least) an individual, an audience, and a topic
gender and other social categories are performances, and these categories are performed differently in different situations
ways in which the interaction between language and gender are mediated by situational contexts
Each of these studies demonstrates a richness of interactions between language, gender, and situational context.