The stability and salience of variable (ING) (workin’ ~ working) (Labov 2001a:86) have foregrounded (ING)’s role in the study of language variation and change (Fischer 1958, Labov 1966/1982, Trudgill 1974, Campbell-Kibler 2007, Houston 1985), yet researchers emphasize that our knowledge of “…broad cross-variety patterns of [ING] usage” (Tagliamonte 2004:394) remains scant. We analyze constraints on (ING) across grammatical category in Philadelphia English and show that nominal, verbal and quantifier (ING) are conditioned differently. We argue that (ING)’s variant stylistic conditioning supports a conception of (ING) as more than one variable and venture that (ING) may be less stable than is widely believed.
#i have a more concrete idea now of the graph i intend to put here (one graph, showing that style conditions nom verb and quant ing differently); however, i'm having some issues with my coding and had to remove my in-process coding bc the document wouldn't run with it in
(ING)’s nominal/verbal distinction can be traced to the suffix’s evolution during the Middle English period (Tagliamonte 2004). -ing resulted from the combination of at least two grammatical forms: the present participle, which in Old English was -ende, and the verbal noun, which in Old English was formed with the -ung suffix. In Middle English, a sequence of reduction processes gradually replaced -ende with -end, from which point it evolved to -en and finally to -in. Concurrently, spellings of “ing” or “ynge” began to replace the Old English -ung spelling. An explosion of variation in usage of (ING), connected to social evaluation, occurred once the -ing and -in suffices became commonly understood as variants of the same (ING) suffix. Our study probes one of the factors that emerged to condition variation within (ING), style.
Data come from a roughly age/sex-balanced sample of 40 speakers from the Philadelphia Neighborhood Corpus (Labov & Rosenfelder 2011). Every instance of (ING) was coded for pronunciation (apical [ɪn] or velar [ɪŋ]) and grammatical class: nominal (including monomorphemes and nominal gerunds), verbal (participles and progressives) or quantifier (something/nothing). The preceding segments (velar-nasal, coronal-nasal, coronal-obstruent, or other) and following segments (velar, pause, or other) were also accounted for, and were coded based on their place and manner of articulation.
Our grammatical coding of (ING) follows the classificatory model developed by Tamminga (2014:44-49), and the examples that follow are Tamminga’s. Monomorphemic (ING) refers to roots, or items that are stored as whole objects in the lexicon. Examples include morning, pudding, awning, ceiling and during.
Nominal gerunds refer to words that can be split into a nominal or adjectival root and an -ing suffix, but whose meanings derive from both parts of the word. Examples include building (as in an apartment building), housing, clothing and lining.
Participles, like progressives, are words partially composed of a verbal head. Unlike progressives, however, participles can be substituted with nouns. An example of a participle is coding (which can be substituted with the noun “work”: “She does computer coding” is akin to “She does computer work”). An example of a progressive is studying.
Finally, something/nothing quantifiers are notable because they can be articulated in more reduced ways, with glottal stops and syllabic nasals, in addition to being realized with (ING)’s common apical and velar variants.
## Bin_style newgram sex school.cat birthyear
## Careful:572 exclude: 12 f:566 Just HS :378 Min. :1895
## Casual :484 g :252 m:490 Not HS :290 1st Qu.:1929
## m : 97 Some college:344 Median :1944
## p :391 NA's : 44 Mean :1944
## r :178 3rd Qu.:1964
## s :126 Max. :1984
## Pre_Seg Post_Seg index N
## coronal.N :160 other :555 Min. :0.0000 Min. : 1.000
## coronal.obs:424 pause :355 1st Qu.:0.0000 1st Qu.: 1.000
## other :448 velar.C:146 Median :0.5000 Median : 2.000
## velar.N : 24 Mean :0.5294 Mean : 3.591
## 3rd Qu.:1.0000 3rd Qu.: 4.000
## Max. :1.0000 Max. :59.000
We adopt Labov (2001b)’s Style Decision Tree for classifying speech in the sociolinguistic interview into 1 of 8 contextual styles, further grouped into “Careful” or “Casual” for analysis. We expect that more [ɪŋ] than [ɪn] will appear in Careful speech, as [ɪŋ] is the standard variant of (ING) and has indexical meanings including formality, effortfulness, articulateness and education (Campbell-Kibler 2007). Conversely, we expect that the nonstandard variant [ɪn], which has been found to index inarticulateness/unpretentiousness, relaxation, and uneducation (Campbell-Kibler 2007), will appear with a higher frequency than [ɪŋ] in Casual speech.
#had to remove coding because not working - trying to figure it out
## style Bin_style ing.rate
## 1: R Careful 0.4497817
## 2: L Careful 0.2631579
## 3: S Careful 0.3506098
## 4: C Careful 0.4241908
## 5: N Casual 0.2828125
## 6: G Casual 0.3828125
## 7: K Casual 0.4782609
## 8: T Casual 0.4711656
We fit a logistic regression model to predict (ING) variant used from birth year, preceding segment, following segment, speaker gender, style, grammatical class, level of educational attainment and lexical frequency. We tested for birth year/education and speaker gender/style interactions. Every predictor except style yields significant main effects, and a significant birth year/education interaction appears. A logistic regression was run after it was discovered that a generalized linear mixed effects model could not support the data. < CHECK THIS SENT
# Working in dplyr
library(dplyr)
# Working in lme4 (regressions)
library(lme4)
# Accessing data file
ing.lsa3 <- read.csv("ing.lsa3.csv")
# Turning ing.lsa3 into dataframe
ing.lsa3 <- data.frame(ing.lsa3)
# Making model: birthyear * education, sex * style, logfreq
# Model includes all grammatical categories
mod.ing.lsa.all.fin <- glm(code ~ birthyear * school.cat + Pre_Seg + Post_Seg + sex *
Bin_style + logfreq + newgram2, ing.lsa3, family = "binomial")
# Seeing the results of mod.ing.lsa.all.fin
summary(mod.ing.lsa.all.fin)
##
## Call:
## glm(formula = code ~ birthyear * school.cat + Pre_Seg + Post_Seg +
## sex * Bin_style + logfreq + newgram2, family = "binomial",
## data = ing.lsa3)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -2.4155 -0.8756 -0.5646 1.0027 2.2715
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 26.339383 6.137636 4.291 1.78e-05
## birthyear -0.012062 0.003142 -3.839 0.000124
## school.catNot HS -45.296690 10.745597 -4.215 2.49e-05
## school.catSome college -21.738879 8.393810 -2.590 0.009601
## Pre_Segcoronal.obs -0.846507 0.186001 -4.551 5.34e-06
## Pre_Segother -1.428586 0.175955 -8.119 4.70e-16
## Pre_Segvelar.N -2.482373 0.547082 -4.537 5.69e-06
## Post_Segpause 0.573719 0.091797 6.250 4.11e-10
## Post_Segvelar.C 0.051659 0.167874 0.308 0.758290
## sexm -1.068120 0.104626 -10.209 < 2e-16
## Bin_styleCasual -0.201488 0.105392 -1.912 0.055903
## logfreq -0.145758 0.021402 -6.810 9.73e-12
## newgram2p -0.993734 0.088338 -11.249 < 2e-16
## newgram2s -0.996756 0.169223 -5.890 3.86e-09
## birthyear:school.catNot HS 0.023434 0.005553 4.220 2.44e-05
## birthyear:school.catSome college 0.011628 0.004297 2.706 0.006809
## sexm:Bin_styleCasual 0.157157 0.158067 0.994 0.320106
##
## (Intercept) ***
## birthyear ***
## school.catNot HS ***
## school.catSome college **
## Pre_Segcoronal.obs ***
## Pre_Segother ***
## Pre_Segvelar.N ***
## Post_Segpause ***
## Post_Segvelar.C
## sexm ***
## Bin_styleCasual .
## logfreq ***
## newgram2p ***
## newgram2s ***
## birthyear:school.catNot HS ***
## birthyear:school.catSome college **
## sexm:Bin_styleCasual
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 4852.6 on 3598 degrees of freedom
## Residual deviance: 4073.3 on 3582 degrees of freedom
## (193 observations deleted due to missingness)
## AIC: 4107.3
##
## Number of Fisher Scoring iterations: 4
Following Tagliamonte (2004) and using the full model’s predictors, we fit separate models for each grammatical class. For nominal and verbal (ING), preceding segment, following segment, speaker gender, grammar and lexical frequency are significant, but birth year, education, style and both interactions were significant only for verbal (ING).
# Working in lme4 (regression)
library(lme4)
# Accessing data file
ing.lsa.nom <- read.csv("ing.lsa.nom.csv")
# Creating first model - NOMINAL
mod.ing.lsa.nom <- glm(code ~ birthyear + Pre_Seg + Post_Seg + sex + Bin_style +
school.cat, ing.lsa.nom, family = "binomial")
# Seeing results of nominal model
summary(mod.ing.lsa.nom)
##
## Call:
## glm(formula = code ~ birthyear + Pre_Seg + Post_Seg + sex + Bin_style +
## school.cat, family = "binomial", data = ing.lsa.nom)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -2.7251 -1.0595 0.4884 0.9360 1.7418
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 11.622221 6.965703 1.668 0.09522 .
## birthyear -0.004882 0.003558 -1.372 0.17010
## Pre_Segcoronal.obs -0.925760 0.292942 -3.160 0.00158 **
## Pre_Segother -2.168827 0.271319 -7.994 1.31e-15 ***
## Pre_Segvelar.N -2.330654 0.968428 -2.407 0.01610 *
## Post_Segpause 1.118219 0.182534 6.126 9.01e-10 ***
## Post_Segvelar.C 0.161780 0.285289 0.567 0.57066
## sexm -0.756193 0.147513 -5.126 2.96e-07 ***
## Bin_styleCasual -0.076404 0.149253 -0.512 0.60872
## school.catNot HS -0.379256 0.193057 -1.964 0.04947 *
## school.catSome college 0.540211 0.174830 3.090 0.00200 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 1357.7 on 1014 degrees of freedom
## Residual deviance: 1150.6 on 1004 degrees of freedom
## (72 observations deleted due to missingness)
## AIC: 1172.6
##
## Number of Fisher Scoring iterations: 4
# Working in lme4 (regression)
library(lme4)
# Accessing data file
ing.lsa.verb <- read.csv("ing.lsa.verb.csv")
# Creating second model - VERBAL
mod.ing.lsa.verb <- glm(code ~ birthyear + Pre_Seg + Post_Seg + sex + Bin_style +
school.cat, ing.lsa.verb, family = "binomial")
# Seeing results of verbal model
summary(mod.ing.lsa.verb)
##
## Call:
## glm(formula = code ~ birthyear + Pre_Seg + Post_Seg + sex + Bin_style +
## school.cat, family = "binomial", data = ing.lsa.verb)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -1.7620 -0.8609 -0.5940 1.0920 2.2093
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) -0.5338711 4.9889810 -0.107 0.91478
## birthyear 0.0002321 0.0025501 0.091 0.92749
## Pre_Segcoronal.obs -0.2214000 0.2694499 -0.822 0.41126
## Pre_Segother -0.7193356 0.2584618 -2.783 0.00538 **
## Pre_Segvelar.N -1.8218400 0.6899855 -2.640 0.00828 **
## Post_Segpause 0.6193800 0.1195548 5.181 2.21e-07 ***
## Post_Segvelar.C -0.1773273 0.2332191 -0.760 0.44705
## sexm -1.1218894 0.1028091 -10.912 < 2e-16 ***
## Bin_styleCasual -0.2453234 0.1002576 -2.447 0.01441 *
## school.catNot HS 0.2785092 0.1379976 2.018 0.04357 *
## school.catSome college 0.9969303 0.1191992 8.364 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 2725.1 on 2164 degrees of freedom
## Residual deviance: 2448.4 on 2154 degrees of freedom
## (97 observations deleted due to missingness)
## AIC: 2470.4
##
## Number of Fisher Scoring iterations: 4
# Working in lme4 (regression)
library(lme4)
# Accessing data file
ing.lsa.quant <- read.csv("ing.lsa.quant.csv")
# Creating third model - QUANTIFIER
mod.ing.lsa.quant <- glm(code ~ birthyear + Post_Seg + sex + Bin_style +
school.cat, ing.lsa.quant, family = "binomial")
# Seeing results of quantifier model
summary(mod.ing.lsa.quant)
##
## Call:
## glm(formula = code ~ birthyear + Post_Seg + sex + Bin_style +
## school.cat, family = "binomial", data = ing.lsa.quant)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -1.8524 -0.8132 -0.5362 0.9875 2.1383
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 26.370104 12.355885 2.134 0.0328 *
## birthyear -0.013918 0.006323 -2.201 0.0277 *
## Post_Segpause -0.371417 0.267237 -1.390 0.1646
## Post_Segvelar.C 1.089798 0.527798 2.065 0.0389 *
## sexm -1.144259 0.237424 -4.819 1.44e-06 ***
## Bin_styleCasual -0.046640 0.236920 -0.197 0.8439
## school.catNot HS 0.274444 0.320272 0.857 0.3915
## school.catSome college 1.670657 0.286623 5.829 5.58e-09 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 530.56 on 423 degrees of freedom
## Residual deviance: 454.64 on 416 degrees of freedom
## (8 observations deleted due to missingness)
## AIC: 470.64
##
## Number of Fisher Scoring iterations: 4
The speaker gender/style interaction revealed that women style-shift more dramatically than men do.
# Working in dplyr
library(dplyr)
# Working in ggplot (graphing)
library(ggplot2)
# Accessing data file
ing.lsa2 <- read.csv("ing.lsa2.csv")
# Reordering Casual and Careful style values along x-axis
ing.lsa2$style <- factor(ing.lsa2$style, c("R","L","S","C","N","G","K","T","R","L","S","C","N","G","K","T"))
# Creating graph
ggplot(ing.lsa2, aes(style,code)) +
geom_bar(stat = "identity", aes(fill=sex)) +
facet_wrap(~ Bin_style) +
facet_grid(sex ~ Bin_style) +
ggtitle("Stylistic Differentiation of (ING) by sex
for eight categories of the Style Decision Tree")
For quantifier (ING), birth year, education, speaker gender and the birth year/education interaction are significant. This suggests external factors condition all three grammatical forms differently and that internal factors condition quantifier (ING) differently than nominal and verbal (ING), which supports nominal and verbal (ING)’s divergent “social profiles” (Tagliamonte 2004:399; Tamminga 2014) and quantifier (ING)’s observed (Labov 2001a:88) contrastive behavior.
While suprising, these findings resemble the grammatically differentiated constraint ranking for (ING) observed by Tagliamonte in a 2004 study of York English. Of the internal and external factors that Tagliamonte evaluated, grammatical category most significantly influenced patterns of (ING) usage (Tagliamonte 2004:398). Subsequent analysis of constraint ranking for nominal compared to verbal (ING) revealed that “NOUNS and VERBS have entirely separate and unique linguistic and social profiles” (Tagliamonte 2004:399). Although differences exist in the factors that Tagliamonte and I found to be significant, in both studies, of two distinct varieties of English, grammatical category primarily conditioned variation within (ING). The significant role of grammar in mediating variation within (ING) is not new (see Houston 1985; Marsh 1866), but as Tagliamonte emphasizes sufficient attention has not be dedicated to examining this trend.
It is striking that stylistic constraints primarily distinguish each grammatical category; these categories have not consistently been differentiated in literature on (ING)’s indexical meaning (Eckert 2012, Podesva 2007, Kiesling 2008). This evidence supports the claim that (ING) is more than one variable (Tagliamonte 2004:400, Tamminga 2014:5). Verbal and quantifier (ING) demonstrated birth year/education interactions, shown by decreasing /ing/ rates for speakers who only attended high school and reflecting college education’s changing prevalence, as finishing high school is no longer a definitive social achievement. Birth year’s significant main effect, in addition to the interaction (as in Horvath 1985 and Labov 1972), counters (ING)’s corroborated stability (summarized Labov 2001a:86), which suggests a change in progress.
# Working in dplyr
library(dplyr)
# Working in ggplot (graphing)
library(ggplot2)
# Accessing data file
ing.lsa2 <- read.csv("ing.lsa2.csv")
# Turning ing.lsa2 into dataframe
ing.lsa2 <- data.frame(ing.lsa2)
# Reordering educational attainment values
ing.lsa2$school.cat <- factor(ing.lsa2$school.cat, c("Not HS","Just HS","Some college"))
# Creating graph
ggplot(ing.lsa2, aes(birthyear, code, color = school.cat)) +
geom_point(aes(color=school.cat)) +
stat_smooth(aes(color=school.cat, fill=school.cat), method = "lm") +
labs(x = "Birth year", y = "/ing/ usage (%)",
color = "Educational attainment", fill="Educational attainment") +
ggtitle("/ing/ rate by education over time")
That the rate of (ING) use is changing over time with educational achievement is striking, and reinforces how deeply patterns of language usage are entrenched in greater social phenomena. This is interesting given (ING)’s purported stability (summarized Labov 2001a:86), but more significantly because it implies further investigations of (ING) can be used to probe socially constructed realities that are evolving below the level of consciousness. Eckert (2012:94) identifies “indexical mutability” as the core property of linguistic variables in the third wave study of variation. That (ING)’s usage was shown to change over time suggests it is not immune to the processes of bricolage by which speakers reconstruct variables to reflect their community’s changing social concerns.
Grammatically differentiated conditioning within (ING) draws attention to what the study of (ING) has yet to teach us, which emphasizes the variable’s continued relevance in third wave studies of variation. It appears subsequent studies of (ING) have the potential to deepen our understanding of indexicality and processes of speaker agency such as stance (Kiesling 2008). At a moment when understanding “the dynamics of variation in individuals” (Tamminga, MacKenzie & Embick 2015:2) is of emergent scientific interest, (ING) presents a unique opportunity to investigate variation at the individual level, in many individuals simultaneously. Analysis of larger, more variegated corpora, from diverse contexts, may be necessary to further understand (ING)’s socio-indexical meanings.
Campbell-Kibler, K. (2007). Accent, (ING), and the social logic of listener perceptions. American Speech, 82, 1, pp. 32-64. ● Eckert, P. (2012). Three waves of variation study: the emergence of meaning in the study of sociolinguistic variation. In Annual Review of Anthropology, 41, 87-100. ● Fischer, J. (1958). Social influences on the choice of a linguistic variant. In Word, 14, 1, pp. 47-56. ● Horvath, B.M. (1985). Variation in Australian English. Cambridge: Cambridge University Press. ● Houston, A. (1985). Continuity and change in English morphology: the variable (ING). PhD thesis, University of Pennsylvania. ● Kiesling, S.F. (2008). Style as stance: Stance as the explanation for patterns of sociolinguistic variation. In Alexandra Jaffe (ed), Sociolinguistic Perspectives on Stance. Oxford University Press. pp. 171-194. ● Labov, W. (1966/1982). The Social Stratification of English in New York City. Washington, D.C.: Center for Applied Linguistics. ● Labov, W. (1972). Sociolinguistic patterns. Philadelphia: University of Pennsylvania Press. ● Labov, W. (2001a). Principles of Linguistic Change: Volume 2, Social Factors. Blackwell Publishers. ● Labov, W. (2001b). The anatomy of style-shifting. In Eckert & Rickford (Eds.), Style and Sociolinguistic Variation. Cambridge University Press. ● Labov, W. & I. Rosenfelder. (2011). The Philadelphia Neighborhood Corpus. ● Podesva, R. (2007). Three sources of stylistic meaning. Texas Linguistic Forum 51. ● Tagliamonte, S.A. (2004). Someth[in]’s go[ing] on!: Variable ing at ground zero. In Gunnarsson et al. (eds.), Language Variation in Europe: papers from the Second International Conference on Language Variation in Europe, ICLA VE 2 Uppsala, Sweden, June 12-14, 2003. ● Tamminga, M. (2014). Persistence in the Production of Linguistic Variation. Ph.D. Dissertation, University of Pennsylvania. ● Tamminga, M., MacKenzie, L. & D. Embick. (2015). The dynamics of variation in individuals. Paper under review at Language Variation. ● Trudgill, P. (1974). The social differentiation of English in Norwich. Cambridge: University of Cambridge Press.