Introduction

My investigation is inspired by Beekhuizen, B., Watson, J., & Stevenson, S. (2017). Semantic typology and parallel corpora: Something about indefinite pronouns. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 39). While they studied the semantic functions of indefinite pronouns, I chose to study the semantic functions of prepositions, specifically the Spanish prepositions ‘por’ and ‘para’ and their English equivalents.

Methods

Using the OpenSubtitles database, I collected instances of ‘por’ and ‘para’, as well as the corresponding subtitles in English, and coded them by function. For example:
> Spanish subtitle: Las toallas sirven para secarse.
> English subtitle: The towels are for drying.

Here you can see that the Spanish uses ‘para’ to serve the same function as English uses ‘for’. They both identify the PURPOSE of the towels.

Semantic functions identified:

Function Description
Purpose Function or purpose of an object, e.g. 'Las toallas sirven para secarse'
Recipient Recipient of a gift, e.g. 'Para quien es este pastel?'
Destination Final destination of a journey, e.g. 'Vamos para casa'
Time destination Destination in the sense of a time period, the intention to use a thing at a certain time, e.g. 'La publicacion es para otoño'
Perception Expressing that a perception is specific to a person, e.g. 'Esta vida es sagrada para mi'
Infinitive Bears the infinitive form of the verb (in English, the 'to' that goes before the bare verb), e.g. 'Fue demasiado debil para llevar el acero'
Indeterminate space Around, near, or through a space, space in the sense of 'by means of' instead of 'destination', e.g. 'Me he dado una vuelta por el parque'
Indeterminate time A time period, such as the morning, e.g. 'Mañana por la mañana'
Cause Cause or reason for an action, e.g. 'Gracias por el regalo'
Distribution In English, the sense of 'per' in 'one globe per child', e.g. 'he repartido dos globos por niño'
Interchange Change of one thing for another in a barter or payment, e.g. 'Lo compramos solo por 20 euros'
Media The media by which people communicate, e.g. 'hemos hablado por telefono'
Attribution/agency Assigning an action to an entity, e.g. 'fue iniciado por el gobierno'
Feelings for In English, the 'for' in 'my feelings for her', e.g. 'Deberia comprender mi amor por ella'
Topic The topic about which the verb is, e.g. 'despreocupada por los celos'
Adverbializer Turns an adjective into an adverb, e.g. 'Jenny se calma por completa'
Address Addressing the verb to a person, e.g. 'pregunte por Mr. Barnes'
Identity In English, the 'for' in 'take her for your wife,' e.g. 'darle por esposa'
Stock phrase Part of a larger phrase, unlikely to be parsed as a separate function, e.g. 'por favor' or 'por Dios'
labels <- c("Group 1", "Group 2", "Group 3")
get_code_name <- function(var1) {
  x = switch(var1,
             "PU"= "Purpose",
             "RE"= "Recipient",
             "DE"= "Destination",
             "TD"= "Time destination",
             "PE"= "Perception",
             "INF"= "Infinitive",
             "IS" = "Indeterminate space",
             "IT" = "Indeterminate time",
             "CA" = "Cause",
             "DI" = "Distribution",
             "IN" = "Interchange",
             "ME" = "Media", 
             "AT" = "Attribution/Agency",
             "FE" = "Feelings for",
             "TO" = "Topic",
             "ADV" = "Adverbializer",
             "AD" = "Address",
             "ID" = "Identity",
             "POR FAVOR" = "Stock phrase",
             "FOR DIOS" = "Stock phrase")
  return(x)  
}
t <- t %>%
  mutate("semantic_func" = sapply(code, get_code_name))
                   

Results

Here is the distribution of semantic functions for the two prepositions. Each is distributed through a variety of functions. Some appear in both plots, some do not.


t %>% ggplot(aes(x=code)) + geom_bar() + facet_grid(~ spanish_prep_found, scales="free") +
  labs(x="Semantic Function") + guides(x=guide_axis(angle=45)) + scale_x_discrete(label = t$semantic_func)

For each of the English words or phrases coded (where n>1), here is the distribution as to when they’re realized with ‘por’ and when with ‘para’. A significant number of English prepositions can be realized as either, depending on the semantic function. This is a challenge for L1 English L2 learners of Spanish.

t %>% group_by(translated_to) %>% filter(n() > 1) %>%
  ggplot(aes(y=translated_to, fill=spanish_prep_found, label=spanish_prep_found)) +
  geom_bar(position="fill") + 
  labs(x="Percent Realization", y="English Translation") +
  scale_fill_discrete(name="Spanish Preposition") 

Here is a scatterplot of all the data together. Semantic functions are much closer to a split between ‘por’ and ‘para’ (most columns have only pink or blue) than they are among English prepositions (most rows have both pink and blue), despite there being many more of them in this dataset.

t %>% group_by(translated_to) %>% filter(n() > 1) %>%
  ggplot(aes(code,translated_to, color=spanish_prep_found)) + geom_count() +
  labs(x="Semantic Function", y="English Translation", color="Spanish Preposition") +
  scale_fill_discrete(name=c("Spanish Preposition", "n")) + 
  guides(x=guide_axis(angle=45), size = guide_legend(title='Number of Occurrences')) +
  scale_x_discrete(label = t$semantic_func)

LS0tDQp0aXRsZTogIlIgTm90ZWJvb2siDQpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sNCi0tLQ0KIyMgSW50cm9kdWN0aW9uDQpNeSBpbnZlc3RpZ2F0aW9uIGlzIGluc3BpcmVkIGJ5IEJlZWtodWl6ZW4sIEIuLCBXYXRzb24sIEouLCAmIFN0ZXZlbnNvbiwgUy4gKDIwMTcpLiBTZW1hbnRpYyB0eXBvbG9neSBhbmQgcGFyYWxsZWwgY29ycG9yYTogU29tZXRoaW5nIGFib3V0IGluZGVmaW5pdGUgcHJvbm91bnMuIEluICpQcm9jZWVkaW5ncyBvZiB0aGUgQW5udWFsIE1lZXRpbmcgb2YgdGhlIENvZ25pdGl2ZSBTY2llbmNlIFNvY2lldHkqIChWb2wuIDM5KS4gV2hpbGUgdGhleSBzdHVkaWVkIHRoZSBzZW1hbnRpYyBmdW5jdGlvbnMgb2YgaW5kZWZpbml0ZSBwcm9ub3VucywgSSBjaG9zZSB0byBzdHVkeSB0aGUgc2VtYW50aWMgZnVuY3Rpb25zIG9mIHByZXBvc2l0aW9ucywgc3BlY2lmaWNhbGx5IHRoZSBTcGFuaXNoIHByZXBvc2l0aW9ucyAncG9yJyBhbmQgJ3BhcmEnIGFuZCB0aGVpciBFbmdsaXNoIGVxdWl2YWxlbnRzLg0KDQpgYGB7ciBpbmNsdWRlPUZBTFNFfQ0KbGlicmFyeShlbnRyb3B5KQ0KbGlicmFyeSh0aWR5dmVyc2UpDQpsaWJyYXJ5KGd0KQ0Kc2V0d2QoIkM6L1VzZXJzL0xlbm92by9Eb2N1bWVudHMvTVNVL05MUC9weXRob24gZm9yIGZpbmFsIikNCg0KZGF0YSA9IHJlYWQuY3N2KCdkYXRhL2Rldl9zZXQudHN2Jywgc2VwID0gJ1x0Jywgc3RyaXAud2hpdGUgPSBUUlVFLGZpbGVFbmNvZGluZz0ibGF0aW4xIikNCnQgPSBkYXRhICU+JSANCiAgZmlsdGVyKCEoY29kZSAlaW4lIGMoJ1BVPycsJ0VSUicsICdQT1IgRElPUycsICdQT1IgRkFWT1InLCAnQU1CSUcnKSkpICU+JQ0KICBmaWx0ZXIoISh0cmFuc2xhdGVkX3RvICVpbiUgYygnRElTVFJBTlMnLCAnRVJST1InLCAnRVJSJywgJ05PTkUnKSkpDQoNCmBgYA0KIyMgTWV0aG9kcw0KVXNpbmcgdGhlIE9wZW5TdWJ0aXRsZXMgZGF0YWJhc2UsIEkgY29sbGVjdGVkIGluc3RhbmNlcyBvZiAncG9yJyBhbmQgJ3BhcmEnLCBhcyB3ZWxsIGFzIHRoZSBjb3JyZXNwb25kaW5nIHN1YnRpdGxlcyBpbiBFbmdsaXNoLCBhbmQgY29kZWQgdGhlbSBieSBmdW5jdGlvbi4gDQpGb3IgZXhhbXBsZTogIA0KPiBTcGFuaXNoIHN1YnRpdGxlOiBMYXMgdG9hbGxhcyBzaXJ2ZW4gcGFyYSBzZWNhcnNlLiAgDQo+IEVuZ2xpc2ggc3VidGl0bGU6IFRoZSB0b3dlbHMgYXJlIGZvciBkcnlpbmcuICANCg0KSGVyZSB5b3UgY2FuIHNlZSB0aGF0IHRoZSBTcGFuaXNoIHVzZXMgJ3BhcmEnIHRvIHNlcnZlIHRoZSBzYW1lIGZ1bmN0aW9uIGFzIEVuZ2xpc2ggdXNlcyAnZm9yJy4gVGhleSBib3RoIGlkZW50aWZ5IHRoZSBQVVJQT1NFIG9mIHRoZSB0b3dlbHMuIA0KDQojIyMgU2VtYW50aWMgZnVuY3Rpb25zIGlkZW50aWZpZWQ6DQoNCmBgYHtyIGVjaG89RkFMU0V9DQpzZW1hbnRpY19mdW5jdGlvbnMgPC0gZGF0YS5mcmFtZSgNCiAgRnVuY3Rpb24gPSBjKCJQdXJwb3NlIiwgIlJlY2lwaWVudCIsICJEZXN0aW5hdGlvbiIsICJUaW1lIGRlc3RpbmF0aW9uIiwgIlBlcmNlcHRpb24iLCAiSW5maW5pdGl2ZSIsICJJbmRldGVybWluYXRlIHNwYWNlIiwgIkluZGV0ZXJtaW5hdGUgdGltZSIsICJDYXVzZSIsICJEaXN0cmlidXRpb24iLCAiSW50ZXJjaGFuZ2UiLCAiTWVkaWEiLCAiQXR0cmlidXRpb24vYWdlbmN5IiwgIkZlZWxpbmdzIGZvciIsICJUb3BpYyIsICJBZHZlcmJpYWxpemVyIiwgIkFkZHJlc3MiLCAiSWRlbnRpdHkiLCAiU3RvY2sgcGhyYXNlIiksDQogRGVzY3JpcHRpb24gPSBjKCJGdW5jdGlvbiBvciBwdXJwb3NlIG9mIGFuIG9iamVjdCwgZS5nLiAnTGFzIHRvYWxsYXMgc2lydmVuIHBhcmEgc2VjYXJzZSciLCAiUmVjaXBpZW50IG9mIGEgZ2lmdCwgZS5nLiAnUGFyYSBxdWllbiBlcyBlc3RlIHBhc3RlbD8nIiwgIkZpbmFsIGRlc3RpbmF0aW9uIG9mIGEgam91cm5leSwgZS5nLiAnVmFtb3MgcGFyYSBjYXNhJyIsICJEZXN0aW5hdGlvbiBpbiB0aGUgc2Vuc2Ugb2YgYSB0aW1lIHBlcmlvZCwgdGhlIGludGVudGlvbiB0byB1c2UgYSB0aGluZyBhdCBhIGNlcnRhaW4gdGltZSwgZS5nLiAnTGEgcHVibGljYWNpb24gZXMgcGFyYSBvdG/DsW8nIiwgIkV4cHJlc3NpbmcgdGhhdCBhIHBlcmNlcHRpb24gaXMgc3BlY2lmaWMgdG8gYSBwZXJzb24sIGUuZy4gJ0VzdGEgdmlkYSBlcyBzYWdyYWRhIHBhcmEgbWknIiwgIkJlYXJzIHRoZSBpbmZpbml0aXZlIGZvcm0gb2YgdGhlIHZlcmIgKGluIEVuZ2xpc2gsIHRoZSAndG8nIHRoYXQgZ29lcyBiZWZvcmUgdGhlIGJhcmUgdmVyYiksIGUuZy4gJ0Z1ZSBkZW1hc2lhZG8gZGViaWwgcGFyYSBsbGV2YXIgZWwgYWNlcm8nIiwgIkFyb3VuZCwgbmVhciwgb3IgdGhyb3VnaCBhIHNwYWNlLCBzcGFjZSBpbiB0aGUgc2Vuc2Ugb2YgJ2J5IG1lYW5zIG9mJyBpbnN0ZWFkIG9mICdkZXN0aW5hdGlvbicsIGUuZy4gJ01lIGhlIGRhZG8gdW5hIHZ1ZWx0YSBwb3IgZWwgcGFycXVlJyIsICJBIHRpbWUgcGVyaW9kLCBzdWNoIGFzIHRoZSBtb3JuaW5nLCBlLmcuICdNYcOxYW5hIHBvciBsYSBtYcOxYW5hJyIsICJDYXVzZSBvciByZWFzb24gZm9yIGFuIGFjdGlvbiwgZS5nLiAnR3JhY2lhcyBwb3IgZWwgcmVnYWxvJyIsICJJbiBFbmdsaXNoLCB0aGUgc2Vuc2Ugb2YgJ3BlcicgaW4gJ29uZSBnbG9iZSBwZXIgY2hpbGQnLCBlLmcuICdoZSByZXBhcnRpZG8gZG9zIGdsb2JvcyBwb3IgbmnDsW8nIiwgIkNoYW5nZSBvZiBvbmUgdGhpbmcgZm9yIGFub3RoZXIgaW4gYSBiYXJ0ZXIgb3IgcGF5bWVudCwgZS5nLiAnTG8gY29tcHJhbW9zIHNvbG8gcG9yIDIwIGV1cm9zJyIsICJUaGUgbWVkaWEgYnkgd2hpY2ggcGVvcGxlIGNvbW11bmljYXRlLCBlLmcuICdoZW1vcyBoYWJsYWRvIHBvciB0ZWxlZm9ubyciLCAiQXNzaWduaW5nIGFuIGFjdGlvbiB0byBhbiBlbnRpdHksIGUuZy4gJ2Z1ZSBpbmljaWFkbyBwb3IgZWwgZ29iaWVybm8nIiwgIkluIEVuZ2xpc2gsIHRoZSAnZm9yJyBpbiAnbXkgZmVlbGluZ3MgZm9yIGhlcicsIGUuZy4gJ0RlYmVyaWEgY29tcHJlbmRlciBtaSBhbW9yIHBvciBlbGxhJyIsICJUaGUgdG9waWMgYWJvdXQgd2hpY2ggdGhlIHZlcmIgaXMsIGUuZy4gJ2Rlc3ByZW9jdXBhZGEgcG9yIGxvcyBjZWxvcyciLCAiVHVybnMgYW4gYWRqZWN0aXZlIGludG8gYW4gYWR2ZXJiLCBlLmcuICdKZW5ueSBzZSBjYWxtYSBwb3IgY29tcGxldGEnIiwiQWRkcmVzc2luZyB0aGUgdmVyYiB0byBhIHBlcnNvbiwgZS5nLiAncHJlZ3VudGUgcG9yIE1yLiBCYXJuZXMnIiwgIkluIEVuZ2xpc2gsIHRoZSAnZm9yJyBpbiAndGFrZSBoZXIgZm9yIHlvdXIgd2lmZSwnIGUuZy4gJ2RhcmxlIHBvciBlc3Bvc2EnIiwgIlBhcnQgb2YgYSBsYXJnZXIgcGhyYXNlLCB1bmxpa2VseSB0byBiZSBwYXJzZWQgYXMgYSBzZXBhcmF0ZSBmdW5jdGlvbiwgZS5nLiAncG9yIGZhdm9yJyBvciAncG9yIERpb3MnIikNCikNCnNlbWFudGljX2Z1bmN0aW9ucyAlPiUgIGd0KCkNCmBgYA0KYGBge3J9DQpsYWJlbHMgPC0gYygiR3JvdXAgMSIsICJHcm91cCAyIiwgIkdyb3VwIDMiKQ0KZ2V0X2NvZGVfbmFtZSA8LSBmdW5jdGlvbih2YXIxKSB7DQogIHggPSBzd2l0Y2godmFyMSwNCiAgICAgICAgICAgICAiUFUiPSAiUHVycG9zZSIsDQogICAgICAgICAgICAgIlJFIj0gIlJlY2lwaWVudCIsDQogICAgICAgICAgICAgIkRFIj0gIkRlc3RpbmF0aW9uIiwNCiAgICAgICAgICAgICAiVEQiPSAiVGltZSBkZXN0aW5hdGlvbiIsDQogICAgICAgICAgICAgIlBFIj0gIlBlcmNlcHRpb24iLA0KICAgICAgICAgICAgICJJTkYiPSAiSW5maW5pdGl2ZSIsDQogICAgICAgICAgICAgIklTIiA9ICJJbmRldGVybWluYXRlIHNwYWNlIiwNCiAgICAgICAgICAgICAiSVQiID0gIkluZGV0ZXJtaW5hdGUgdGltZSIsDQogICAgICAgICAgICAgIkNBIiA9ICJDYXVzZSIsDQogICAgICAgICAgICAgIkRJIiA9ICJEaXN0cmlidXRpb24iLA0KICAgICAgICAgICAgICJJTiIgPSAiSW50ZXJjaGFuZ2UiLA0KICAgICAgICAgICAgICJNRSIgPSAiTWVkaWEiLCANCiAgICAgICAgICAgICAiQVQiID0gIkF0dHJpYnV0aW9uL0FnZW5jeSIsDQogICAgICAgICAgICAgIkZFIiA9ICJGZWVsaW5ncyBmb3IiLA0KICAgICAgICAgICAgICJUTyIgPSAiVG9waWMiLA0KICAgICAgICAgICAgICJBRFYiID0gIkFkdmVyYmlhbGl6ZXIiLA0KICAgICAgICAgICAgICJBRCIgPSAiQWRkcmVzcyIsDQogICAgICAgICAgICAgIklEIiA9ICJJZGVudGl0eSIsDQogICAgICAgICAgICAgIlBPUiBGQVZPUiIgPSAiU3RvY2sgcGhyYXNlIiwNCiAgICAgICAgICAgICAiRk9SIERJT1MiID0gIlN0b2NrIHBocmFzZSIpDQogIHJldHVybih4KSAgDQp9DQp0IDwtIHQgJT4lDQogIG11dGF0ZSgic2VtYW50aWNfZnVuYyIgPSBzYXBwbHkoY29kZSwgZ2V0X2NvZGVfbmFtZSkpDQogICAgICAgICAgICAgICAgICAgDQpgYGANCg0KIyMjIFJlc3VsdHMNCkhlcmUgaXMgdGhlIGRpc3RyaWJ1dGlvbiBvZiBzZW1hbnRpYyBmdW5jdGlvbnMgZm9yIHRoZSB0d28gcHJlcG9zaXRpb25zLiBFYWNoIGlzIGRpc3RyaWJ1dGVkIHRocm91Z2ggYSB2YXJpZXR5IG9mIGZ1bmN0aW9ucy4gU29tZSBhcHBlYXIgaW4gYm90aCBwbG90cywgc29tZSBkbyBub3QuDQoNCmBgYHtyfQ0KDQp0ICU+JSBnZ3Bsb3QoYWVzKHg9Y29kZSkpICsgZ2VvbV9iYXIoKSArIGZhY2V0X2dyaWQofiBzcGFuaXNoX3ByZXBfZm91bmQsIHNjYWxlcz0iZnJlZSIpICsNCiAgbGFicyh4PSJTZW1hbnRpYyBGdW5jdGlvbiIpICsgZ3VpZGVzKHg9Z3VpZGVfYXhpcyhhbmdsZT00NSkpICsgc2NhbGVfeF9kaXNjcmV0ZShsYWJlbCA9IHQkc2VtYW50aWNfZnVuYykNCmBgYA0KDQpGb3IgZWFjaCBvZiB0aGUgRW5nbGlzaCB3b3JkcyBvciBwaHJhc2VzIGNvZGVkICh3aGVyZSBuPjEpLCBoZXJlIGlzIHRoZSBkaXN0cmlidXRpb24gYXMgdG8gd2hlbiB0aGV5J3JlIHJlYWxpemVkIHdpdGggJ3BvcicgYW5kIHdoZW4gd2l0aCAncGFyYScuIEEgc2lnbmlmaWNhbnQgbnVtYmVyIG9mIEVuZ2xpc2ggcHJlcG9zaXRpb25zIGNhbiBiZSByZWFsaXplZCBhcyBlaXRoZXIsIGRlcGVuZGluZyBvbiB0aGUgc2VtYW50aWMgZnVuY3Rpb24uIFRoaXMgaXMgYSBjaGFsbGVuZ2UgZm9yIEwxIEVuZ2xpc2ggTDIgbGVhcm5lcnMgb2YgU3BhbmlzaC4NCmBgYHtyfQ0KdCAlPiUgZ3JvdXBfYnkodHJhbnNsYXRlZF90bykgJT4lIGZpbHRlcihuKCkgPiAxKSAlPiUNCiAgZ2dwbG90KGFlcyh5PXRyYW5zbGF0ZWRfdG8sIGZpbGw9c3BhbmlzaF9wcmVwX2ZvdW5kLCBsYWJlbD1zcGFuaXNoX3ByZXBfZm91bmQpKSArDQogIGdlb21fYmFyKHBvc2l0aW9uPSJmaWxsIikgKyANCiAgbGFicyh4PSJQZXJjZW50IFJlYWxpemF0aW9uIiwgeT0iRW5nbGlzaCBUcmFuc2xhdGlvbiIpICsNCiAgc2NhbGVfZmlsbF9kaXNjcmV0ZShuYW1lPSJTcGFuaXNoIFByZXBvc2l0aW9uIikgDQpgYGANCg0KSGVyZSBpcyBhIHNjYXR0ZXJwbG90IG9mIGFsbCB0aGUgZGF0YSB0b2dldGhlci4gU2VtYW50aWMgZnVuY3Rpb25zIGFyZSBtdWNoIGNsb3NlciB0byBhIHNwbGl0IGJldHdlZW4gJ3BvcicgYW5kICdwYXJhJyAobW9zdCBjb2x1bW5zIGhhdmUgb25seSBwaW5rIG9yIGJsdWUpIHRoYW4gdGhleSBhcmUgYW1vbmcgRW5nbGlzaCBwcmVwb3NpdGlvbnMgKG1vc3Qgcm93cyBoYXZlIGJvdGggcGluayBhbmQgYmx1ZSksIGRlc3BpdGUgdGhlcmUgYmVpbmcgbWFueSBtb3JlIG9mIHRoZW0gaW4gdGhpcyBkYXRhc2V0Lg0KYGBge3J9DQp0ICU+JSBncm91cF9ieSh0cmFuc2xhdGVkX3RvKSAlPiUgZmlsdGVyKG4oKSA+IDEpICU+JQ0KICBnZ3Bsb3QoYWVzKGNvZGUsdHJhbnNsYXRlZF90bywgY29sb3I9c3BhbmlzaF9wcmVwX2ZvdW5kKSkgKyBnZW9tX2NvdW50KCkgKw0KICBsYWJzKHg9IlNlbWFudGljIEZ1bmN0aW9uIiwgeT0iRW5nbGlzaCBUcmFuc2xhdGlvbiIsIGNvbG9yPSJTcGFuaXNoIFByZXBvc2l0aW9uIikgKw0KICBzY2FsZV9maWxsX2Rpc2NyZXRlKG5hbWU9YygiU3BhbmlzaCBQcmVwb3NpdGlvbiIsICJuIikpICsgDQogIGd1aWRlcyh4PWd1aWRlX2F4aXMoYW5nbGU9NDUpLCBzaXplID0gZ3VpZGVfbGVnZW5kKHRpdGxlPSdOdW1iZXIgb2YgT2NjdXJyZW5jZXMnKSkgKw0KICBzY2FsZV94X2Rpc2NyZXRlKGxhYmVsID0gdCRzZW1hbnRpY19mdW5jKQ0KYGBgDQoNCg==