Pre-wrangle

Want a data format that has the utterances divied up for throwing into sbert

take 1: - divide on end punctuation (.!?) and on returns - include speaker and listener (easy to filter later)

?: do we use the pre or post filtered/cleaned? here we’ll use raw (no spellcheck, no filter)

take two : we concatenate everything (with a space) still no spellcheck or filtering

Interlude

This is where they need to get embedding with jupyter

Post-wrangle

separate approach

[Deprecated] We embed each sentence separately, but then average across the embeddings to get a vector for each player:trial combo.

concat approach

How different are average versus concat embeddings:

9672 of the rtest %>% nrow()` person-trial combos were single line utterances to begin with, so these are the same embedding regardless of method.

Of the others, this is the cosine similarity distribution between the two methods (grouped by player count).

So, average similarity is .87ish for that half (and 1 for the just identical half), but there are some outliers thare lower than .8.

Switching below diagrams to use concatenate method.

(Still questions about cleaning etc.)

Caveats to interpretation

This is done on the raw transcripts. Some things that could contribute to “similarity” include larger strings (because there will be averaging, which tends to bring things in) and more non-reference language which we expect to be more uniform (maybe?).

We could attempt to “solve” these in future by a) annotating for the “reference” expression (or using cleaned transcript as a half way point).

Across game divergence:

Do tangram descriptions diverge sooner in smaller groups? How similar are descriptions of the same tangram in the same block across games?

(only looking at speaker utts)

Within the same tangram, repNum, and numPlayers, but across different games

Similarity is higher in larger games and earlier in the games. See that 6noro starts out looking like a 6p games and ends looking like a 2-3 player game.

Within game specialization

(only looking at speaker utts)

in a game, in a round, how different are the descriptions for different tangrams?

Within game, repNum, numPlayers, between tangrams

Earlier reps have more similarity, as do large games.

Tangram description evolution

How similar is a tangram description on round N to N+1 as a function of rep and game size?

6noro has really similar descriptions throughout.

Pairs of adjacent rounds have more similar descriptions when the rounds are later (4-5 is closer than 0-1). Descriptions are more similar with larger groups in 0-1, but less similar in larger groups in 2-3, 3-4, and 4-5.

Speaker sim to first round

Speaker sim to last round

Here’s where 6noro really looks like a 2p game.

sample transcript

A semi-random selection of games & tangrams, trying to get some diversity.

earlier	later	earlierText	laterText	sim
0	1	dinosaur one leg lifted head backwards	head tilted backward, one leg raised little standing position	0.6322160
0	2	dinosaur one leg lifted head backwards	looks like a stomping penguin to me, you called it a dinosaur square on upper right side	0.4868374
0	3	dinosaur one leg lifted head backwards	penguin stomping feet, dinosaur, square head on the right	0.5793958
0	4	dinosaur one leg lifted head backwards	dinosaur, head tilted backward right	0.8121429
0	5	dinosaur one leg lifted head backwards	stomping penguin dinosaur	0.5108741
1	2	head tilted backward, one leg raised little standing position	looks like a stomping penguin to me, you called it a dinosaur square on upper right side	0.2924130
1	3	head tilted backward, one leg raised little standing position	penguin stomping feet, dinosaur, square head on the right	0.3743544
1	4	head tilted backward, one leg raised little standing position	dinosaur, head tilted backward right	0.5450624
1	5	head tilted backward, one leg raised little standing position	stomping penguin dinosaur	0.1429436
2	3	looks like a stomping penguin to me, you called it a dinosaur square on upper right side	penguin stomping feet, dinosaur, square head on the right	0.8326169
2	4	looks like a stomping penguin to me, you called it a dinosaur square on upper right side	dinosaur, head tilted backward right	0.5779020
2	5	looks like a stomping penguin to me, you called it a dinosaur square on upper right side	stomping penguin dinosaur	0.7573282
3	4	penguin stomping feet, dinosaur, square head on the right	dinosaur, head tilted backward right	0.6258747
3	5	penguin stomping feet, dinosaur, square head on the right	stomping penguin dinosaur	0.8331335
4	5	dinosaur, head tilted backward right	stomping penguin dinosaur	0.5017789

earlier	later	earlierText	laterText	sim
0	1	sitting on the ground facing left	sitting facing left, can’t see arms, legs are outstretched	0.5795866
0	2	sitting on the ground facing left	sitting, facing left, head bowed, legs outstretched some	0.6725468
0	3	sitting on the ground facing left	mopey guy facing left legs outstretched	0.1782776
0	4	sitting on the ground facing left	mopey guy	0.0669464
0	5	sitting on the ground facing left	moper	0.0535394
1	2	sitting facing left, can’t see arms, legs are outstretched	sitting, facing left, head bowed, legs outstretched some	0.7637988
1	3	sitting facing left, can’t see arms, legs are outstretched	mopey guy facing left legs outstretched	0.4570391
1	4	sitting facing left, can’t see arms, legs are outstretched	mopey guy	0.0721988
1	5	sitting facing left, can’t see arms, legs are outstretched	moper	-0.0354502
2	3	sitting, facing left, head bowed, legs outstretched some	mopey guy facing left legs outstretched	0.4424953
2	4	sitting, facing left, head bowed, legs outstretched some	mopey guy	0.1479361
2	5	sitting, facing left, head bowed, legs outstretched some	moper	0.0124740
3	4	mopey guy facing left legs outstretched	mopey guy	0.6457207
3	5	mopey guy facing left legs outstretched	moper	0.3183658
4	5	mopey guy	moper	0.5250157

earlier	later	earlierText	laterText	sim
0	1	weird shape. like a palm tree or bunny ears from the head body is of 2 triangles. looking right	body looks like a triangle head has 2 bunny ears sticking out	0.7387307
1	2	body looks like a triangle head has 2 bunny ears sticking out	bunny ears traiangle body	0.7907372
2	3	bunny ears traiangle body	bunny ears	0.8592576
3	4	bunny ears	the bunny or girl in a kimono facing right yes	0.5323732
4	5	the bunny or girl in a kimono facing right yes	bunny ears	0.5323732

earlier	later	earlierText	laterText	sim
0	1	this is the triangle head person haha	this is the one with the triangle ears like a bunny	0.5474127
0	2	this is the triangle head person haha	good ol triangle head	0.6664659
0	3	this is the triangle head person haha	our fave mr triangle head	0.6533870
0	4	this is the triangle head person haha	triangle head ftw!	0.6695504
0	5	this is the triangle head person haha	triangle head<3 our fave :’( haha	0.7322333
1	2	this is the one with the triangle ears like a bunny	good ol triangle head	0.4983486
1	3	this is the one with the triangle ears like a bunny	our fave mr triangle head	0.4054277
1	4	this is the one with the triangle ears like a bunny	triangle head ftw!	0.4208512
1	5	this is the one with the triangle ears like a bunny	triangle head<3 our fave :’( haha	0.4181670
2	3	good ol triangle head	our fave mr triangle head	0.7457436
2	4	good ol triangle head	triangle head ftw!	0.7452906
2	5	good ol triangle head	triangle head<3 our fave :’( haha	0.6989794
3	4	our fave mr triangle head	triangle head ftw!	0.6391654
3	5	our fave mr triangle head	triangle head<3 our fave :’( haha	0.7786541
4	5	triangle head ftw!	triangle head<3 our fave :’( haha	0.6734986

this example really raises the question of what to do with non-referential language (or dubiously deferential language). Like the sentiment in “good ol” and “our fave” and “haha” is also relevantly similar to later iterations, but it’s not “reference per se”?

We could try to pull just the referential descriptions out?

Listener - listener - speaker alignment (same round)

Include listener stuff group within game, within tangram, within round, across utterer

code for includes speaker?

(This is nonsense)

Correlates of success!

We want to look at whether properties of the utterances (like distinctiveness) are related to how successful people were that trial.

We’re going to need to un upper-triangle this!

First try grouping by rep and game

We can also un upper triangle this to look at a per tangram basis – we see that more distinctive tangram descriptions (lower similarity) correlate with higher success rates.

(Note this is a correlation, causality is likely … complicated and cyclic)

TODO future

Is there anything good to do with listener data?

How to look for “same speaker” effects? Would want to do same tangram over all the rounds and then use final (or first round) distance, and same speaker?. Where there’s a same speaker is deeply confounded with group size!!!!

Embeddings