Trials were excluded if the speaker did not provide any description before the listener clicked.

Accuracy over time

The practice trials are great – probably some improvement over time in regular trials?

Rate of echoing

I may be missing some echoes, although the rate of echoing is high enough we can’t subset based on it (not to mention it’s not independent of child behavior).

Speed to start of description

I was trying to look at whether speakers initiated descriptions faster later on. Some weird negative outliers suggest a timing glitch in at least one expt. But also, it doesn’t look like this is true (also requires relying on more layers of timing accuracy & alignment).

TO DO try to understand negative outliers?

Speed to response

How long do trials take?

Note, some high outliers cut out of view.

speed to response does get faster over time!

Length of description from speaker

Going up slightly, if anything, not down. (Although for this task, not sure I’d expect adults to go down rather than to start at fast and stay there). But it’s still different!

Could also look at total words that are at least vaguely game related, although this will have “it looks like”, repetition, and inconsistently tagged “Yes” in response to “do you see it” so idk if that’s useful

Sbert

Check practice trials

We expect when it’s the same, high agreement and when they’re different, low agreement. This checks out.

Within a game

Descriptions from different kids of the same item are more similar than diff items. Same item is described more like partner describes it than like random other kid does. (Note that within game we can only compare targets across blocks, so doing cross-block for everything)

Do descriptions get more different?

Not seeing noticeable change over time.

For above, could try subsetting by successful utterances or something.

The fun part

What sorts of wacky descriptions do kids use successfully?

Check pre-reg thingy

TODO: We’re probably not doing various things we said we’d do, but check anyway!