MS Data Management Working Group
6/20/18
It's not a bad place.
So:
Conceptually, the final report has two main parts:
But what about the executive summary, you ask?
There are other parts to the report.
So I'd like to cover just the two I mentioned in a bit more depth!
The questions we've asked mostly count things or place them in categories.
We've also chosen a fairly high level of analysis.
This was to get results.
This limits the formative use of our analysis!
Most causal and even relational analyses are ruled out (or very complicated).
Counts and categories are useful to get the lay of the land.
But they don't explain or show a way forward!
That's not good or bad – it's just the nature of the data.
Some of the more complicated analyses might be useful in-house.
But I'm not sure we want to present high-dimensional regressions here!
A model like cost ~ software mix + agency + agency size + data volume might be pretty interesting, though…
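To make "high-dimensional" concrete, here is a minimal sketch of how a formula like that expands into a design matrix. Everything in it is an assumption for illustration: the rows, the field names (software_mix, agency, agency_size, data_volume), the helper design_columns, and the choice of treatment (dummy) coding are all hypothetical and are not drawn from our survey data.

```python
# Hypothetical rows; every value is invented purely for illustration.
rows = [
    {"agency": "A", "software_mix": "mix_1", "agency_size": 1200, "data_volume": 40.0},
    {"agency": "B", "software_mix": "mix_2", "agency_size": 300, "data_volume": 5.5},
    {"agency": "C", "software_mix": "mix_3", "agency_size": 800, "data_volume": 12.0},
    {"agency": "D", "software_mix": "mix_1", "agency_size": 150, "data_volume": 2.0},
]


def design_columns(rows, categorical, numeric):
    """Expand predictors into named columns using treatment (dummy) coding."""
    names = ["intercept"]
    kept_levels = {}
    for var in categorical:
        # Sort the observed levels; the first is the reference and gets no column.
        levels = sorted({r[var] for r in rows})
        kept_levels[var] = levels[1:]
        names += [f"{var}[{level}]" for level in levels[1:]]
    names += list(numeric)

    matrix = []
    for r in rows:
        line = [1.0]  # intercept column
        for var in categorical:
            line += [1.0 if r[var] == level else 0.0 for level in kept_levels[var]]
        line += [float(r[var]) for var in numeric]
        matrix.append(line)
    return names, matrix


names, X = design_columns(
    rows,
    categorical=["software_mix", "agency"],
    numeric=["agency_size", "data_volume"],
)
print(len(X), "rows,", len(names), "columns")
for name in names:
    print(" ", name)
```

Each categorical predictor contributes one column per level beyond the first, so the column count grows quickly as agencies and software mixes are added. In this toy case there are already more columns than rows.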
Okay, okay.
What this means is that a lot of weight will be carried by the best practices section.
And that means we should devote some thought to how it should look!
While we can mine the literature, we need focus.
Best practices are always best for some purpose.
What do we want to emphasize, in what proportion?
The point of data is to let us know stuff. And know that we know!
Fundamentally, collections of data must:
If you don't have this, all other considerations fall by the wayside.
As I said, this is going to be largely descriptive.
And we hope to tell a lot of our story graphically.
For instance, let's consider the questions about data volume and growth by DBMS.
We might see several things…
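As a rough sketch of the summary that would sit behind such a chart, the snippet below totals volume per DBMS and computes year-over-year growth. The DBMS labels, the numbers, the two-year layout, and the helper volume_and_growth are all hypothetical; the real survey responses may be shaped quite differently.

```python
from collections import defaultdict

# Hypothetical responses: (DBMS label, volume last year in TB, volume this year in TB).
# These numbers are invented for illustration only.
responses = [
    ("dbms_a", 10.0, 12.0),
    ("dbms_a", 30.0, 33.0),
    ("dbms_b", 5.0, 10.0),
    ("dbms_c", 8.0, 8.0),
]


def volume_and_growth(responses):
    """Total the volume per DBMS and compute relative growth between the two years."""
    totals = defaultdict(lambda: [0.0, 0.0])
    for dbms, prior, current in responses:
        totals[dbms][0] += prior
        totals[dbms][1] += current

    summary = {}
    for dbms, (prior, current) in totals.items():
        # Growth is undefined when there was no prior volume to compare against.
        growth = (current - prior) / prior if prior else None
        summary[dbms] = {"volume_tb": current, "growth": growth}
    return summary


summary = volume_and_growth(responses)
for dbms in sorted(summary):
    entry = summary[dbms]
    growth = "n/a" if entry["growth"] is None else f"{entry['growth']:.1%}"
    print(f"{dbms}: {entry['volume_tb']:.1f} TB, growth {growth}")
```

A table like this maps directly onto a bar chart of volume by DBMS, with growth as a second series or an annotation.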
Questions?