(Source: http://www.stat.cmu.edu/~brian/701/notes/paper-structure.pdf)
A data analysis report is somewhat different from other types of professional writing that you may have done or seen, or will learn about in the future. It is related to but not the same as:
The overall structure of a data analysis report is simple:
The data analysis report is written for several different audiences at the same time:
Primary audience: A primary collaborator or client. Reads the Introduction and perhaps the Conclusion to find out what you did and what your conclusions were, and then perhaps fishes/skims through the Body, stopping only for some additional details on the parts that he/she thought were interesting or eye-catching. Organize the paper around an agenda for a conversation you want to have with this person about what you’ve learned about their data: e.g., from most general to most specific, or from most important to least important, etc. Provide the main evidence from your analysis (tabular, graphical, or otherwise) in the Body to support each point or conclusion you reach, but save more detailed evidence, and other ancillary material, for the Appendix.
Secondary Audience: An executive person. Probably only skims the Introduction and perhaps the Conclusion to find out what you did and what your conclusions are. Leave signposts in the Introduction, Body and Conclusion to make it easy for this person to swoop in, find the “headlines” of your work and conclusions, and swoop back out.
Secondary Audience: A technical supervisor. Reads the Body and then examines the Appendix for quality control: How good a job did you do in (raising and) answering the interesting questions? How efficient were you? Did you reach reasonable conclusions by defensible statistical methods? Etc. Make specific cross-references between the Body and specific parts of the Appendix so that this person can easily find supporting and ancillary material related to each main analysis you report in the Body. Add text to the technical material in the Appendix so that this person sees how and why you carried out the more detailed work shown in the Appendix.
The data analysis report has two very important features:
It is less important to worry about the latter two items in the Appendix which is expected to be more detailed and process-oriented. However, there should be enough text annotating the technical material in the Appendix so that the reader can see how and why you carried out the more detailed work shown there.
The data analysis report isn’t quite like a research paper or term paper in a class, nor like a research article in a journal. It is meant, primarily, to start an organized conversation between you and your client/collaborator. In that sense it is a kind of “internal” communication, sort of like an extended memo. On the other hand it also has an “external” life, informing a boss or supervisor what you’ve been doing.
Now let’s consider the basic outline of the data analysis report in more detail:
Good features for the Introduction include:
The above is a pretty good order to present this material in as well.
The body can be organized in several ways. Here are two that often work well:
This format is very familiar to those who have written psych research papers. It often works well for a data analysis paper as well, though one problem with it is that the Methods section often sounds like a bit of a stretch: In a psych research paper the Methods section describes what you did to get your data. In a data analysis paper, you should describe the analyses that you performed. Without the results as well, this can be pretty sterile sounding, so I often merge these “methods” pieces into the “Analysis” section when I write.
2. Analysis
2.1 Success Rate Methods Analysis Conclusions
2.2 Time to Relapse Methods Analysis Conclusions
2.3 Effect of Gender Methods Analysis Conclusions
2.4 Hospital Effects Methods Analysis Conclusions
Etc. . .
Other organizational formats are possible too. Whatever the format, it is useful to provide one or two well-chosen tables or graphs per question in the body of the report, for two reasons: First, graphical and tabular displays can convey your points more efficiently than words; and second, your “skimming” audiences will be more likely to have their eye caught by an interesting graph or table than by running text. However, too much graphical/tabular material will break up the flow of the text and become distracting; so extras should be moved to the Appendix.
The conclusion should reprise the questions and conclusions of the in- troduction, perhaps augmented by some additional observations or details gleaned from the analysis section. New questions, future work, etc., can also be raised here.
One or more appendices are the place to out details and ancillary materials. These might include such items as
In all cases, and especially in the case of computer code, it is a good idea to add some text sentences as comments or annotations, to make it easier for the uninitiated reader to follow what you are doing. It is often difficult to find the right balance between what to put in the appendix and what to put in the body of the paper. Generally you should put just enough in the body to make the point, and refer the reader to specific sections or page numbers in the appendix for additional graphs, tables and other details.
We make this the example for a data report.
This is an R Markdown Notebook. When you execute code within the notebook, the results appear beneath the code.
Try executing this chunk by clicking the Run button within the chunk or by placing your cursor inside it and pressing Cmd+Shift+Enter.
plot(cars)
Add a new chunk by clicking the Insert Chunk button on the toolbar or by pressing Cmd+Option+I.
When you save the notebook, an HTML file containing the code and output will be saved alongside it (click the Preview button or press Cmd+Shift+K to preview the HTML file).
(str(cars))