reference

No plotting before analysis

Measurement issues

Representative

The mathematical theorems which justify most frequentist statistical procedures apply only to random samples. The similar thing to independence= unrelated= randomization at certain degree.

Sample size and Power

Interpreting results

Multiple comparisons

Dividing a Continuous Variable into Categories

Inappropriate Method of Analysis

Pseudoreplication (not independent)

  • Most models for statistical inference require true replication. True replication permits the estimation of variability within a treatment. Without estimating variability within treatments, it is impossible to do statistical inference. Here, replication refers to having more than one experimental (or observational) unit with the same treatment. a treatment is independently applied.

  • If not, variability will probably be underestimated.

  • If not, confidence intervals that are too small.

  • An inflated probability of a Type I error (falsely rejecting a true null hypothesis).

  • Do whatever is possible to minimize lack of independence in the the pseudo-replicates and increase randomization.

  • Observational studies are particularly prone to pseudoreplication.

Using confidence intervals when prediction intervals are needed

Overinterpreting High R2

In many areas of the social and biological sciences, an R2 of about 0.50 or 0.60 is considered high. vice versa. where the response was independent of all the predictors (so all regressors have coefficient zero in the true mean function), but R2 = 0.59.2. ref

Alternatives to Stepwise Selection variables (may pitfall)

Suggestions for Researchers

planning research

  • involve a experienced statistician at the begin of the study.

  • it may be wise to limit your study.

  • think about how you will gather and analyze it before you start to gather the data.

  • design affects what method of analysis is appropriate.

  • Be sure to record any time and spatial variables present, whether or not you initially plan to use them in your analysis.

  • Also think about any factors that might make the sample biased.

  • Think carefully about what measures you will use.

  • If you are gathering observational data, think about possible confounding factors and plan your data gathering to reduce confounding.

  • Think carefully about how you will randomize or sample

  • Think carefully about whether or not the model assumptions of your intended method of analysis are likely to be reasonable.

  • Conduct a pilot study to trouble shoot and obtain variance estimates for a power analysis.

  • Decide on appropriate levels of Type I and Type II error, taking into account consequences of each type of error.

  • Plan how to deal with multiple inferences, including “data snooping” questions

  • How you plan to handle missing data.

analyzing data

  • ask whether or not the model assumptions of the procedure are plausible in the context of the data.

  • Plot the data as possible to get additional checks on whether or not model assumptions hold.

  • if model assumptions appear to be violated, consider transformations of the data, or use alternate methods of analysis as appropriate.

  • be sure to take that into account by using appropriate methodology for multiple comparisons.

  • Keep careful records of decisions made in data cleaning and in using software.

writing

  • Include enough detail so the reader can critique both the data gathering and the analysis.

  • Look for and report possible sources of bias

  • be sure to reiterate or summarize the limitations in stating conclusions

  • a website to accompany the article.

  • Include discussion of why the analyses used are appropriate.

  • Have the authors taken practical significance as well as statistical significance into account in drawing conclusions?

  • follow items to Evidence Based Medicine reports