문 건 웅
2014-6-13
The Confirmation of Results and Conclusions From One Study obtained independently in another
is considered the scientific gold stanadrd
Replication of findings and conducting studies with independent
Replication is particularly important in studies that can impact broad policy or regulatory decisions
Some studies cannot be replicated
대규모 , 장기간 연구가 아닌 경우
Selected articles published in Nature Genetics between January 2005 and December 2006 that had used profiling with microarrays
Of the 56 items retrieved electronically, 20 articles were considered potentially eligible for the project
The four teams were from
Each team was comprised of 3-6 scientists who worked together to evaluate each article.
Result could be reproduced : n=2
Reproduced with discrepancy : n=6
Could not be reproduced : n=10
In the Discovery/Test Validation stage of omics-based tests:
library(compareGroups)
data(predimed)
predimed 데이타의 모든 변수를 그룹별로 비교할거다
res=compareGroups(group ~ . , data=predimed)
표를 만들어라
createTable(res)
그룹별로 나눠서 비교할거다
res = compareGroups(group ~ .-sex-hormo, data=predimed)
모든 환자, 남자환자만, 여자환자만 따로 표를 세개 만든다
alltab=createTable(res)
femaletab=createTable(update(res,subset=sex=="Female"))
maletab=createTable(update(res,subset=sex=="Male"))
표 세개를 합친다.
cbind("ALL"=alltab,"FEMALE"=femaletab,"MALE"=maletab)
Minimal Effort ; maximal result
Nice plots
First have an idea
e.g. stopping distance correlate with speed ?
All results(figures, tables) manually imported to Word
in a single dynamic document
In literate programming, an analytical document is composed of a descriptive narrative “woven” together with software code and computed results.
Advantages
*Coursera : Reproducible Research (Jones Hopkins University)