Conducting High-Value Secondary Dataset Analysis: An Introductory Guide and Resources

TitleConducting High-Value Secondary Dataset Analysis: An Introductory Guide and Resources
Publication TypeJournal Article
Year of Publication2011
AuthorsAyanian, JZ, Covinsky, KE, Landon, BE, McCarthy, EP, Wee, CC, Steinman, MA
JournalJournal of General Internal Medicine
ISSN Number0884-8734
KeywordsDatasets, Meta-analyses, Survey Methodology

Secondary analyses of large datasets provide a mechanism for researchers to address high impact questions that would otherwise be prohibitively expensive and time-consuming to study. This paper presents a guide to assist investigators interested in conducting secondary data analysis, including advice on the process of successful secondary data analysis as well as a brief summary of high-value datasets and online resources for researchers, including the SGIM dataset compendium ( The same basic research principles that apply to primary data analysis apply to secondary data analysis, including the development of a clear and clinically relevant research question, study sample, appropriate measures, and a thoughtful analytic approach. A real-world case description illustrates key steps: (1) define your research topic and question; (2) select a dataset; (3) get to know your dataset; and (4) structure your analysis and presentation of findings in a way that is clinically meaningful. Secondary dataset analysis is a well-established methodology. Secondary analysis is particularly valuable for junior investigators, who have limited time and resources to demonstrate expertise and productivity.

Citation Key8744
PubMed ID21301985
PubMed Central IDPMC3138974