Research Pipeline

Below is an example pipeline or process for conducting research with Health Data such as EHR data. The list is by no means exhaustive. However, it is a good place to start. A page could be written on each of the steps in the pipeline - and most likely will be in future releases of CODIAC for Health.

  1. Conduct a literature review.

  2. Explicitly describe the research question.

  3. Form an interdisciplinary team that can guide and perform each step of the study.

  4. Fully specify the research protocol in advance of executing the study.

  5. Apply for IRB approval of the study.

  6. Apply for an Institutional Reliance Agreement, if necessary.

  7. Execute a Data (Transfer and) Use Agreement (DUA, DTUA), as required

  8. Comply with any application and approval procedures set forth by the data provider.

  9. Request access to / Set up computing infrastructure, as necessary.

  10. Assess the suitability (strengths and weaknesses) of the dataset(s) to be used in the study.

  11. Assess the quality of the dataset(s).

  12. Define the study cohort (and matching cases, if applicable).

  13. Create standard code sets for each clinical concept in the cohort definition and every independent and dependent variable.

  14. Compose a computable data request / data extraction specification.

  15. Clean and stage extracted data for analysis; handle missing values according to protocol.

  16. Characterize the study cohort (and matching cases, if applicable).

  17. Adjust for any bias or confounders in the data.

  18. Analyze the data according to protocol.

  19. Produce research products.

  20. Comply with any review procedures required by the data provider.

  21. Publish your work!

Resources

Articles

  • Shang N, Weng C, Hripcsak G. A conceptual framework for evaluating data suitability for observational studies. J Am Med Inform Assoc. 2018 Mar 1;25(3):248-258. doi: 10.1093/jamia/ocx095. PMID: 29024976; PMCID: PMC7378879.

Books

Last updated