• Home
  • Clinical Data Scientist - Gratisol Labs

Clinical Data Scientist - Gratisol Labs

Introduction to Clinical Data Science

Are you interested in how to use data generated by doctors, nurses, and the healthcare system to improve the care of future patients? If so, you may be a future clinical data scientist!

This specialization provides learners with hands on experience in use of electronic health records and informatics tools to perform clinical data science. This series of six courses is designed to augment learner’s existing skills in statistics and programming to provide examples of specific challenges, tools, and appropriate interpretations of clinical data.

By completing this specialization you will know how to: 1) understand electronic health record data types and structures, 2) deploy basic informatics methodologies on clinical data, 3) provide appropriate clinical and scientific interpretation of applied analyses, and 4) anticipate barriers in implementing informatics tools into complex clinical settings. You will demonstrate your mastery of these skills by completing practical application projects using real clinical data.

This specialization is supported by our industry partnership with Google Cloud. Thanks to this support, all learners will have access to a fully hosted online data science computational environment for free! Please note that you must have access to a Google account (i.e., gmail account) to access the clinical data and computational environment.

This course will prepare you to complete all parts of the Clinical Data Science Specialization. In this course you will learn how clinical data are generated, the format of these data, and the ethical and legal restrictions on these data. You will also learn enough SQL and R programming skills to be able to complete the entire Specialization – even if you are a beginner programmer. While you are taking this course you will have access to an actual clinical data set and a free, online computational environment for data science hosted by our Industry Partner Google Cloud. At the end of this course you will be prepared to embark on your clinical data science education journey, learning how to take data created by the healthcare system and improve the health of tomorrow’s patients.


Describe how each type of clinical data are generated, specifically outlining who creates the data, when and why the data are generated.

  • Write SQL code to combine two or more tables using database joins.
  • Write R code to manipulate and tidy data including: selecting columns, filtering rows, and joining data sets.
  • Write markdown formatted text and combine with R code in RMarkdown documents.