Introduction to Data Analysis using STATA

About Course

Instructor:

Dr. Richa Shah

Global Health Researcher | Epidemiologist & Gerontologist | Consultant, IARC (WHO) | Research Associate, Health Action and Research

Dr. Richa Shah is a medical doctor and global health professional specializing in epidemiology and gerontology. She earned her Master’s in Global Health (2017) and PhD in Global Health and Gerontology (2020) from The University of Tokyo, Japan.

Dr. Shah has worked as a postdoctoral researcher and is currently serving as a consultant at the International Agency for Research on Cancer (IARC), part of the World Health Organization, where she contributes to high-impact studies on global cancer trends. As a research associate at Health Action and Research (HAR), she has led numerous training programs on research methodology and data analysis.

Expertise & Teaching Approach
Dr. Shah combines strong academic rigor with practical experience in data analysis and managing large health datasets. She empowers learners to build the analytical skills and confidence needed to excel in global health research.

Course Details:

  • The course consists of 8 modules designed to teach the fundamentals of data analysis using Stata, a widely used statistical software in health research.

  • Beginner-friendly: Introduces core concepts on data cleaning, descriptive statistics, data visualization, and basic inferential methods. However, some foundational knowledge on basic data analysis could be beneficial.

  • Gain hands-on experience with real-world datasets and develop practical skills to perform meaningful analyses and interpret results confidently.

  • Suitable for students, researchers, and professionals, providing a strong foundation to start your journey with Stata.

Course Structure:

  • Each module contains specific lessons; some modules have one lesson, while others include more lessons.

  • Combines theoretical concepts with practical application using real-world datasets.

  • Exercise files and resources for each lesson are provided in the Exercise Files section.

  • For best results, follow the instructions using Stata commands demonstrated in the videos and provided in the exercise files.

Questions and Support

Certification:

  • Learners who complete all modules and pass all quizzes will receive a certificate of completion.
  • Learners are allowed to attempt quizzes multiple times to achieve a passing score.

Requirements:

  • A computer ready for installation.

  • Stata is installed on your computer (you may use your existing version if it is already installed).

  • If you do not have Stata installed, download the trial version from the following link:
    https://www.stata.com/customer-service/stata-evaluation/

  • Refer to the video tutorial in Module 0 Introduction for guidance on installing Stata. (Enrollment is required for access to the video.)

Note:

  • It takes 2-3 business days to have access to the trial version of Stata.

  • The trial version is valid for approximately two weeks to one month. If you are using the trial version, please plan and manage your time accordingly.

Show More

What Will You Learn?

  • Understand the fundamentals of Stata and its interface
  • Import, clean, and manage datasets
  • Perform descriptive and inferential statistical analyses
  • Apply bivariate analysis techniques such as t-tests, chi-square tests, and correlations
  • Apply multivariable analysis techniques such as linear regression and logistic regression
  • Visualize data effectively using Stata
  • Interpret statistical results and present findings with confidence
  • Use shortcut commands, loops, and other techniques to improve efficiency in your workflow

Course Content

Module 0: Introduction to Stata

  • 0.1 Introduction to Stata and overview of the course
    06:18

Module 1: Stata Interface

Module 2: Importing data, types of variables, grammar of STATA commands, setting a working directory
This module covers importing datasets, understanding variable types and labels, browsing and exploring data, and saving datasets. You will also learn how to import a dataset into Stata. For practice, please download the dataset (Stata_practice_dataset_2023.xlsx) provided in the Exercise Files.

Module 3: Data browsing, cleaning, filtering, labeling, renaming, dropping

Module 4: Generating and modifying variables
This module is divided into six sections which cover how to create new variables, recode existing ones, group or categorize data, convert string variables to numeric and finally, compute internal consistency using Cronbach’s alpha.

Module 5: Data Visualization in STATA and presenting descriptive stats

Module 6: Bivariate Analysis

Module 7: Multi variable analysis

Module 8: Addtional commands