NGO Management and Policy Development · MSD2736

Comprehensive Statistics Using Stata

The course prepares students to utilize STATA, one of the most commonly used statistical analysis software packages, and develop a deeper understanding of econometrics.  This training specifically deals with management, analysis and graphing of quantitative data using STATA. The training combines theoretical foundations with methodological perspectives, aiming at equipping participants with knowledge on use of quantitative data in social research using STATA

Comprehensive Statistics Using Stata
Online fee from
$200 USD
Standard: $300 · Save $100
Course Outcomes

Expected learning outcomes

  • Generate variables that contain summaries of the data
  • Create a summary dataset using collapse
  • Navigate a dataset using _n, _N and subscripted variables
  • Rearrange a dataset using reshape
  • Combine multiple datasets using merge and append
  • Identify duplicate observations
  • Export data to a spreadsheet
  • Create tailor-made publication quality graphs
  • Understand what macros and scalars are
  • Be able to use foreach and forvalues loops
  • Understand and use if statements 
  • Understand how Stata stores estimation results 
  • Be able to access and use stored estimation results 
  • Know how to export results using user-written commands: estout, outreg, tabout 
  • Understand how Stata programs work
  • Be able to write and use a simple Stata program 
  • Be able to write a do-file which exports results using file
Curriculum

Course modules and outline

Module one

  • Introduction to Statistics Concepts
  • Descriptive Statistics
  • Inferential Statistics
  • Research process
  • Concepts and Software for Data Collection and Processing
  • Planning data gathering and processing
  • Introduction to mixed methods for Quantitative Data Collection (Web, and Mobile tools)
  • Use of Mobile Phones for Data Collection (Open Data Kit)
  • STATA Basics
  • Learn how to use Dialog Boxes
  • The command widow
  • User-written extensions
  • Directory Management
  • Obtaining help and perform search
  • STATA syntax
  • Using Basic Statistical Commands
  • Reusing results of STATA commands

Module Two

  • Data Structures and Types of Variables
  • Distinguishing between Categorical data, Continuous data; Numerical, string and date/time variables
  • Managing missing data
  • Creating dummy variables
  • Other special purpose variables
  • Converting datasets into summaries
  • Labeling variables, encoding variables and generating new variables
  • Data Management using STATA
  • Import, Export, load and save datasets
  • Create pseudorandom datasets
  • Review and document the dataset
  • Sorting and ordering
  • Appending, merging and reorganizing datasets
  • Validate data structure
  • Identify duplicate observations
  • Alarming and nonsensical data
  • Benefits of filtering

Module Three

  • Output Management
  • Logs for output
  • Use of translators for exporting STATA files and output
  • Copy and paste from STATA to text editors and spreadsheets
  • Reproduction of past work, stored results and saved results
  • The command display, explicit subscripting and important prefixes
  • Basics of STATA programming
  • Executing commands using do-files
  • Proper structure of do-files
  • Using comments
  • Writing long commands

Module Four

  • Basics of graphing with STATA
  • Basics of Graphing Qualitative Data
  • Basics Graphing Quantitative Data
  • Dialog Boxes vs. Do-file routines
  • Inspecting the data prior graphing
  • Reducing the data dimension to speed up graphing
  • Setting range of variation
  • Subgroups and overlays in STATA Graphics
  • Graphing by categorical groups
  • Subgroup options
  • Formatting graph text
  • Superimposing densities and other graphs
  • Legends for multiple graphs and multiple axes
  • Generalized syntax for overlaying multiple graphs
  • The -two way-Command
  • Scatter graph
  • Linear fits and nonlinear fits
  • Parametric density estimators and Non-parametric density estimators
  • Soft coded vs. hard coded syntax.
  • Using loops for multiple overlays

Module Five

  • Advanced graphing
  • Combining multiple graphs side-by-side
  • Recasting two way plots
  • Reproducing formatting
  • Creating your own graph scheme
  • The graph editor as a scheme maker
  • Quantitative Analysis using STATA
  • Tests for Statistical Inference
  • Tests of Association
  • Tests of Difference
  • Linear regression analysis
  • Ordinary least squares
  • Predicted values and residuals
  • Correlation and Standardized Regression Coefficients
  • Hypothesis testing
  • Problems with regression
  • Introduction to panel data analysis
  • Advantages of panel data analysis
  • Panel data sets
  • Balanced and unbalanced panels
  • Panel data dimensions and frequencies
  • Properties of estimators
  • Graphing panel data
Target Audience

Who should attend?

The course is designed for people who want to be more efficient in their use of Stata. Those who are already experienced in using Stata for data analysis will benefit most from the course

Why Attend

Key course benefits

Practical capacity building approach
Designed for government, NGOs and public institutions
Real-world African case studies
Certificate of completion
Applicable tools and templates
Interactive facilitation and exercises
Course Enquiry

Need more information?

Ask Magna Skills about this course

Use the PHPMaker enquiry form to request a quotation, proposal letter, invoice, group training package, online access, or face-to-face training arrangement.