Data Preparation and Visualization for Assessing Level and Trends of Premature Mortality from Noncommunicable Diseases (NCD)
Ramon Martinez, Technical Adviser in Health Metrics
[email protected]
@HlthAnalysis
Alteryx Webinar Panel: Preparing data for your Conference. October 13th, 2015
Contents
• Introducing data prep for assessing level and trends of premature mortality from NCDs in countries of the Americas
• Data analytics and visualization platform
• How data is prepared and integrated
• Visualization tools for assessing trends of premature mortality from NCDs
Introduction
Noncommunicable diseases (NCDs) (cardiovascular disease, cancer, diabetes, and chronic respiratory diseases) are leading causes of death and disability worldwide. NCDs account for 63% of global deaths (37 million annually), with 80% of these deaths occurring in low- and middle-income countries. Almost a third of NCD deaths in poor countries occur prematurely, in people under 60 years old. Monitoring the progress that countries are making in reducing the burden of NCDs is a key function of public health surveillance for NCD prevention and control.
Challenge
Assessing and monitoring the level and trends of premature mortality from NCDs requires several large and heterogeneous data sources:
1. Registered death data (vital statistics) from death certificates, reported by national authorities (official data) from 48 countries of the Americas
2. Population estimates by country, year, sex, and age group
3. Other reference data (e.g., world standard population; catalogue of causes of death, country, sex, and age groups)
Data preparation and integration processes using Alteryx
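The integration of death counts with population estimates can be illustrated outside Alteryx with a minimal Python sketch. It assumes both sources have already been harmonized to the same keys (country, year, sex, age group). The field names, the `integrate` helper, and all numbers are hypothetical and are not taken from the actual workflows.

```python
# Minimal sketch: join death counts to population estimates on shared keys
# and derive age-specific mortality rates. All values below are made up.

KEY_FIELDS = ("country", "year", "sex", "age_group")  # assumed join keys

deaths = [
    {"country": "AAA", "year": 2010, "sex": "F", "age_group": "30-39", "deaths": 120},
    {"country": "AAA", "year": 2010, "sex": "F", "age_group": "40-49", "deaths": 310},
    {"country": "AAA", "year": 2010, "sex": "M", "age_group": "30-39", "deaths": 150},
]
population = [
    {"country": "AAA", "year": 2010, "sex": "F", "age_group": "30-39", "population": 400_000},
    {"country": "AAA", "year": 2010, "sex": "F", "age_group": "40-49", "population": 310_000},
    {"country": "AAA", "year": 2010, "sex": "M", "age_group": "30-39", "population": 390_000},
]


def key_of(record):
    """Build the composite join key for one record."""
    return tuple(record[field] for field in KEY_FIELDS)


def integrate(deaths, population):
    """Return (merged rows with rates, keys of death rows lacking population)."""
    pop_by_key = {key_of(p): p["population"] for p in population}
    merged, unmatched = [], []
    for d in deaths:
        k = key_of(d)
        pop = pop_by_key.get(k)
        if not pop:  # missing or zero population: report it, do not guess
            unmatched.append(k)
            continue
        rate = d["deaths"] / pop * 100_000
        merged.append({**d, "population": pop, "rate_per_100k": rate})
    return merged, unmatched


merged, unmatched = integrate(deaths, population)
for row in merged:
    print(row["sex"], row["age_group"], round(row["rate_per_100k"], 1))
```

Returning the unmatched keys separately, rather than silently dropping them, keeps gaps between heterogeneous sources visible for review.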
Data Analytics & Visualization Platform: high-level system architecture
[Diagram adapted from BI/DW technical system architecture. Layers shown:]
• Web-based apps / services for data dissemination: GBD Visualizations, Research Collaboration site, MoH Web site
• Data discovery & visual analytics (Tableau): visual discovery and analytic tools and methods; data computation; data visualization sharing and collaboration
• Data storage & management (MS SQL Server): DBMS data repository with data (internal and external) ready for analysis and visualizations
• Data preparation and integration (Alteryx): data cleansing, transformation, processing, preparation, and integration processes (workflows & scheduling)
• Source systems / data sources: Mortality, Population, Health Survey, Hospital records, PH surveillance
Also labeled in the diagram: Open Data Portal
Data Preparation and Integration
Data preparation and integration processes using Alteryx
Integration of Mortality Data
Integration of Population Data
Integration of Mortality Measures
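One measure that can be derived from the integrated data, given that a world standard population is among the reference data, is an age-standardized mortality rate. The sketch below uses direct standardization as an assumption about the kind of measure involved; it is not a description of the actual workflow. The function name, age groups, counts, and weights are all hypothetical, and the weights are not the real world standard population.

```python
# Minimal sketch of direct age standardization. All numbers are made up;
# the weights are illustrative and NOT the real world standard population.

def age_standardized_rate(age_specific, standard_weights, per=100_000):
    """Weighted average of age-specific rates, using standard-population weights.

    age_specific: {age_group: (deaths, population)}
    standard_weights: {age_group: weight}; renormalized over the groups present.
    """
    total_weight = sum(standard_weights[age] for age in age_specific)
    weighted_sum = sum(
        (deaths / pop) * standard_weights[age]
        for age, (deaths, pop) in age_specific.items()
    )
    return weighted_sum / total_weight * per


age_specific = {
    "30-39": (100, 500_000),
    "40-49": (300, 400_000),
    "50-59": (600, 300_000),
}
standard_weights = {"30-39": 0.40, "40-49": 0.35, "50-59": 0.25}  # hypothetical

asr = age_standardized_rate(age_specific, standard_weights)
crude = sum(d for d, _ in age_specific.values()) / sum(
    p for _, p in age_specific.values()
) * 100_000
print(round(asr, 2), round(crude, 2))
```

Standardizing against a common reference population makes rates comparable across countries and years with different age structures, which a crude rate does not.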
Data Modeling in the Data Warehouse
1. Data is uploaded to the Data Warehouse
2. Dimensional models (star-schema model) are prepared
3. Data is ready and accessible for analysis and visualization
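As an illustration of what a star-schema model looks like, the following sketch builds a tiny one with Python's built-in sqlite3 module instead of MS SQL Server. The table names, columns, and rows are hypothetical and are not the actual warehouse design.

```python
import sqlite3

# Minimal star-schema sketch: one fact table surrounded by dimension tables.
# Table names, columns, and rows are hypothetical, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_country   (country_id   INTEGER PRIMARY KEY, name  TEXT);
CREATE TABLE dim_sex       (sex_id       INTEGER PRIMARY KEY, label TEXT);
CREATE TABLE dim_age_group (age_group_id INTEGER PRIMARY KEY, label TEXT);
CREATE TABLE dim_cause     (cause_id     INTEGER PRIMARY KEY, label TEXT);
CREATE TABLE fact_mortality (
    country_id   INTEGER REFERENCES dim_country(country_id),
    year         INTEGER,
    sex_id       INTEGER REFERENCES dim_sex(sex_id),
    age_group_id INTEGER REFERENCES dim_age_group(age_group_id),
    cause_id     INTEGER REFERENCES dim_cause(cause_id),
    deaths       INTEGER
);
INSERT INTO dim_country   VALUES (1, 'Country A');
INSERT INTO dim_sex       VALUES (1, 'Female'), (2, 'Male');
INSERT INTO dim_age_group VALUES (1, '30-39'), (2, '40-49');
INSERT INTO dim_cause     VALUES (1, 'Cardiovascular diseases'), (2, 'Diabetes');
INSERT INTO fact_mortality VALUES
    (1, 2010, 1, 1, 1, 120),
    (1, 2010, 2, 1, 1, 150),
    (1, 2010, 1, 2, 2, 40);
""")

# A typical analytical query: join the fact table to its dimensions and aggregate.
rows = conn.execute("""
    SELECT c.name, ca.label, SUM(f.deaths)
    FROM fact_mortality AS f
    JOIN dim_country AS c  ON c.country_id = f.country_id
    JOIN dim_cause   AS ca ON ca.cause_id  = f.cause_id
    GROUP BY c.name, ca.label
    ORDER BY ca.label
""").fetchall()
print(rows)
```

Keeping descriptive attributes in dimension tables and numeric measures in the fact table is what lets a visualization tool slice the same measure by any dimension.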
Data Visualization
Visualization tool for assessing level and trends of premature mortality from noncommunicable diseases.
Assessing Level and Trends of Premature Mortality from NCDs
Interactive visualization tool for assessing the trends of premature mortality from NCDs by country. It helps to answer the question: A selected country
Conclusions
• In the past, data were prepared using Excel and by coding SQL statements in MS SQL Server Management Studio.
• That approach took a lot of time and offered less control and transparency.
• Data preparation and integration using Alteryx allows us to improve:
  o The efficiency of processes: 60% reduction in data prep time
  o Transparency of, and control over, the data prep processes
  o Reuse and automation of processes
  o Documentation of data preparation processes
Keep the conversation going after the panel
Ask your questions on Twitter using #ZenDataPrep