Combining variables, creating and validating indices
Recommend Documents
*Institute for the Geography of Health ... geography of health and healthcare, on social capital, ..... and the writings of Hippocrates (Macintyre and Ellaway,.
3927 sqft. Tube size OD: 0.75 inches. Tube ID: 0.652 inches. Wall thickness: .... h
s p1 p2. For an Isentropic compression as shown: (. ) (. ) J pv pv h h.
Aug 8, 2006 - Creating and Validating an Algorithm to Measure AIDS Mortality in the Adult. Population using Verbal Autopsy. Ben A. Lopman. 1*. , Ruanne V.
Aug 8, 2006 - Using these signs and symptoms, demographic surveillance data on AIDS deaths may allow for the estimation of AIDS mortality and even HIV ...
laceration, open-bite, surgical relapse, dental injury, nasal septal deviation, and ... aesthetics; and regret over having had the surgery. ..... American Medical Informatics Association Annual Symposium Proceedings, 195e199. Espeland, L.
Digital Pelvic Radiography: Increasing Distance to Reduce Dose. Contrast .... of a GBCA and degree of renal impairment at the time of exposure. Record the ...... et al in 2003 investigated various dose reduction tech- ...... Sarasota, Florida.
Mar 2, 2011 - Screen patients for acute kidney injury and other conditions that .... absorption via the gastrointestinal tract is poor (approximately 5% of the ...... 11. I am able to describe how soft tissue signs can be used to ..... set at 100 cm,
ing advanced spelling checkers, where the usage of contextual rules at the syntax level can sig- nificantly increase the spelling error detection and correction ...
the hard or 'heroic' assumption of LISREL as Herman Wold liked to call it. Anticipating ..... The author gratefully acknowledges reference to McDonald (1996) by .... asymptotically center around (Bâ,Îâ) say, which are such that (I â Bâ)â1
tornam a metodologia passível de ser utilizada em qualquer local, facilitando a tomada de decisão em regiões onde os registros de ocorrências de incêndios ...
(b) gathering information on the needs of patients through blogs and virtual communities; (c) evaluating .... of the population, respectful of its culture and informative (Serxner,. 2000) ..... tance of the booklet; the best time to present the bookl
AbstractâThe purpose of this paper is to investigate the relationship among the key macroeconomic variables and Islamic stock market in India. This study is ...
We tested the assumptions that the relationship between body mass and body size is .... coffee grinder, and fat content was determined using petroleum ether in ...
statistical models are developed to explore the factors associated with the decision to .... incomplete socio-demographi
Feb 11, 2016 - anthropometrics. Much recent clinical application of simple biometrics has ... at https://nirkrakauer.net/sw/absi-calculator.html and can be.
Feb 11, 2016 - wonder how much she benefited health-wise from bariatric surgery. Assuming that she were today a candidate considering bariatric surgery ...
Sep 17, 2018 - (QE65000, Ocean Optics, Dunedin, FL, USA). ..... Jin, X.; Liu, S.; Baret, F.; Hemerle, M.; Comar, A. Estimates of plant density of wheat crops at ...
Aug 16, 2012 - As an example, the entity 'dbpedia:Stevie Wonder' has a. pointing to 'dbpedia:Jimi Hendrix'. Since the AOR task aims at ...
Aug 24, 2016 - {x1,..., xp1 } and a set of p2 categorical variables denoted by {Ëx1,..., Ëxp2 } with p1 + p2 = p. Let X be the corresponding data matrix of ...
Jan 4, 2012 - determine whether a manufacturing process is in a state of statistical control or not. If analysis of the control chart indicates that the process is ...
Oct 18, 2006 - The RecipeSheet lets users create information proc- essors, called recipes, which may take input in a variety of forms such as text, Web pages, ...
dimensional markup produced online by multilingual natural ... XSLT-based online integration of various shallow ... saving it to XML files or to an XML database. .... reorder fs2rmrsxml rmrs_lex. RMRS result pos_filter. SProUT rmrs_morph.
Beauchaine, 2001; Thayer & Lane, 2000), both in the experience of dysphoria (cf. Kreibig ... Fifth Avenue, Webster Hall Suite 134, Pittsburgh, PA 15213. E-mail:.
Validating and Verifying AI Systems › publication › fulltext › Validating... › publication › fulltext › Validating...by DJ Hand · 2020 · Cited by 2 · Related articlesJun 12, 2020 — In machine learning, validation is often used in the ... dation as
Combining variables, creating and validating indices
regression, but the categories are not true dichotomy. ⢠All transformations can be done in two ways. â Using the SPSS graphical user interface. â Writing code.
Combining variables, creating and validating indices
Workshops on writing research papers with TIMSS data Pretoria, South Africa May 27th – 31st
Table of Contents • Simple recoding of variables • Changing categories – reverse and simple change • Collapsing categories • Convert continuous into categorical variable
• Aggregation of variables • Creating indices
Simple recoding of variables • Changing categories – When the order is reversed – When dichotomous variables need to be included in regression, but the categories are not true dichotomy
• All transformations can be done in two ways – Using the SPSS graphical user interface – Writing code
• The data file we will use is the Student Background C:\Workshop\Data\bsgzafm5.sav
Table of Contents • Simple recoding of variables • Changing categories – reverse and simple change • Collapsing categories • Convert continuous into categorical variable
Recode Variable – Variable Information [BSDGEDUP] Variable Codes:
-> STEPS: – I. Recode ‚6‘ (not applicable) to SYSMIS – II. Reverse the scale and coding the lowest value to zero (we expect association between higher education with higher reading scores): 5->1/ 4->2/ 3->3/ 2->4/ 1->5 – III. Recode all other values to SYSMIS
Recode ASDHEDUP – I. Define input/output variable Transform -> Recode into different variables… 2. Name Output Variable [BSDGEDUPr] BSDGEDUPr 3. Label Output Variable (optional)
4. Press ‚Change‘
5. Press ‚Old and New Values…‘
1. Select variable [ASDHEDUP]
Recode BSDGEDUP – II. All missing to SYSMIS Recode all missing values (user defined or system) into system missing
Confirm each recoding step with ‚Add‘
Recode BSDGEDUP – III. Reverse coding of categories
Recode BSDGEDUP – Define Value Labels (optional) Tab: Variable View Click to edit
Enter values [1…4] Enter corresponding labels Confirm each entry with ‘Add’
Recode ASDHEDUP – do all at once RECODE BSDGEDUP (1=5) (2=4) (3=3) (4=2) (5=1) (6=SYSMISS) (MISSING=SYSMISS) INTO BSDGEDUPr. VARIABLE LABELS BSDGEDUPr "PARENTS HIGHEST EDUCATIONAL LEVEL - REVERSED" VALUE LABELS BSDGEDUPr "1" "SOME PRIMARY, LOWER SECONDARY OR NO SCHOOL" "2" "LOWER SECONDARY" "3" "UPPER SECONDARY" "4" "POST-SECONDARY BUT NOT UNIVERSITY" "5" "UNIVERSITY OR HIGHER". EXECUTE.
Recode ITSEX • Recode the variable for regression – Original values: 1 – girl; 2 – boy – Recoded values: 0 – girl; 1 – boy
• The original values used in regression will give us incorrect estimate for the intercept • Syntax: RECODE ITSEX (1=0) (2=1) (MISSING=SYSMISS) INTO ITSEXr. VARIABLE LABELS ITSEX "SEX OF STUDENTS - RECODED". VALUE LABELS ITSEXr "0" "GIRL" "1" "BOY". EXECUTE.
Table of Contents • Simple recoding of variables • Changing categories – reverse and simple change • Collapsing categories • Convert continuous into categorical variable
• Aggregation of variables • Creating indices
Collapsing categories • When we need less categories than the variable provides • Books at home 0-10 BOOKS 11-25 BOOKS 26-100 BOOKS 101-200 BOOKS MORE THAN 200
UP TO 100 BOOKS MORE THAN 100 BOOKS
Collapsing categories • Syntax: RECODE BSBG04 (1 THRU 3 = 0) (4 THRU 5 = 1) INTO BSBG04r. VARIABLE LABELS BSBG04r "GEN\AMOUNT OF BOOKS IN YOUR HOME - COLLAPSED". VALUE LABELS BSBG04r "0" "UP TO 100 BOOKS" "1" "MORE THAN 100 BOOKS". EXECUTE.
Table of Contents • Simple recoding of variables • Changing categories – reverse and simple change • Collapsing categories • Convert continuous into categorical variable
• Aggregation of variables • Creating indices
Convert continuous into categorical variable • When we need the values over of the continuum of a variable to be represented as categories • Students Confident with Mathematics [BSBGSLC] – Derived variable (IRT) – Continuous scale: • Mean: 10.0 • SD: 2
• Recode into categorical variable: – Lowest through 5 = 1 – 5.1 through 10.0 = 2 – 10.1 through highest = 3
Table of Contents • Simple recoding of variables • Changing categories – reverse and simple change • Collapsing categories • Convert continuous into categorical variable
• Aggregation of variables • Creating indices
Aggregation of variables • When we need to obtain a measure for the sampled students in the schools based on their common characteristics • This could give us a measure of the student intake in the schools • For example, aggregated SES measures can give us idea what is the average SES across schools and use it to control for the student intake in linear regression
Aggregation of variables • Home Educational Resources Scale (HER)
1. Select Break Variable [IDSCHOOL] (NOTE: If you have more than one country, you will need to add IDCNTRY as first break variable 2. Select variables to aggregate [BSBGHER]
Default function is ‘mean’
3. Name the resulting variable [BSBGHERm]
4. Add the data as a new variable
Aggregation of variables • Syntax: SORT CASES BY IDSCHOOL. AGGREGATE /OUTFILE=* MODE=ADDVARIABLES /PRESORTED /BREAK=IDSCHOOL /BSBGHERm=MEAN(BSBGHER).
Table of Contents • Simple recoding of variables • Changing categories – reverse and simple change • Collapsing categories • Convert continuous into categorical variable
• Aggregation of variables • Creating indices
What is an Index? • A variable, a manifestation, indicator, etc. • Something that tells us something about something • Could be a single variable or combination of several variables
Combining Variables • Usually we talk about an index referring to a combination of two or more variables into one – – – –
Economic Index, Stock exchange index, etc. A ratio of two variables (student to teacher ratio) A score on a test Any other variable that results from the combination of 2 or more other variables
Combining Variables • One classification – Simple indices: arithmetic transformation or recoding of one or more items – Scale indices: constructed through the scaling of items (using IRT, factor scores, sum of individual scores, etc.)
• Many considerations and decisions go into computing an index
Steps in Creating Indices • Select the variables based on a theoretical background • Review variability of each of the variables • Check relationship with achievement (if the goal is to study achievement) • Conduct an exploratory or confirmatory factor analysis • Perform necessary recoding • Check the reliability • Compute the index • Check your work
Steps in Creating Indices • Are there variables that need to be recoded or reversed? • What’s the incidence of missing data? – – – –
How will you deal with missing data? How much is tolerable? Listwise or casewise deletion? Relative contribution of each item?
• Are the components measuring something similar? • How will the variables be grouped?
Computing Indices • How will you compute the index? – Sum or average score – Factor score – IRT score • Deciding on model • Deciding on estimation method
– Selecting parameters
• Advantages and disadvantages? – Computing indices over time – Comparability across countries – Relative contribution from countries to the parameter estimation
Setting the Scale • • • • •
How many units? Where does it begin and where does it end? Where do you set the mean? Do you set the dispersion? Same or different metric across countries?
Principal Component Analysis (PCA) • A method for data reduction
V1
V2
Factor 1
V3
V4
V5
V6
Factor 2
„Many correlated variables are replaced by few variables, that are uncorrelated“
V7
Partial correlation reminding correlation x y rxy => 0 factor loading rzx
z
factor loading rzy
„synthetic“ variable => Latent Factor
PCA as a method for generating of hypothesis • The latent factor estimates the joint variance of the variables that are in the model • Whereas it doesn‘t provide any information about the context-related meaning of the latent factor • but: factor loadings (which are partial correlations between the latent factor and the manifest variables) allows for: • …generating of hypothesis about the context-related interpretation of the latent factor
PCA as a method for analysis of the dimensions • Factor analysis allows for testing of the multidimensionality of complex sociological and psychological constructs • The single-dimensionality is one of the criteria of the quality of the constructs – When variables contains only one dimension the latent variable can be used for further analysis – In the case of multidimensionality further model development is needed (e.g. excluding variables, defining more the one factor that should be modeled separately)
Reliability • Cronbachs’ α coefficient – tests the generalizability (reliability) of the score under the condition that all variables of the score are correlated by 1 • correlation equal to 1 means that the construct is very consistent
– Range from 0 to 1 – Value near 1 means that the factor is close to the assumption of 1-correlation high reliability
Example Watch for the direction!
• Index of Student Confidence in Mathematics Variable BSBM16A BSBM16B BSBM16C BSBM16D BSBM16E BSBM16F BSBM16G BSBM16H BSBM16I BSBM16J BSBM16K BSBM16L BSBM16M BSBM16N
Question I usually do well in mathematics Mathematics is more difficult for me than for many of my classmates Mathematics is not one of my strengths I learn things quickly in mathematics Mathematics makes me confused and nervous I am good at working out difficult mathematics problems My teacher thinks I can do well in mathematics with difficult materials My teacher tells me I am good at mathematics Mathematics is harder for me than any other subject I think learning mathematics will help me in my daily life I need mathematics to learn other school subjects I need to do well in mathematics to get into the of my choice I need to do well in mathematics to get the job I want I would like a job that involves using mathematics
List of response categories • Response categories – – – – –
1 – AGREE A LOT 2 – AGREE A LITTLE 3 – DISAGREE A LITTLE 4 – DISAGREE A LOT 9 – OMITTED (missing value)
• Again, watch for the direction!
Recode variables • Positive statements to be reversed – – – – – –
Producing the scale using PCA • Principal Component Analysis (PCA) –involves extracting linear composites of observed variables • Data reduction method • Reduces correlated observed variables to a smaller set of important independent composite variables
In the graphical user interface of SPSS… • Load the merged file after recoding the variables • Apply the Total Student Weight (TOTWGT) NOTE: If you do it for more than one country, use the Senate Weight (SENWGT)
In the graphical user interface of SPSS… • Run PCA – Open the dialog box
In the graphical user interface of SPSS… • Run PCA (continued) – Add all recoded variables into the list box
In the graphical user interface of SPSS… • Run PCA (continued) – Extraction settings
In the graphical user interface of SPSS… • Run PCA (continued) – Rotation settings
In the graphical user interface of SPSS… • Run PCA (continued) – Factor scores settings
In the graphical user interface of SPSS… • Run PCA (continued)
In the graphical user interface of SPSS… • Run PCA (continued)
In the graphical user interface of SPSS… • Run PCA (continued)
Question I usually do well in mathematics I learn things quickly in mathematics I am good at working out difficult mathematics problems My teacher thinks I can do well in mathematics with difficult materials My teacher tells me I am good at mathematics I think learning mathematics will help me in my daily life I need mathematics to learn other school subjects I need to do well in mathematics to get into the of my choice I need to do well in mathematics to get the job I want I would like a job that involves using mathematics Mathematics is more difficult for me than for many of my classmates Mathematics is not one of my strengths Mathematics makes me confused and nervous Mathematics is harder for me than any other subject
• 1 – Students confidence in mathematics • 2 – Students opinion on mathematics utility • 3 – Students experience difficulty in mathematics
Rerun PCA for Factor 1 Variable BSBM16A BSBM16D BSBM16F BSBM16G
Question
I usually do well in mathematics I learn things quickly in mathematics I am good at working out difficult mathematics problems My teacher thinks I can do well in mathematics with difficult materials BSBM16H My teacher tells me I am good at mathematics