Results: MathLogReg2.sas

Prediction of Performance in First-year Calculus

Predict Passing the course (Y-N) with Logistic Regression

HS variables

The LOGISTIC Procedure

The LOGISTIC Procedure

Model Information

Model Information
Data Set WORK.MATHEX  
Response Variable passed Passed the course
Number of Response Levels 2  
Model binary logit  
Optimization Technique Fisher's scoring  

Observations Summary

Number of Observations Read 579
Number of Observations Used 435

Response Profile

Response Profile
Ordered
Value
passed Total
Frequency
1 Yes 257
2 No 178

Probability modeled is passed='Yes'.

Note:144 observations were deleted due to missing values for the response or explanatory variables.

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 590.611 465.610
SC 594.686 481.912
-2 Log L 588.611 457.610

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 131.0005 3 <.0001
Score 110.3360 3 <.0001
Wald 82.9074 3 <.0001

Parameter Estimates

Analysis of Maximum Likelihood Estimates
Parameter DF Estimate Standard
Error
Wald
Chi-Square
Pr > ChiSq
Intercept 1 -15.9885 2.0503 60.8083 <.0001
hsgpa 1 0.1491 0.0331 20.3218 <.0001
hscalc 1 0.0582 0.0126 21.3526 <.0001
hsengl 1 0.00437 0.0164 0.0711 0.7898

Odds Ratios

Odds Ratio Estimates
Effect Point Estimate 95% Wald
Confidence Limits
hsgpa 1.161 1.088 1.238
hscalc 1.060 1.034 1.086
hsengl 1.004 0.973 1.037

Association Statistics

Association of Predicted Probabilities and Observed Responses
Percent Concordant 81.0 Somers' D 0.620
Percent Discordant 19.0 Gamma 0.620
Percent Tied 0.0 Tau-a 0.300
Pairs 45746 c 0.810

Prediction of Performance in First-year Calculus

Predict Passing the course (Y-N) with Logistic Regression

HS gpa and calc, course2 and diagnostic test

The LOGISTIC Procedure

The LOGISTIC Procedure

Model Information

Model Information
Data Set WORK.MATHEX  
Response Variable passed Passed the course
Number of Response Levels 2  
Model binary logit  
Optimization Technique Fisher's scoring  

Observations Summary

Number of Observations Read 579
Number of Observations Used 375

Response Profile

Response Profile
Ordered
Value
passed Total
Frequency
1 Yes 234
2 No 141

Probability modeled is passed='Yes'.

Note:204 observations were deleted due to missing values for the response or explanatory variables.

Class Level Information

Class Level Information
Class Value Design Variables
course2 Catch-up 1 0
  Elite 0 1
  Mainstrm 0 0

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 498.554 379.569
SC 502.481 407.057
-2 Log L 496.554 365.569

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 130.9852 6 <.0001
Score 108.3421 6 <.0001
Wald 79.9399 6 <.0001

Type 3 Tests

Type 3 Analysis of Effects
Effect DF Wald
Chi-Square
Pr > ChiSq
hsgpa 1 14.6472 0.0001
hscalc 1 18.1207 <.0001
course2 2 0.4383 0.8032
precalc 1 8.5427 0.0035
calc 1 1.6624 0.1973

Parameter Estimates

Analysis of Maximum Likelihood Estimates
Parameter   DF Estimate Standard
Error
Wald
Chi-Square
Pr > ChiSq
Intercept   1 -14.8670 2.3187 41.1102 <.0001
hsgpa   1 0.1203 0.0314 14.6472 0.0001
hscalc   1 0.0596 0.0140 18.1207 <.0001
course2 Catch-up 1 0.2860 0.5602 0.2607 0.6097
course2 Elite 1 0.2244 0.5129 0.1915 0.6617
precalc   1 0.2617 0.0895 8.5427 0.0035
calc   1 0.0843 0.0654 1.6624 0.1973

Odds Ratios

Odds Ratio Estimates
Effect Point Estimate 95% Wald
Confidence Limits
hsgpa 1.128 1.060 1.199
hscalc 1.061 1.033 1.091
course2 Catch-up vs Mainstrm 1.331 0.444 3.991
course2 Elite vs Mainstrm 1.252 0.458 3.420
precalc 1.299 1.090 1.548
calc 1.088 0.957 1.237

Association Statistics

Association of Predicted Probabilities and Observed Responses
Percent Concordant 83.5 Somers' D 0.670
Percent Discordant 16.5 Gamma 0.670
Percent Tied 0.0 Tau-a 0.315
Pairs 32994 c 0.835

Wald Test for Contrasts

Contrast Test Results
Contrast DF Wald
Chi-Square
Pr > ChiSq
Course 2 0.4383 0.8032

Prediction of Performance in First-year Calculus

Predict Passing the course (Y-N) with Logistic Regression

HS gpa and calc, precalc and total score

The LOGISTIC Procedure

The LOGISTIC Procedure

Model Information

Model Information
Data Set WORK.MATHEX  
Response Variable passed Passed the course
Number of Response Levels 2  
Model binary logit  
Optimization Technique Fisher's scoring  

Observations Summary

Number of Observations Read 579
Number of Observations Used 375

Response Profile

Response Profile
Ordered
Value
passed Total
Frequency
1 Yes 234
2 No 141

Probability modeled is passed='Yes'.

Note:204 observations were deleted due to missing values for the response or explanatory variables.

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 498.554 376.007
SC 502.481 395.642
-2 Log L 496.554 366.007

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 130.5468 4 <.0001
Score 108.2737 4 <.0001
Wald 79.7057 4 <.0001

Parameter Estimates

Analysis of Maximum Likelihood Estimates
Parameter DF Estimate Standard
Error
Wald
Chi-Square
Pr > ChiSq
Intercept 1 -14.6351 2.2803 41.1914 <.0001
hsgpa 1 0.1181 0.0311 14.4227 0.0001
hscalc 1 0.0592 0.0136 18.9109 <.0001
precalc 1 0.1812 0.1250 2.0999 0.1473
totscore 1 0.0821 0.0650 1.5969 0.2063

Odds Ratios

Odds Ratio Estimates
Effect Point Estimate 95% Wald
Confidence Limits
hsgpa 1.125 1.059 1.196
hscalc 1.061 1.033 1.090
precalc 1.199 0.938 1.532
totscore 1.086 0.956 1.233

Association Statistics

Association of Predicted Probabilities and Observed Responses
Percent Concordant 83.5 Somers' D 0.670
Percent Discordant 16.5 Gamma 0.670
Percent Tied 0.0 Tau-a 0.315
Pairs 32994 c 0.835

Test Statement Results

Linear Hypotheses Testing Results
Label Wald
Chi-Square
DF Pr > ChiSq
precalc_n_totscore 13.8587 2 0.0010


Prediction of Performance in First-year Calculus

Predict Passing the course (Y-N) with Logistic Regression

Try gender, ethnic and mother tongue controlling for good stuff

The LOGISTIC Procedure

The LOGISTIC Procedure

Model Information

Model Information
Data Set WORK.MATHEX  
Response Variable passed Passed the course
Number of Response Levels 2  
Model binary logit  
Optimization Technique Fisher's scoring  

Observations Summary

Number of Observations Read 579
Number of Observations Used 370

Response Profile

Response Profile
Ordered
Value
passed Total
Frequency
1 Yes 232
2 No 138

Probability modeled is passed='Yes'.

Note:209 observations were deleted due to missing values for the response or explanatory variables.

Class Level Information

Class Level Information
Class Value Design Variables
ethnic Asian 1 0 0 0 0
  East Indian 0 0 0 0 0
  Eastern European 0 1 0 0 0
  European not Eastern 0 0 1 0 0
  Middle-Eastern and Pakistani 0 0 0 1 0
  Other and DK 0 0 0 0 1

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 490.784 379.703
SC 494.698 422.752
-2 Log L 488.784 357.703

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 131.0808 10 <.0001
Score 110.8226 10 <.0001
Wald 80.9993 10 <.0001

Type 3 Tests

Type 3 Analysis of Effects
Effect DF Wald
Chi-Square
Pr > ChiSq
hsgpa 1 10.3205 0.0013
hscalc 1 24.3884 <.0001
precalc 1 13.5282 0.0002
ethnic 5 4.5364 0.4750
gender 1 0.8343 0.3610
mtongue 1 0.1917 0.6615

Parameter Estimates

Analysis of Maximum Likelihood Estimates
Parameter   DF Estimate Standard
Error
Wald
Chi-Square
Pr > ChiSq
Intercept   1 -14.2314 2.4312 34.2655 <.0001
hsgpa   1 0.1043 0.0325 10.3205 0.0013
hscalc   1 0.0687 0.0139 24.3884 <.0001
precalc   1 0.3205 0.0871 13.5282 0.0002
ethnic Asian 1 -0.3522 0.4642 0.5758 0.4480
ethnic Eastern European 1 -0.0314 0.5126 0.0038 0.9512
ethnic European not Eastern 1 0.2889 0.4266 0.4587 0.4982
ethnic Middle-Eastern and Pakistani 1 -0.4521 0.5419 0.6963 0.4040
ethnic Other and DK 1 0.6539 0.8954 0.5334 0.4652
gender   1 0.2465 0.2698 0.8343 0.3610
mtongue   1 -0.1485 0.3391 0.1917 0.6615

Odds Ratios

Odds Ratio Estimates
Effect Point Estimate 95% Wald
Confidence Limits
hsgpa 1.110 1.042 1.183
hscalc 1.071 1.042 1.101
precalc 1.378 1.161 1.634
ethnic Asian vs East Indian 0.703 0.283 1.746
ethnic Eastern European vs East Indian 0.969 0.355 2.647
ethnic European not Eastern vs East Indian 1.335 0.579 3.081
ethnic Middle-Eastern and Pakistani vs East Indian 0.636 0.220 1.840
ethnic Other and DK vs East Indian 1.923 0.333 11.120
gender 1.279 0.754 2.171
mtongue 0.862 0.443 1.676

Association Statistics

Association of Predicted Probabilities and Observed Responses
Percent Concordant 83.7 Somers' D 0.674
Percent Discordant 16.3 Gamma 0.674
Percent Tied 0.0 Tau-a 0.316
Pairs 32016 c 0.837

Coefficients for Contrasts

Coefficients of Contrast Demographics
Parameter Row1 Row2 Row3 Row4 Row5 Row6 Row7
Intercept 0 0 0 0 0 0 0
hsgpa 0 0 0 0 0 0 0
hscalc 0 0 0 0 0 0 0
precalc 0 0 0 0 0 0 0
ethnicAsian 1 0 0 0 0 0 0
ethnicEastern_European 0 1 0 0 0 0 0
ethnicEuropean_not_Eastern 0 0 1 0 0 0 0
ethnicMiddle_Eastern__and_Pakist 0 0 0 1 0 0 0
ethnicOther___and_DK 0 0 0 0 1 0 0
gender 0 0 0 0 0 1 0
mtongue 0 0 0 0 0 0 1

Wald Test for Contrasts

Contrast Test Results
Contrast DF Wald
Chi-Square
Pr > ChiSq
Demographics 7 6.0125 0.5383

Prediction of Performance in First-year Calculus

Predict Passing the course (Y-N) with Logistic Regression

My model: HS gpa, HS calculus mark, and Precalculus subtest

The LOGISTIC Procedure

The LOGISTIC Procedure

Model Information

Model Information
Data Set WORK.MATHEX  
Response Variable passed Passed the course
Number of Response Levels 2  
Model binary logit  
Optimization Technique Fisher's scoring  

Observations Summary

Number of Observations Read 579
Number of Observations Used 375

Response Profile

Response Profile
Ordered
Value
passed Total
Frequency
1 Yes 234
2 No 141

Probability modeled is passed='Yes'.

Note:204 observations were deleted due to missing values for the response or explanatory variables.

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 498.554 375.618
SC 502.481 391.326
-2 Log L 496.554 367.618

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 128.9358 3 <.0001
Score 107.7971 3 <.0001
Wald 79.6583 3 <.0001

Parameter Estimates

Analysis of Maximum Likelihood Estimates
Parameter DF Estimate Standard
Error
Wald
Chi-Square
Pr > ChiSq
Intercept 1 -14.7970 2.2683 42.5550 <.0001
hsgpa 1 0.1173 0.0310 14.3281 0.0002
hscalc 1 0.0638 0.0132 23.3346 <.0001
precalc 1 0.2989 0.0844 12.5464 0.0004

Odds Ratios

Odds Ratio Estimates
Effect Point Estimate 95% Wald
Confidence Limits
hsgpa 1.124 1.058 1.195
hscalc 1.066 1.039 1.094
precalc 1.348 1.143 1.591

Association Statistics

Association of Predicted Probabilities and Observed Responses
Percent Concordant 83.2 Somers' D 0.664
Percent Discordant 16.8 Gamma 0.664
Percent Tied 0.0 Tau-a 0.312
Pairs 32994 c 0.832

Prediction of Performance in First-year Calculus

Predict Passing the course (Y-N) with Logistic Regression

Stepwise Logistic Regression

The LOGISTIC Procedure

The LOGISTIC Procedure

Model Information

Model Information
Data Set WORK.MATHEX  
Response Variable passed Passed the course
Number of Response Levels 2  
Model binary logit  
Optimization Technique Fisher's scoring  

Observations Summary

Number of Observations Read 579
Number of Observations Used 368

Response Profile

Response Profile
Ordered
Value
passed Total
Frequency
1 Yes 230
2 No 138

Probability modeled is passed='Yes'.

Note:211 observations were deleted due to missing values for the response or explanatory variables.

Stepwise Selection Procedure

Class Level Information

Class Level Information
Class Value Design Variables
ethnic Asian 1 0 0 0 0
  East Indian 0 1 0 0 0
  Eastern European 0 0 1 0 0
  European not Eastern 0 0 0 1 0
  Middle-Eastern and Pakistani 0 0 0 0 1
  Other and DK 0 0 0 0 0
course Catch-up 1 0      
  Elite 0 1      
  Mainstrm 0 0      

Step 0

Step 0. Intercept entered:

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

-2 Log L = 486.911

Residual Chi-Square

Residual Chi-Square Test
Chi-Square DF Pr > ChiSq
111.4077 14 <.0001

Step 1

Step 1. Effect hscalc entered:

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 488.911 398.544
SC 492.819 406.360
-2 Log L 486.911 394.544

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 92.3666 1 <.0001
Score 85.5385 1 <.0001
Wald 68.7679 1 <.0001

Residual Chi-Square

Residual Chi-Square Test
Chi-Square DF Pr > ChiSq
38.7030 13 0.0002

Note:No effects for the model in Step 1 are removed.

Step 2

Step 2. Effect hsgpa entered:

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 488.911 382.659
SC 492.819 394.384
-2 Log L 486.911 376.659

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 110.2512 2 <.0001
Score 94.7678 2 <.0001
Wald 71.1676 2 <.0001

Residual Chi-Square

Residual Chi-Square Test
Chi-Square DF Pr > ChiSq
22.0384 12 0.0371

Note:No effects for the model in Step 2 are removed.

Step 3

Step 3. Effect precalc entered:

Convergence Status

Model Convergence Status
Convergence criterion (GCONV=1E-8) satisfied.

Fit Statistics

Model Fit Statistics
Criterion Intercept Only Intercept and Covariates
AIC 488.911 370.567
SC 492.819 386.199
-2 Log L 486.911 362.567

Global Tests

Testing Global Null Hypothesis: BETA=0
Test Chi-Square DF Pr > ChiSq
Likelihood Ratio 124.3437 3 <.0001
Score 104.2190 3 <.0001
Wald 77.6336 3 <.0001

Residual Chi-Square

Residual Chi-Square Test
Chi-Square DF Pr > ChiSq
8.7229 11 0.6475

Note:No effects for the model in Step 3 are removed.

Note:No (additional) effects met the 0.05 significance level for entry into the model.

Model Building Summary

Summary of Stepwise Selection
Step Effect DF Number
In
Score
Chi-Square
Wald
Chi-Square
Pr > ChiSq Variable
Label
Entered Removed
1 hscalc   1 1 85.5385   <.0001 HS Calculus
2 hsgpa   1 2 17.3610   <.0001 High School GPA
3 precalc   1 3 13.8105   0.0002 Number precalculus correct

Type 3 Tests

Type 3 Analysis of Effects
Effect DF Wald
Chi-Square
Pr > ChiSq
hsgpa 1 13.5684 0.0002
hscalc 1 21.6572 <.0001
precalc 1 13.2413 0.0003

Parameter Estimates

Analysis of Maximum Likelihood Estimates
Parameter DF Estimate Standard
Error
Wald
Chi-Square
Pr > ChiSq
Intercept 1 -14.4886 2.2714 40.6869 <.0001
hsgpa 1 0.1147 0.0312 13.5684 0.0002
hscalc 1 0.0616 0.0132 21.6572 <.0001
precalc 1 0.3098 0.0851 13.2413 0.0003

Odds Ratios

Odds Ratio Estimates
Effect Point Estimate 95% Wald
Confidence Limits
hsgpa 1.122 1.055 1.192
hscalc 1.064 1.036 1.092
precalc 1.363 1.154 1.611

Association Statistics

Association of Predicted Probabilities and Observed Responses
Percent Concordant 82.9 Somers' D 0.658
Percent Discordant 17.1 Gamma 0.658
Percent Tied 0.0 Tau-a 0.309
Pairs 31740 c 0.829