Path1 with SAS

STA313 F 2004 Handout 8

Path Model 1 with SAS

/* path1.sas */
options linesize=79 noovp formdlim='_';
title 'STA313f04 Path 1 Example';

data path1;
     infile 'path1.dat';
     input x1 x2 y1 y2;

proc calis cov;          /* Analyze the covariance matrix (Default is corr) */
     title2 'Full (unrestricted) Model';
     var x1 x2 y1 y2;    /* Manifest vars are in the data set */
     lineqs              /* Simultaneous equations, separated by commas */
          y1 = b1 x1 + e1,
          y2 = b2 y1 + b3 x2 + e2;
     std                  /* Variances (not standard deviations) */
          x1 = sigsqx1,   /* Optional starting values in parentheses */
          x2 = sigsqx2,
          e1 = sigsqe1,
          e2 = sigsqe2;
     cov                   /* Covariances */
          x1 x2 = sigma12; /* Unmentioned pairs get covariance zero */
     bounds 0.0 < sigsqx1,
            0.0 < sigsqx2,
            0.0 < sigsqe1,
            0.0 < sigsqe2;

proc calis cov;          /* Analyze the covariance matrix (Default is corr) */
     title2 'Reduced (restricted) Model: b3=0';
     var x1 x2 y1 y2;    /* Manifest vars are in the data set */
     lineqs              /* Simultaneous equations, separated by commas */
          y1 = b1 x1 + e1,
          y2 = b2 y1 + e2;
     std                  /* Variances (not standard deviations) */
          x1 = sigsqx1,   /* Optional starting values in parentheses */
          x2 = sigsqx2,
          e1 = sigsqe1,
          e2 = sigsqe2;
     cov                   /* Covariances */
          x1 x2 = sigma12; /* Unmentioned pairs get covariance zero  */
     bounds 0.0 < sigsqx1,
            0.0 < sigsqx2,
            0.0 < sigsqe1,
            0.0 < sigsqe2;

proc iml;
     title2 'Compute G two ways';
     print " ";
     print "Based on Fit Function";
     G1 = 300*(3.3328-0.0227); pval1 = 1-probchi(G1,1);
     print "G = " G1 ", df = 1, p = " pval1;
     print " ";
     print "Based on chi-square";
     G2 = 300/299 * (996.5153-6.7874); pval2 = 1-probchi(G2,1);
     print "G = " G2 ", df = 1, p = " pval2;

Before looking at the list file, here is a little discussion of how the test statistic G is being computed with proc iml. Notice that except for getting the p-value, these calculations could be done with a hand calculator.

Let us use the term "saturated model" for a model with no constraints on the covariance matrix of the manifest variables. This is the language we have been using in class. Any (identified) model with the same number of parameters as the unique elements of the covariance matrix is also saturated, and yields the same -2 Log Likelihood -- that is, any saturated model has a -2 Log likelihood equal to

n p ( 1 +log(2 pi) ) + n log(|Sigma_hat|) .

The equality of the -2 LL quantities for any saturated model follows from the invariance principle of maximum likelihood estimation, just for the record.

Now suppose you fit a non-saturated model. The difference between the quantity above and -2 LL for the model you fit is a reasonable test for the "goodness of fit" of your model. The null hypothesis is that your model holds, versus the alternative that there are no restrictions at all on the variance-covariance matrix of the manifest variables. The difference between the two -2LL quantities is a G; it's asymptotically chi-square, with degrees of freedom the difference between the number of parameters in your model and the number of parameters (unique elements of the covariance matrix) of the saturated model. This "goodness of fit" chisquare will equal zero (with df=0) only if you are fitting a model that is one-to-one with the saturated model.

If you fit an unrestricted model (but still maybe restricted compared to the saturated model) and you also fit a (more) restricted model, the DIFFERENCE between the 2 goodness of fit chi-square statistics is exactly our test statistic G for testing the null hypothesis that the restricted model is true versus the alternative that the unrestricted model is true. There are two ways to get the goodness of fit chisquare statistic from the SAS output for a model. Of course you need to fit a restriced and an unrestricted model, and subtract to get G.

The first way is based on the "Fit Function" of the SAS output, which equals 0.0227 for the Full (unrestricted) model in the path1 example, and 3.3328 for the reduced (restricted) model. Multiply it by n, and you get that goodness of fit chisquare, directly. Multiply the difference by n, and you get the test statistic we are seeking. Thus, what we want is G = 300*(3.3328-0.0227) = 993.03. That's G1 in the proc iml above.

The second way to get G is from the SAS "Chi-square" statistic; Chi-square is equal to 6.7874 for the Full (unrestricted) model, and 996.5153 for the Reduced (restricted) model in the SAS output. This is almost the right number. It's what we want, but multiplied by (n-1)/n. Don't ask me why they do this, but of course for very large samples, (n-1)/n has no effect, and the G test is based on large-sample theory. We will multiply by n/(n-1) to get the traditional likelihood ratio test. Thus, G = 300/299 * (996.5153-6.7874) = 993.038. That's G2 in the proc iml; it's equal to G1 except for rounding error.

For comparison, when we did this example with R (in Handout 7) we got G = 993.038.

Now here is path1.lst.


_______________________________________________________________________________

                           STA313f04 Path 1 Example                           1
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
           Covariance Structure Analysis: Pattern and Initial Values

                            LINEQS Model Statement


                       Matrix      Rows    Columns    ------Matrix Type-------

Term 1            1    _SEL_          4          6    SELECTION
                  2    _BETA_         6          6    EQSBETA        IMINUSINV
                  3    _GAMMA_        6          4    EQSGAMMA
                  4    _PHI_          4          4    SYMMETRIC


                          The 2 Endogenous Variables

    Manifest        y1  y2
    Latent


                          The 4 Exogenous Variables

    Manifest        x1  x2
    Latent
    Error           e1  e2

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           2
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
           Covariance Structure Analysis: Pattern and Initial Values

              Manifest Variable Equations with Initial Estimates


        y1      =        .*x1      +  1.0000 e1
                           b1
        y2      =        .*y1      +       .*x2      +  1.0000 e2
                           b2                b3


                       Variances of Exogenous Variables

                       Variable Parameter      Estimate

                       x1       sigsqx1               .
                       x2       sigsqx2               .
                       e1       sigsqe1               .
                       e2       sigsqe2               .


                     Covariances Among Exogenous Variables

                       Var1 Var2 Parameter      Estimate

                       x1   x2   sigma12               .

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           3
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

            Observations         300    Model Terms              1
            Variables              4    Model Matrices           4
            Informations          10    Parameters               8


                     Variable          Mean       Std Dev

                     x1             0.16588       2.27566
                     x2            -0.20103       3.08880
                     y1             0.11331       2.68760
                     y2            -0.41910      10.88369


                Set Covariances of Exogenous Manifest Variables

                                    x1  x2

NOTE: Some initial estimates computed by two-stage LS method.


_______________________________________________________________________________

                           STA313f04 Path 1 Example                           4
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

                          Vector of Initial Estimates

                 Parameter      Estimate    Type

            1    b2              2.09453    Matrix Entry: _BETA_[2:1]
            2    b1              1.00190    Matrix Entry: _GAMMA_[1:1]
            3    b3              2.98282    Matrix Entry: _GAMMA_[2:2]
            4    sigsqx1         5.17862    Matrix Entry: _PHI_[1:1]
            5    sigma12         0.44648    Matrix Entry: _PHI_[2:1]
            6    sigsqx2         9.54070    Matrix Entry: _PHI_[2:2]
            7    sigsqe1         2.02490    Matrix Entry: _PHI_[3:3]
            8    sigsqe2         3.23198    Matrix Entry: _PHI_[4:4]

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           5
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

                       Levenberg-Marquardt Optimization

                         Scaling Update of More (1978)

                   Parameter Estimates                    8
                   Functions (Observations)              10
                   Lower Bounds                           4
                   Upper Bounds                           0

                              Optimization Start

Active Constraints                   0  Objective Function        0.0274155346
Max Abs Gradient Element  0.2050710365  Radius                               1


                                                                        Actual
                                                      Max Abs             Over
         Rest    Func      Act    Objective  Obj Fun Gradient             Pred
 Iter    arts   Calls      Con     Function   Change  Element  Lambda   Change

    1       0       2        0      0.02272  0.00469   0.0127       0    0.994
    2       0       3        0      0.02270 0.000025 0.000596       0    0.998
    3       0       4        0      0.02270 3.992E-8 0.000037       0    0.983
    4       0       5        0      0.02270 1.18E-10 1.741E-6       0    0.978

                             Optimization Results

Iterations                           4  Function Calls                       6
Jacobian Calls                       5  Active Constraints                   0
Objective Function        0.0227001914  Max Abs Gradient Element  1.7411686E-6
Lambda                               0  Actual Over Pred Change   0.9775696387
Radius                     0.000047076

ABSGCONV convergence criterion satisfied.

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           6
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

         Fit Function                                          0.0227
         Goodness of Fit Index (GFI)                           0.9889
         GFI Adjusted for Degrees of Freedom (AGFI)            0.9446
         Root Mean Square Residual (RMR)                       1.7981
         Parsimonious GFI (Mulaik, 1989)                       0.3296
         Chi-Square                                            6.7874
         Chi-Square DF                                              2
         Pr > Chi-Square                                       0.0336
         Independence Model Chi-Square                         1466.5
         Independence Model Chi-Square DF                           6
         RMSEA Estimate                                        0.0895
         RMSEA 90% Lower Confidence Limit                      0.0215
         RMSEA 90% Upper Confidence Limit                      0.1675
         ECVI Estimate                                         0.0771
         ECVI 90% Lower Confidence Limit                       0.0620
         ECVI 90% Upper Confidence Limit                       0.1176
         Probability of Close Fit                              0.1372
         Bentler's Comparative Fit Index                       0.9967
         Normal Theory Reweighted LS Chi-Square                6.7003
         Akaike's Information Criterion                        2.7874
         Bozdogan's (1987) CAIC                               -6.6202
         Schwarz's Bayesian Criterion                         -4.6202
         McDonald's (1989) Centrality                          0.9921
         Bentler & Bonett's (1980) Non-normed Index            0.9902
         Bentler & Bonett's (1980) NFI                         0.9954
         James, Mulaik, & Brett (1982) Parsimonious NFI        0.3318
         Z-Test of Wilson & Hilferty (1931)                    1.8416
         Bollen (1986) Normed Index Rho1                       0.9861
         Bollen (1988) Non-normed Index Delta2                 0.9967
         Hoelter's (1983) Critical N                              265

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           7
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

                  Manifest Variable Equations with Estimates


       y1      =   1.0019*x1       +  1.0000 e1
       Std Err     0.0362 b1
       t Value    27.7053
       y2      =   2.0486*y1       +  2.9828*x2       +  1.0000 e2
       Std Err     0.0386 b2          0.0336 b3
       t Value    53.0060            88.6977


                       Variances of Exogenous Variables

                                                 Standard
           Variable Parameter      Estimate         Error    t Value

           x1       sigsqx1         5.17862       0.42354      12.23
           x2       sigsqx2         9.54070       0.78030      12.23
           e1       sigsqe1         2.02490       0.16561      12.23
           e2       sigsqe2         3.21678       0.26309      12.23


                    Covariances Among Exogenous Variables

                                                 Standard
          Var1 Var2 Parameter      Estimate         Error    t Value

          x1   x2   sigma12         0.44648       0.40732       1.10

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           8
                           Full (unrestricted) Model
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

            Manifest Variable Equations with Standardized Estimates


        y1      =   0.8483*x1      +  0.5295 e1
                           b1
        y2      =   0.4947*y1      +  0.8278*x2      +  0.1611 e2
                           b2                b3


                         Squared Multiple Correlations

                                     Error         Total
                    Variable      Variance      Variance    R-Square

               1    y1             2.02490       7.22317      0.7197
               2    y2             3.21678     123.88541      0.9740


                     Correlations Among Exogenous Variables

                       Var1 Var2 Parameter      Estimate

                       x1   x2   sigma12         0.06352

_______________________________________________________________________________

                           STA313f04 Path 1 Example                           9
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
           Covariance Structure Analysis: Pattern and Initial Values

                            LINEQS Model Statement


                       Matrix      Rows    Columns    ------Matrix Type-------

Term 1            1    _SEL_          4          6    SELECTION
                  2    _BETA_         6          6    EQSBETA        IMINUSINV
                  3    _GAMMA_        6          4    EQSGAMMA
                  4    _PHI_          4          4    SYMMETRIC


                          The 2 Endogenous Variables

    Manifest        y1  y2
    Latent


                          The 4 Exogenous Variables

    Manifest        x1  x2
    Latent
    Error           e1  e2

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          10
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
           Covariance Structure Analysis: Pattern and Initial Values

              Manifest Variable Equations with Initial Estimates


                 y1      =        .*x1      +  1.0000 e1
                                    b1
                 y2      =        .*y1      +  1.0000 e2
                                    b2


                       Variances of Exogenous Variables

                       Variable Parameter      Estimate

                       x1       sigsqx1               .
                       x2       sigsqx2               .
                       e1       sigsqe1               .
                       e2       sigsqe2               .


                     Covariances Among Exogenous Variables

                       Var1 Var2 Parameter      Estimate

                       x1   x2   sigma12               .

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          11
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

            Observations         300    Model Terms              1
            Variables              4    Model Matrices           4
            Informations          10    Parameters               7


                     Variable          Mean       Std Dev

                     x1             0.16588       2.27566
                     x2            -0.20103       3.08880
                     y1             0.11331       2.68760
                     y2            -0.41910      10.88369


                Set Covariances of Exogenous Manifest Variables

                                    x1  x2

NOTE: Some initial estimates computed by two-stage LS method.


_______________________________________________________________________________

                           STA313f04 Path 1 Example                          12
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

                          Vector of Initial Estimates

                 Parameter      Estimate    Type

            1    b2              2.09622    Matrix Entry: _BETA_[2:1]
            2    b1              1.00190    Matrix Entry: _GAMMA_[1:1]
            3    sigsqx1         5.17862    Matrix Entry: _PHI_[1:1]
            4    sigma12         0.44648    Matrix Entry: _PHI_[2:1]
            5    sigsqx2         9.54070    Matrix Entry: _PHI_[2:2]
            6    sigsqe1         2.02490    Matrix Entry: _PHI_[3:3]
            7    sigsqe2        88.11855    Matrix Entry: _PHI_[4:4]

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          13
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

                       Levenberg-Marquardt Optimization

                         Scaling Update of More (1978)

                   Parameter Estimates                    7
                   Functions (Observations)              10
                   Lower Bounds                           4
                   Upper Bounds                           0

                              Optimization Start

Active Constraints                   0  Objective Function        3.3330031833
Max Abs Gradient Element  0.0075986595  Radius                               1


                                                                        Actual
                                                      Max Abs             Over
         Rest    Func      Act    Objective  Obj Fun Gradient             Pred
 Iter    arts   Calls      Con     Function   Change  Element  Lambda   Change

    1       0       2        0      3.33283 0.000176 1.998E-6       0    1.000

                             Optimization Results

Iterations                           1  Function Calls                       3
Jacobian Calls                       2  Active Constraints                   0
Objective Function        3.3328270857  Max Abs Gradient Element  1.9984173E-6
Lambda                               0  Actual Over Pred Change              1
Radius                    0.0375337328

ABSGCONV convergence criterion satisfied.

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          14
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

         Fit Function                                          3.3328
         Goodness of Fit Index (GFI)                           0.6694
         GFI Adjusted for Degrees of Freedom (AGFI)           -0.1018
         Root Mean Square Residual (RMR)                       8.7264
         Parsimonious GFI (Mulaik, 1989)                       0.3347
         Chi-Square                                          996.5153
         Chi-Square DF                                              3
         Pr > Chi-Square                                       <.0001
         Independence Model Chi-Square                         1466.5
         Independence Model Chi-Square DF                           6
         RMSEA Estimate                                        1.0524
         RMSEA 90% Lower Confidence Limit                      0.9980
         RMSEA 90% Upper Confidence Limit                      1.1079
         ECVI Estimate                                         3.3804
         ECVI 90% Lower Confidence Limit                       3.0430
         ECVI 90% Upper Confidence Limit                       3.7431
         Probability of Close Fit                              0.0000
         Bentler's Comparative Fit Index                       0.3197
         Normal Theory Reweighted LS Chi-Square              295.2477
         Akaike's Information Criterion                      990.5153
         Bozdogan's (1987) CAIC                              976.4040
         Schwarz's Bayesian Criterion                        979.4040
         McDonald's (1989) Centrality                          0.1909
         Bentler & Bonett's (1980) Non-normed Index           -0.3605
         Bentler & Bonett's (1980) NFI                         0.3205
         James, Mulaik, & Brett (1982) Parsimonious NFI        0.1602
         Z-Test of Wilson & Hilferty (1931)                   22.0440
         Bollen (1986) Normed Index Rho1                      -0.3590
         Bollen (1988) Non-normed Index Delta2                 0.3211
         Hoelter's (1983) Critical N                                4

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          15
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

                  Manifest Variable Equations with Estimates


                y1      =   1.0019*x1       +  1.0000 e1
                Std Err     0.0362 b1
                t Value    27.7053
                y2      =   2.0499*y1       +  1.0000 e2
                Std Err     0.2020 b2
                t Value    10.1483


                       Variances of Exogenous Variables

                                                 Standard
           Variable Parameter      Estimate         Error    t Value

           x1       sigsqx1         5.17862       0.42354      12.23
           x2       sigsqx2         9.54070       0.78030      12.23
           e1       sigsqe1         2.02490       0.16561      12.23
           e2       sigsqe2        88.11855       7.20687      12.23


                    Covariances Among Exogenous Variables

                                                 Standard
          Var1 Var2 Parameter      Estimate         Error    t Value

          x1   x2   sigma12         0.44648       0.40732       1.10

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          16
                       Reduced (restricted) Model: b3=0
                                                 10:17 Friday, November 5, 2004

                              The CALIS Procedure
         Covariance Structure Analysis: Maximum Likelihood Estimation

            Manifest Variable Equations with Standardized Estimates


                 y1      =   0.8483*x1      +  0.5295 e1
                                    b1
                 y2      =   0.5062*y1      +  0.8624 e2
                                    b2


                         Squared Multiple Correlations

                                     Error         Total
                    Variable      Variance      Variance    R-Square

               1    y1             2.02490       7.22317      0.7197
               2    y2            88.11855     118.47013      0.2562


                     Correlations Among Exogenous Variables

                       Var1 Var2 Parameter      Estimate

                       x1   x2   sigma12         0.06352

_______________________________________________________________________________

                           STA313f04 Path 1 Example                          17
                              Compute G two ways
                                                 10:17 Friday, November 5, 2004




                            Based on Fit Function


                                G1                    PVAL1

                    G =     993.03 , df = 1, p =          0





                              Based on chi-square


                                G2                    PVAL2

                    G =  993.03803 , df = 1, p =          0