✅ The environment is cleared and ready.

/Users/casparm4/Github/rsm-data-analytics-in-finance-private/private/assignment
> s/03-assignment
📁 Base directory: /Users/casparm4/Github/rsm-data-analytics-in-finance-private
> /private/assignments/03-assignment
📁 Raw data folder: /Users/casparm4/Github/rsm-data-analytics-in-finance-privat
> e/private/assignments/03-assignment/data/raw
📁 Output directory: /Users/casparm4/Github/rsm-data-analytics-in-finance-priva
> te/private/assignments/03-assignment/output
📁 Tables folder: /Users/casparm4/Github/rsm-data-analytics-in-finance-private/
> private/assignments/03-assignment/output/tables
📁 Figures folder: /Users/casparm4/Github/rsm-data-analytics-in-finance-private
> /private/assignments/03-assignment/output/figures

✅ Dataset loaded successfully

✅ Dataset loaded correctly

Contains data from /Users/casparm4/Github/rsm-data-analytics-in-finance-private
> /private/assignments/03-assignment/data/raw/auto_firms_event_crosssection.dta
 Observations:            84                  
    Variables:            24                  06 Jan 2026 15:08
-------------------------------------------------------------------------------
Variable      Storage   Display    Value
    name         type    format    label      Variable label
-------------------------------------------------------------------------------
gvkey           str6    %-9s                  
conm            str28   %-9s                  
fic             str3    %-9s                  
german          long    %12.0g                
car_0_5         double  %10.0g                
car_0_10        double  %10.0g                
log_at          double  %10.0g                
leverage        double  %10.0g                
roa             double  %10.0g                
debt_equity     double  %10.0g                
roe             double  %10.0g                
margin          double  %10.0g                
cash_ratio      double  %10.0g                
at              double  %10.0g                
sale            double  %10.0g                
dltt            double  %10.0g                
dlc             double  %10.0g                
ceq             double  %10.0g                
ebitda          double  %10.0g                
ib              double  %10.0g                
oibdp           double  %10.0g                
che             double  %10.0g                
n_days_0_5      long    %12.0g                
n_days_0_10     long    %12.0g                
-------------------------------------------------------------------------------
Sorted by:

    Variable |        Obs        Mean    Std. dev.       Min        Max
-------------+---------------------------------------------------------
     car_0_5 |         84   -.0074558    .0727016  -.3523587   .2633667
    car_0_10 |         84    .0072848    .1414629  -.8872483   .2801222
      german |         84    .0833333    .2780454          0          1
      log_at |         84    10.36085    3.473371   2.858938   18.80747
    leverage |         84    .2161082    .1966058          0   .9964587
-------------+---------------------------------------------------------
         roa |         84    .0615514    .1036582  -.4132906   .3438476

✅ Summary statistics computed correctly

(obs=84)

             |  car_0_5   german   log_at leverage      roa
-------------+---------------------------------------------
     car_0_5 |   1.0000
      german |  -0.4954   1.0000
      log_at |  -0.2459  -0.0106   1.0000
    leverage |   0.0084   0.0700   0.0684   1.0000
         roa |  -0.0652   0.0153   0.3246  -0.3275   1.0000

✅ Correlation matrix computed correctly

file /Users/casparm4/Github/rsm-data-analytics-in-finance-private/private/assig
> nments/03-assignment/output/figures/scatter_car_german.png written in PNG for
> mat
✅ Scatter plot saved: scatter_car_german.png

✅ Scatter plot (CAR vs German) saved correctly

file /Users/casparm4/Github/rsm-data-analytics-in-finance-private/private/assig
> nments/03-assignment/output/figures/scatter_car_size.png written in PNG forma
> t
✅ Scatter plot saved: scatter_car_size.png

✅ Scatter plot (CAR vs Size) saved correctly

✅ Exploratory analysis plots created successfully

      Source |       SS           df       MS      Number of obs   =        84
-------------+----------------------------------   F(1, 82)        =     26.68
       Model |  .107686651         1  .107686651   Prob > F        =    0.0000
    Residual |  .331011944        82  .004036731   R-squared       =    0.2455
-------------+----------------------------------   Adj R-squared   =    0.2363
       Total |  .438698595        83  .005285525   Root MSE        =    .06354

------------------------------------------------------------------------------
     car_0_5 | Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
-------------+----------------------------------------------------------------
      german |  -.1295467   .0250819    -5.16   0.000    -.1794425   -.0796508
       _cons |   .0033398   .0072405     0.46   0.646    -.0110639    .0177435
------------------------------------------------------------------------------
✅ Baseline model (m1) estimated and stored

✅ Baseline model (m1) estimated correctly

      Source |       SS           df       MS      Number of obs   =        84
-------------+----------------------------------   F(4, 79)        =      9.08
       Model |  .138161095         4  .034540274   Prob > F        =    0.0000
    Residual |    .3005375        79  .003804272   R-squared       =    0.3149
-------------+----------------------------------   Adj R-squared   =    0.2802
       Total |  .438698595        83  .005285525   Root MSE        =    .06168

------------------------------------------------------------------------------
     car_0_5 | Coefficient  Std. err.      t    P>|t|     [95% conf. interval]
-------------+----------------------------------------------------------------
      german |    -.13205   .0244421    -5.40   0.000    -.1807007   -.0833992
      log_at |  -.0057776   .0021024    -2.75   0.007    -.0099622   -.0015929
    leverage |   .0303046   .0372949     0.81   0.419    -.0439291    .1045383
         roa |   .0413695   .0744365     0.56   0.580    -.1067927    .1895316
       _cons |   .0543134   .0222178     2.44   0.017     .0100899    .0985369
------------------------------------------------------------------------------
✅ Model with controls (m2) estimated and stored

✅ Model with controls (m2) estimated correctly

----------------------------------------------
    Variable |      m1              m2        
-------------+--------------------------------
      german | -.12954665***   -.13204996***  
      log_at |                 -.00577757**   
    leverage |                  .03030462     
         roa |                  .04136945     
       _cons |  .00333977       .05431344*    
-------------+--------------------------------
           N |         84              84     
          r2 |  .24546842       .31493398     
----------------------------------------------
      Legend: * p<0.05; ** p<0.01; *** p<0.001

✅ Model with controls has correct specification

(option xb assumed; fitted values)
✅ Residuals and fitted values generated

✅ Residuals and fitted values generated correctly

file /Users/casparm4/Github/rsm-data-analytics-in-finance-private/private/assig
> nments/03-assignment/output/figures/residuals_vs_fitted.png written in PNG fo
> rmat
✅ Residual plot saved: residuals_vs_fitted.png

✅ Residual plot created and saved correctly

Breusch–Pagan/Cook–Weisberg test for heteroskedasticity 
Assumption: Normal error terms
Variable: Fitted values of car_0_5

H0: Constant variance

    chi2(1) =  23.19
Prob > chi2 = 0.0000
✅ Breusch-Pagan test completed

✅ Heteroskedasticity diagnostics completed

(bin=9, start=-.21515974, width=.05122012)
file /Users/casparm4/Github/rsm-data-analytics-in-finance-private/private/assig
> nments/03-assignment/output/figures/histogram_residuals.png written in PNG fo
> rmat
✅ Residual histogram saved: histogram_residuals.png

✅ Residual histogram saved correctly

                   Shapiro–Wilk W test for normal data

    Variable |        Obs       W           V         z       Prob>z
-------------+------------------------------------------------------
       resid |         84    0.92648      5.253     3.644    0.00013

✅ Shapiro-Wilk test completed correctly

Linear regression                               Number of obs     =         84
                                                F(4, 79)          =       4.61
                                                Prob > F          =     0.0021
                                                R-squared         =     0.3149
                                                Root MSE          =     .06168

------------------------------------------------------------------------------
             |               Robust
     car_0_5 | Coefficient  std. err.      t    P>|t|     [95% conf. interval]
-------------+----------------------------------------------------------------
      german |    -.13205   .0480099    -2.75   0.007    -.2276113   -.0364886
      log_at |  -.0057776   .0016403    -3.52   0.001    -.0090425   -.0025126
    leverage |   .0303046   .0388468     0.78   0.438     -.047018    .1076272
         roa |   .0413695   .0601818     0.69   0.494    -.0784194    .1611583
       _cons |   .0543134   .0182133     2.98   0.004     .0180608    .0905661
------------------------------------------------------------------------------
✅ Model with robust SE (m3) estimated and stored

✅ Robust SE model (m3) estimated correctly

--------------------------------------------------------------
    Variable |      m1              m2              m3        
-------------+------------------------------------------------
      german | -.12954665***   -.13204996***   -.13204996**   
      log_at |                 -.00577757**    -.00577757***  
    leverage |                  .03030462       .03030462     
         roa |                  .04136945       .04136945     
       _cons |  .00333977       .05431344*      .05431344**   
-------------+------------------------------------------------
           N |         84              84              84     
          r2 |  .24546842       .31493398       .31493398     
--------------------------------------------------------------
                      Legend: * p<0.05; ** p<0.01; *** p<0.001

(output written to /Users/casparm4/Github/rsm-data-analytics-in-finance-private
> /private/assignments/03-assignment/output/tables/regression_table.tex)
✅ Regression table exported: regression_table.tex

✅ Regression table exported correctly

Data Analytics for Finance

Regression Analysis

Setup¶

Learning Objectives¶

Research Question¶

Background: Event Study and Abnormal Returns¶

Section 1: Load and Examine Data¶

Task 1.1: Load the Dataset¶

Task 1.2: Examine the Dataset¶

Task 1.3: Summary Statistics¶

Section 2: Exploratory Analysis¶

Task 2.1: Correlation Matrix¶

Task 2.2: Scatter Plot - CAR vs German¶

Task 2.3: Scatter Plot - CAR vs Size¶

Checkpoint: Exploratory Analysis Complete¶

Section 3: Baseline OLS Regression¶

Task 3.1: Estimate Baseline Model¶

Section 4: Add Control Variables¶

Task 4.1: Regression with Controls¶

Task 4.2: Compare Models¶

Section 5: Diagnostics - Heteroskedasticity¶

Task 5.1: Generate Residuals and Fitted Values¶

Task 5.2: Plot Residuals vs Fitted Values¶

Task 5.3: Breusch-Pagan Test¶

Test: Heteroskedasticity Diagnostics¶

Section 6: Diagnostics - Normality¶

Task 6.1: Histogram of Residuals¶

Task 6.2: Shapiro-Wilk Test¶

Section 7: Robust Standard Errors¶

Task 7.1: Re-estimate with Robust Standard Errors¶

Task 7.2: Compare All Three Models¶

Section 8: Export Publication-Quality Regression Table¶

Task 8.1: Create LaTeX Regression Table¶

Section 9: Interpretation and Conclusions (optional/solution included)¶

Task 9.1: Interpret Main Findings¶

Task 9.2: Discuss Limitations¶

References¶