updated analaysis with offshore considerations

This commit is contained in:
2026-02-18 17:07:03 -08:00
parent 61d274d6f3
commit 1342b06871
27 changed files with 9153 additions and 236 deletions

178
analysis/draft_appendix.md Normal file
View File

@@ -0,0 +1,178 @@
# Appendix: Heterogeneous Enforcement of Transparency
## Evidence from the Texas Railroad Commission
## Appendix A. Data Construction and Variables
### A1. Data integration
The analysis combines inspection and violation administrative records (2015-2025) into a district-year panel. Well-level linkage is done via `api_norm`.
### A2. Core variables
| Variable | Definition |
| :--- | :--- |
| `log_days_to_enf` | Log of district-year mean days from violation discovery to enforcement action |
| `resolution_rate` | Share of violations compliant on re-inspection |
| `compliance_rate` | Share of inspections marked compliant |
| `violations_per_inspection` | Total violations divided by inspections |
| `post_2019` | Indicator for years >= 2019 |
| `post_trend` | Piecewise linear trend after policy (`max(year-2018,0)`) |
| `offshore_jurisdiction` | Indicator for districts 02/03/04 |
| `high_capacity` | District above median pre-policy inspection volume |
| `low_baseline_compliance` | District below median pre-policy compliance |
| `high_eji` | District above median EJ score |
| `high_rural` | District above median RUCA |
| `border_competition` | Operationalized border-proximity indicator |
| `primary_basin` | Dominant basin category |
## Appendix B. Econometric Specifications
### B1. Interrupted panel (all districts; H1)
\[
Y_{dt}=\alpha_d + \beta_1 \text{YearNum}_t + \beta_2 \text{Post2019}_t + \beta_3 \text{PostTrend}_t + \varepsilon_{dt}
\]
### B2. District heterogeneity (H2)
\[
Y_{dt}=\alpha_d + \gamma_t + \sum_d \theta_d (\text{District}_d\times \text{Post2019}_t) + \varepsilon_{dt}
\]
### B3. Offshore moderation (H5)
\[
Y_{dt}=\alpha_d + \gamma_t + \sum_d \theta_d (\text{District}_d\times \text{Post2019}_t) + \phi(\text{Post2019}_t\times \text{Offshore}_d) + \varepsilon_{dt}
\]
All models report district-clustered standard errors.
## Appendix C. Main Run Outputs
### C1. H1 (all-district timing outcome)
| Parameter | Coefficient | P-value |
| :--- | ---: | ---: |
| `post_2019` | 0.1514 | 0.3294 |
| `post_trend` | -0.3603 | 0.0010 |
Interpretation: no significant immediate level shift; significant post-policy acceleration slope.
Substantively, this table supports the main-text conclusion that the policy effect is best characterized as gradual acceleration through the enforcement pipeline rather than a single break at policy adoption.
### C2. Event-study decomposition (all districts; ref=2018)
| Year | Coefficient | P-value |
| :--- | ---: | ---: |
| 2015 | -0.4592 | 0.0658 |
| 2016 | -0.3359 | 0.1615 |
| 2017 | -0.0385 | 0.7502 |
| 2019 | -0.1149 | 0.2843 |
| 2020 | -0.1666 | 0.3878 |
| 2021 | -0.4192 | 0.1072 |
| 2022 | -0.5853 | 0.0333 |
| 2023 | -0.4899 | 0.1160 |
| 2024 | -0.7829 | 0.0057 |
| 2025 | -1.4800 | <0.001 |
Pre-policy years are jointly non-significant in this decomposition.
The coefficient pattern reinforces parallel-pretrend credibility while showing that the post-policy effect strengthens in later years, consistent with delayed organizational adaptation.
### C3. Offshore differential annual effects (ref=2018)
| Year | Offshore differential coef | P-value |
| :--- | ---: | ---: |
| 2019 | 0.3479 | 0.1581 |
| 2020 | 0.1796 | 0.7089 |
| 2021 | 0.9121 | 0.1095 |
| 2022 | 0.7532 | 0.0652 |
| 2023 | 0.9166 | 0.0325 |
| 2024 | 1.0693 | 0.0280 |
| 2025 | 0.7233 | 0.2091 |
These estimates indicate that offshore jurisdictions diverge from non-offshore districts in specific post years rather than uniformly across the entire post period. The strongest differentials appear in 2023-2024.
### C4. H5 offshore moderator (conditional model)
| Parameter | Coefficient | P-value |
| :--- | ---: | ---: |
| `post_2019:offshore_jurisdiction` | 0.3819 | <0.001 |
See **Figure 4** in the main text (`district_treatment_effects_map_psj.png`) for the geographic distribution of district treatment effects.
Read alongside C3, this pooled interaction should be interpreted as an average offshore differential in the post period after district heterogeneity is already modeled, not as a claim that offshore status is the dominant driver of all district variation.
### C5. H3 moderator tests
Main block:
- H3a Capacity: -0.0188 (p=0.9415)
- H3b Baseline performance: -0.0884 (p=0.7144)
- H3e Border proximity: -0.2768 (p=0.3082)
- H5 (same block estimate): 0.6317 (p=0.1055)
Deep-dive block:
- H3c EJ: 0.1818 (p=0.4866)
- H3f Rurality: 0.2213 (p=0.4649)
- H3e Border proximity: -0.3626 (p=0.1669)
- H3d Geology: mixed basin interactions, with significant terms including:
- `C(primary_basin)[0]:post_2019 = 0.5322` (p<0.001)
- `C(primary_basin)[3]:post_2019 = -0.5707` (p<0.001)
Taken together, these moderator results imply that broad structural covariates provide limited explanatory leverage in this run, while basin composition remains the clearest structural correlate of differential policy response.
## Appendix D. Spatial Test (H4)
Moran's I on district treatment effects:
- Morans I = -0.0493
- Permutation p-value = 0.8550
Conclusion: no significant global spatial autocorrelation.
The sign and magnitude of Morans I are both small, indicating no evidence that high- or low-response districts are systematically clustered in ways consistent with regional diffusion.
## Appendix E. Robustness Tables
### E1. Placebo policy years (all-district interrupted model)
| Placebo year | Coefficient (`post`) | P-value |
| :--- | ---: | ---: |
| 2017 | 0.6565 | 0.0020 |
| 2021 | -0.0245 | 0.9191 |
The significant 2017 placebo estimate suggests that single-cut timing designs can produce spurious break effects, which is why the main analysis emphasizes trend-change evidence and event-study diagnostics instead of level shifts alone.
### E2. Alternative outcomes (all-district interrupted model)
| Outcome | `post` coef (p) | `post_trend` coef (p) |
| :--- | :--- | :--- |
| Resolution rate | 4.3721 (0.2104) | -2.9371 (0.1424) |
| Compliance rate | -0.1311 (0.9316) | -0.5562 (0.1870) |
| Violations per inspection | -0.0082 (0.6690) | 0.0106 (0.0600) |
This table shows that timing acceleration does not mechanically translate into improvements across all compliance-oriented outcomes in the same period, highlighting outcome-specific channels of policy response.
### E3. Sample restrictions (all-district interrupted model)
| Restriction | `post_2019` coef (p) | `post_trend` coef (p) |
| :--- | :--- | :--- |
| Full sample | 0.1514 (0.3294) | -0.3603 (0.0010) |
| Exclude extreme districts | 0.1917 (0.1930) | -0.2972 (0.0133) |
| Exclude 2015-2016 | 0.1942 (0.1958) | -0.2313 (0.0950) |
| Exclude 2020-2021 | 0.1516 (0.2959) | -0.3599 (0.0016) |
Across restrictions, the post-trend estimate remains negative and generally significant, while the post level term stays weak. This stability is central to the articles interpretation of gradual policy-induced acceleration.
### E4. Specification sensitivity
| Specification | `post` effect | `post_trend` effect |
| :--- | :--- | :--- |
| Linear interrupted | -41.9298 (p=0.3104) | -67.0420 (p=0.0100) |
| Winsorized interrupted | 0.2137 (p=0.1021) | -0.3147 (p=0.0016) |
| Year FE + district post terms | 13 interaction terms | N/A |
Specification checks again point to the same empirical hierarchy: slope effects are more robust than level effects, and district-specific post terms remain necessary to represent the observed heterogeneity.
## Appendix F. Interpretation Notes
1. The strongest system-wide evidence in this run is a **post-policy slope change**, not a one-time 2019 level shift.
2. District heterogeneity is substantial and statistically material.
3. Offshore jurisdiction contributes meaningfully in conditional models, but placebo behavior indicates caution in purely timing-based causal claims.
4. Spatial diffusion is not supported by global autocorrelation tests.