Advanced Methods¶
This page documents the structural-econometric machinery underlying the v5 (JPubE submission-ready) version of the paper. Five components combine: (1) drop-out point identification of type-specific cost distributions; (2) auction-level unobserved-heterogeneity correction; (3) Athey--Seira treatment of equilibrium entry; (4) Bayes--Nash counterfactual simulation; (5) Saez--Stantcheva welfare weights.
1. Type-specific cost distributions (Haile and Tamer 2003)¶
The Pregao auction is a descending-clock electronic English-reverse auction. Under independent private values, the weakly dominant strategy is to exit when the running clock price reaches one's own cost. Drop-out prices of losers therefore point-identify their costs, without requiring inversion of a bid function or parametric assumptions on bidding.
Three estimators of Fck are used:
| Estimator | Bias | Use |
|---|---|---|
| Losers-only | Overstates Fck in the left tail (omits the lowest cost draw per auction) | Reference / DiD-equivalent |
| All-bidders (baseline) | Slight downward bias throughout (treats winner's final clock price ≈ c(2) as their cost) | Main specification |
| Turnbull NPMLE | Point-identified (treats winner formally as left-censored at c(2)) | Robustness benchmark |
Across the eight (class × type × period) strata, the median estimates satisfy Turnbull ≤ all-bidders ≤ losers-only in all 8 strata. The BNE decomposition computed on the Turnbull input differs from the all-bidders baseline by at most 16% of the total effect across classes.
Cross-modality consistency check¶
Convite auctions in the same sample are first-price sealed-bid; the Guerre, Perrigne, and Vuong (2000) inversion recovers Fck from the Convite bid distribution. The Convite GPV and Pregao drop-out estimates coincide to approximately one percentage point at the median in pharma non-SME Pre — the stratum where the comparison is least confounded by the policy reform. The check also serves as an indirect collusion screen: the bidder-group detection tests of Conley and Decarolis (2016) would flag systematic divergence between GPV-inverted bids and drop-out exits in collusive settings; the convergence reported here is consistent with non-collusive equilibrium play.
2. Auction-level heterogeneity correction (Krasnokutskaya 2011)¶
The normalized log-bid is decomposed as
where at is an auction-level scale common to all bidders in auction t, and eit is bidder-specific. Method-of-moments variance decomposition delivers stratum-specific σa2,k and σe2,k, and a best-linear-predictor shrinkage strips at out of every log-bid:
Intraclass correlation ρk ranges from approximately 0.36 to 0.59 across the three Pregao cells outside the low-n pharma-SME-Pre stratum — comparable to the 0.50--0.70 range Krasnokutskaya (2011) reports for California highway contracts.
The correction closes 93% of the cross-modality median gap in pharma SME Post, 23% in pharma non-SME Post, and leaves the non-pharma gap largely intact — evidence that auction-level heterogeneity is the binding constraint in pharma but only part of the story in non-pharma.
3. Equilibrium entry (Athey and Seira 2011 spirit)¶
Entry is modeled as Poisson arrivals with stratum-specific rates. The Bayes-Nash counterfactual takes equilibrium arrival rates as primitives rather than solving for them explicitly:
- Pre-period counts (NSMEPre, N¬Pre) — status-quo arrival rates under the open regime
- Post-period counts (NSMEPost, 0) — equilibrium arrival under the restricted regime
In the main specification, FcSME,Post is treated as an equilibrium-selection object generated by the protected regime rather than forced to equal FcSME,Pre. The Online Appendix reports the strict-invariance limiting case in which that equality is imposed as a benchmark — the same rule, but with the protected SME pool drawn from the open-regime primitive.
A KS test on UH-cleaned non-SME Fc rejects strict invariance in three of the four Pregao strata (KS distances 0.06--0.14), providing the empirical foundation for adopting equilibrium selection as the main specification rather than strict invariance.
A zero-profit calculation applied to the simulated winning-margin profit distribution delivers an implied entry cost per bidder in the R$0.11--R$2.46 range, with non-SMEs facing roughly 4.5 times the SME entry cost across both classes.
4. Bayes-Nash counterfactual simulation¶
With Fck and entry counts in hand, the BNE counterfactual is a Monte Carlo over draws from the estimated primitives. For each of B = 2,000 simulated auctions:
- Draw nSME ~ Poisson(N̄SME) and analogously for non-SMEs.
- Sample costs from the corresponding Fck.
- Sort the realized costs.
- Record c(1) (production cost of the lowest-cost firm), c(2) (expected transaction price under revenue equivalence), and the winner's type.
Aggregating over draws yields the means, quantiles, and decompositions reported in Results. Standard errors come from a cluster bootstrap at the auction level with Bbs = 500 replicates, which preserves within-auction correlation through the auction-level scale at.
The simulated total effect on the second-order statistic decomposes arithmetically as
5. Welfare weights (Saez and Stantcheva 2016)¶
A social planner aggregating producer and consumer surplus with weight wSME on SME producer surplus is indifferent between V0 (full set-aside) and V3 (10% price preference) when
The Loss term combines allocative DWL and the marginal-cost-of-public-funds distortion from the welfare arithmetic of Extensions:
The implied weights wSME★ = 2.42 (non-pharma) and 2.61 (pharma) under the main specification sit above the 1.2--1.5 range typical of Brazilian cash-transfer programs. Under strict invariance the pharmaceutical weight falls to 0.7, below the utilitarian benchmark — the diagnostic bifurcation that organizes the policy discussion.
6. Bridging the structural counterfactual to the DiD coefficient¶
Four mechanical sources of divergence account for the structural object via additive contributions against the DiD baseline:
| Source | Non-pharma | Pharma |
|---|---|---|
| (a) Sample restriction (Pregao only, cε ≤ 3, n ≥ 2) | +0.025 | +0.060 |
| (b) UH cleaning (Krasnokutskaya scale shrinkage) | +0.040 | +0.075 |
| (c) Functional form (c(2) vs. linear log-mean) | +0.050 | +0.045 |
| (d) Conditioning set (counterfactual vs. realized entry) | +0.020 | +0.005 |
| Sum + DiD baseline ≈ | +0.255 | +0.305 |
| Structural BNE simulation, pS3 − pS1 | +0.259 | +0.308 |
| Residuals (non-linear interactions) | +0.004 | +0.003 |
The 3--4× gap between the structural and the DiD is informally attributable to the four sources above and is predictable in direction under the maintained structural assumptions. A misspecified model would not be guaranteed to deliver a gap of either consistent sign or consistent magnitude. The fact that the gap obtains in the same direction and at comparable magnitude in both classes is consistent with correct specification of asymmetric IPV with type-removal.
7. Appendix DiD (validation, not main object)¶
The reduced-form DiD against the 76 never-treated product groups serves as design validation rather than the main object. With item and month fixed effects, the 18-month coefficient on absolute log-price is −0.10 to −0.11, and on the price-to-reference ratio approximately −0.06. These magnitudes are reported in Appendix B of the manuscript.