Skip to content

Analyses — Cheap Signals, Costly Proof

AN pages predating 2026-06-04 are superseded by the canonical re-estimation

On 2026-06-04 the paper's validation target was replaced by a reproducible, non-circular label — 651 always-loser cobidders (341 frequent-loser, 310 non-frequent-loser; the frequent-loser flag is never used to build the label) — and every result was re-estimated. The old 193-cobidder target (and the AUC/PR/timing/case-dominance/ bid-benchmark numbers derived from it) is retired as circular and irreproducible. Individual AN pages authored before this date carry a banner to that effect and may still display pre-revision numbers; the canonical figures live on the Paper, Results, and Robustness pages and in NEW_NUMBERS_MAP.md. Where an AN page and the canonical pages diverge, the canonical pages win. AN entries below are not renumbered or deleted.

This page is the directory of analyses for Cheap Signals, Costly Proof: The Reach and Limits of Award-Layer Screening in Cartel Enforcement. Each AN-NNN page is anchored by a YAML frontmatter block that drives the generated index below.

This page is the directory of designs and results. For curated claims about the world — sentences that could go in the paper — see Findings. For the testable predictions and their cross-references to AN pages, see Hypotheses.


What the analyses establish

The paper's contribution is a decomposition method plus a reach-and-limits map, not a cartel detector. Cheap award-layer records (who participated, who lost) can order forensic priority — they cannot prove conduct. The analyses below decompose what looks like a strong screen into three pieces and trace where each one reaches and where it stops:

  1. Mechanical co-participation exposure. Most of the raw discriminating power is exposure: firms that bid in more tenders mechanically overlap more adjudicated cartel environments. Against the canonical 651-cobidder label the raw award score reaches only ROC 0.761 / PR-AUC 0.143, and genuine label-blind opportunity ranks the label at only 0.553 (ranking by observed contact reaches 0.905, but that is mechanical label encoding — a cobidder is a firm with positive contact — not a competing model). Stripping exposure out — comparing firms within the same opportunity stratum — leaves a within-stratum AUC of 0.471 (≈ chance); the only positive is a fragile nested increment of +0.010 (DeLong p = 0.013) that is not robust across designs (matched permutation p = 0.127; FL-enrichment p = 0.067). There is no robust residual signal net of opportunity.
  2. Single-case concentration. Much of the operational performance leans on one adjudicated case. Leave-largest-out cuts the single-case PR-AUC from 0.143 to 0.090 (−37%); one case accounts for ≈ 32% of positives and 45.4% of true positives at k = 500. The estimated ranking is case-sensitive, not portable.
  3. Timing. Strict ex-ante construction reaches only ~0.68 inside the training always-loser pool; at full-universe scale it falls below chance (ROC ≈ 0.474, precision@500 = 0). The screen is largely retrospective among incumbents, not a prospective alarm.

What the screen does not do is equally load-bearing: the FL-binary AUC ≈ 0.49 against direct CADE defendants (the cartel's winners) is a scope boundary by design (the continuous score ranks them at 0.66–0.70), not a failure.

Reading the catalogue by threat

Threat the analysis confronts AN pages
Construct — how the rank is built; is the FL14 cutoff arbitrary? AN-001, AN-002, AN-003, AN-023, AN-025
Opportunity — is it just exposure / participation volume? AN-004, AN-005, AN-027, AN-028, AN-041, AN-043, AN-044
Cross-platform — does the audit port to a second procurement system? AN-043
Timing — would it flag firms before cases closed? AN-006, AN-013, AN-015, AN-029, AN-030
Single-case concentration — does one case carry it? AN-013, AN-034, AN-036
Scope — what the screen cannot and should not cover AN-007, AN-016, AN-018, AN-027
Division of labor — award layer vs bid layer AN-010, AN-011, AN-033, AN-035
Price = scope, not damages AN-019, AN-020, AN-022, AN-037, AN-038, AN-039, AN-040
Leakage — is ~1.0 AUC memorization? AN-014, AN-017

Throughout: the rank orders forensic priority; it does not detect or prove. FL14 is an administrative cutoff (median + 1.5 × IQR), not a structural threshold; the continuous log-participation score is the underlying primitive. Price differentials are read as scope, not damages or overcharge.


How to read an AN page

Each an-NNN-<slug>.md follows the same skeleton:

  • Frontmatter: identifier, status, hypothesis bearing on, script and output path, design block (sample, specification, FEs, etc.).
  • ## Question — the precise research question.
  • ## Design — sample, variation, specification, identification threats.
  • ## Results — headline number + headline table or figure, with a link to the underlying output.
  • ## Interpretation — what the result means; confidence color with justification.
  • ## Follow-ups — forward-looking only; analyses not yet run.

Status. pending (scaffolded, not yet run or not yet interpreted) → done (script run, headline + interpretation written, confidence set) → stale (superseded by a newer AN or by upstream-data change).

Confidence. 🟢 green (clean identification, large sample, robust), 🟡 yellow (informative with caveats), 🔴 red (unreliable, kept for the record only). Set only when status: done.


Generated index

All 44 AN pages, auto-generated from the YAML frontmatter of each docs/analyses/an-NNN-*.md via scripts/gen_analysis_index.py + scripts/render_indexes.py. The machine-readable form lives at docs/reference/analysis-index.yaml.

AN Type Status Conf. Hypothesis Question
AN-001 descriptive done 🟢 cobidder-concentration How is the persistent-zero-win-participation rank constructed, and what is its distribution across always-los…
AN-002 robustness done 🟡 cobidder-concentration How does the cobidder AUC change as the IQR threshold is varied, and is the median + 1.5 × IQR cutoff disting…
AN-003 descriptive done 🟢 cobidder-concentration How are CADE direct defendants and adjudication-anchored cobidders linked to BEC firms via CNPJ root, and wha…
AN-004 descriptive done 🟢 cobidder-concentration Does the FL14 stratum contain a disproportionate share of CADE-adjudication-anchored cobidders relative to th…
AN-005 placebo done 🟢 exposure-discipline Does cobidder concentration in the FL14 stratum survive a participation-volume-matched placebo, and how far i…
AN-006 robustness done 🟡 timing-discipline Does cobidder concentration survive when the FL score is formed strictly before the target window?
AN-007 placebo done 🟢 direct-defendants-null Does the FL score discriminate direct CADE defendants? It should not — by design.
AN-008 descriptive done 🟡 cobidder-profile-distinct Within the FL14 stratum, how do cobidders differ from non-cobidder FLs along buyer breadth and operational fo…
AN-009 descriptive done 🟡 cobidder-profile-distinct Do cobidders inside the FL14 stratum operate in more concentrated product portfolios than non-cobidder FLs, a…
AN-010 descriptive done 🟢 award-bid-complementarity How does the seven-feature Imhof–Wallimann bid-distribution pipeline perform on the cobidder target, and what…
AN-011 descriptive done 🟢 award-bid-complementarity Does the continuous log(1+tenders_count) dominate the binary FL14 on the cobidder target?
AN-012 descriptive done 🟡 gatekeeping-cost-of-evidence What are the in-sample precision@k and lift metrics for the FL ranking used as a forensic gatekeeper?
AN-013 robustness done 🟢 gatekeeping-cost-of-evidence What are the temporal-holdout precision@k and lift metrics, and how much does the in-sample evaluation inflat…
AN-014 robustness done 🟢 gatekeeping-cost-of-evidence How much does item-level evaluation leak relative to out-of-fold and temporal-holdout retraining?
AN-015 descriptive done 🟢 award-bid-complementarity D1 gate diagnostic — does the continuous score dominate FL14 on a harmonized same-sample horse race, and do t…
AN-016 descriptive done 🟡 price-scope-sign-reversal D2 gate diagnostic — does the FL screen discriminate cobidders better in Convite or in Pregão environments?
AN-017 robustness done 🟢 cobidder-concentration D3 gate diagnostic — does the continuous score preserve the loser-side thesis without FL14, and what is the i…
AN-018 descriptive done 🟢 direct-defendants-null D4 gate diagnostic — what share of direct CADE defendants are always-losers, and what is their win-rate distr…
AN-019 descriptive done 🟡 price-scope-sign-reversal Does the negotiated-price coefficient at the procurement-cap threshold reverse sign when FL14 presence is int…
AN-020 descriptive done 🟡 price-scope-sign-reversal Does the 2018 procurement decree shift price dynamics differently across modalities, consistent with the scop…
AN-021 robustness done 🟡 cobidder-profile-distinct Does the "first-time FL" effect on cobidder concentration survive propensity-score matching?
AN-022 placebo done 🟡 price-scope-sign-reversal Do FL-margin price effects differ by procurement modality, and does the Pregão-only subsample replicate the f…
AN-023 robustness done 🟡 cobidder-concentration Does the operational mapping from theory (loser-side concentration) to implementation (FL14) survive an expli…
AN-024 descriptive done 🟡 cobidder-profile-distinct How does the unified mechanism profile (HHI × pairs × heterogeneity quadrants) characterize FL cobidders rela…
AN-025 robustness done 🟢 cobidder-concentration How does cobidder AUC vary as the FL cutoff sweeps from FL2 through FL100, and is FL14 picking up an arbitrar…
AN-026 robustness done 🟢 cobidder-concentration Does the cobidder concentration result survive across always-loser sub-populations defined by bid-microdata a…
AN-027 descriptive done 🟢 exposure-discipline How does AUC behave when the universe and the positive class are systematically varied — does the loser-side…
AN-028 descriptive done 🟢 exposure-discipline Within the always-loser stratum, are cobidders distinguishable from non-cobidder FLs along dimensions other t…
AN-029 robustness done 🟢 timing-discipline Does the FL screen preserve discrimination under three progressively-earlier train windows, evaluated against…
AN-030 descriptive done 🟢 timing-discipline How much do the firms, markets, and procuring buyers in 2017–2019 overlap with those in 2009–2016? Is the out…
AN-031 descriptive done 🟡 cobidder-profile-distinct Do cobidders display bid-level behavior distinct from non-cobidder FLs, independent of participation volume?
AN-032 robustness done 🟡 cobidder-profile-distinct Does the quadrant-level heterogeneity (HHI × pairs) of the cobidder profile survive propensity-score matching…
AN-033 descriptive done 🟢 award-bid-complementarity How significant is the incremental value of the award-layer score added to the Imhof bid-distribution pipelin…
AN-034 descriptive done 🟢 award-bid-complementarity When deployed sequentially (FL gatekeeper → Imhof forensic stage) vs jointly, how does the cost-of-evidence t…
AN-035 descriptive done 🟢 gatekeeping-cost-of-evidence Across the full architecture × k × regime grid, what are the recall, precision, and bid-microdata cost trade-…
AN-036 robustness done 🟡 gatekeeping-cost-of-evidence Are the precision@k metrics stable across cross-validation folds, or do they depend on a specific random spli…
AN-037 descriptive done 🟢 price-scope-sign-reversal How does the FL-margin price coefficient transform across baseline → overlap-cell → ATT specifications, and d…
AN-038 descriptive done 🟡 price-scope-sign-reversal At the item-group and operating-cell level, where does the negative FL-price coefficient hold and where does…
AN-039 descriptive done 🟢 price-scope-sign-reversal Do cartels with cover bidders endogenously select into cells where the underlying (non-treated) price level i…
AN-040 descriptive done 🟢 price-scope-sign-reversal Within overlap cells, does FL presence depress the observed winner bid relative to the reference price? Does…
AN-041 descriptive done 🟢 cobidder-profile-distinct Does the within-FL distinctness of cobidders (AN-028 participation dimensions, AN-031 bid-level gap-to-winner…
AN-042 descriptive done 🟡 cobidder-profile-distinct Are cobidders distinct from non-cobidder FLs on bid TIMING (revision intensity, inter-bid interval, last-bid…
AN-043 validation provisional 🟡 exposure-discipline Does the award-layer evidence-triage protocol replicate on a SECOND, institutionally distinct procurement platform (feder…
AN-044 methodological provisional 🟡 exposure-discipline Why does the raw award-screen AUC inflate, and can the inflation be characterized as a body-level object (the paper's lead contribution) with a portable leading-order sufficient statistic? …

Status legend. done = analysis run and interpretation written; pending = scaffolded only; stale = superseded.

Confidence legend. 🟢 green (clean identification, robust); 🟡 yellow (informative with caveats); 🔴 red (kept for the record, not load-bearing).