.

Data that earns its keep: verifying a 5,000company list for private equity

Data that earns its keep: verifying a 5,000company list for private equity

Large private equity lists are deferred costs unless verification is rigorous where it moves deals and transparent where it does not, so the standard is “faster to qualified conversations than a raw export” across origination, banker/intermediary mapping, LP outreach, and operator bench building. The method prioritizes decision power, regulatory hygiene, deliverability, and entity truth, with ambiguity documented in metadata, not buried in the rows.​

Source triangulation

Each record must align across the live website, a filings source, and LinkedIn; disagreements trigger a flag and manual check, and dead domains or unverifiable snapshots are excluded to prevent inflated counts and ghost targets. For UK exposures, layer identity verification for directors and PSCs under ECCTA timelines so list integrity reflects legal reality, not legacy filings or outdated boards.​

Role truth

Map contacts to buying and decision power by function and mandate: ownerCEO/CFO for founderled SMBs, Head of Corp Dev for strategics, Partner/MD for GPs, and CIO/Head of PE for LPs, with titles normalized to clear levels that reduce false positives without discarding useful connectors. Calibrate exceptions where influence is nonhierarchical, such as VP Engineering for technical diligences or Operating Partners for portfolio boltons.​

Email logic

Run syntax and domain checks first, then estimate bounce rates via stratified sampling with confidence intervals and publish segmentlevel notes to size sends rationally and protect sender reputation. Highrisk segments ship with a warning label or not at all, reflecting compliance norms that value list quality and consent alongside outreach speed.​

Firm tagging

Tag strategies, cheque sizes, control vs minority, platform vs addon, EBITDA bands, and sector themes to controlled vocabularies with an evidence line per tag (mandate excerpt, portfolio pattern, or filing), keeping “generalist” rare and “opportunistic” explicit. This taxonomy lets teams filter to bilaterallikely targets, rightsized processes, and LP personas that match strategy claims.​

Refresh cadence

Ship rolling updates with visible timestamps, version IDs, and change logs so teams can diff releases, decide if an update warrants a new campaign, and trace corrections endtoend like a verification exercise or data room append. Fold ECCTA/IDV status into portfolio and target entities on UK deals to avoid lastminute compliance blockers after outreach begins.​

Quality controls you can audit

  • Randomsample QA per release with documented error rates and specific corrective actions to close the loop.​
  • Knownbad patterns (catchalls, role inboxes) filtered, counted, and codified as suppression rules before any scale sends.​
  • Confidence bands for deliverability by segment so send volumes and cadences match expected outcomes.​
  • Coverage charts by region, strategy, cheque size, and control profile so sourcing gaps are obvious precampaign.​

Practical outputs

Provide saved views for platform vs addon, control vs minority, cheque size, EBITDA band, and sector theme, with notes where confidence is lower so investment teams can choose path and pace. This list only ships if it beats a raw export on speed to qualified outreach and includes audit trails, confidence bands, suppression rules, and any jurisdictional IDV flags; otherwise it is reworked or cut.​