Identity resolution software with deterministic + probabilistic matching
Deterministic-first + probabilistic-fallback identity resolution at multi-location-operator scale — feeds the canonical customer graph every AI content and decisioning agent consumes.
The problem
The same customer hits your website on desktop Tuesday, mobile app Wednesday, in-store Friday, calls the support line Saturday, and emails sales Monday. Mixpanel sees three different visitor IDs. Klaviyo has two email addresses. Your POS records the in-store visit as anonymous. Your call-tracking platform logs the phone number. They are all the same customer but no system knows it.
LiveRamp ($50,000-$500,000+/year — category leader), TransUnion Neustar, Acxiom, Merkle Sonar, Throtle, and Tapad are built for ad-tech identity resolution at enterprise scale with 6-9 month integration. Segment ($120-$1,800+/month), Tealium ($50,000+/year), mParticle, Klaviyo CDP, and Bloomreach bundle deterministic-rule-based identity stitching; probabilistic matching requires custom configuration. Salesforce Customer 360 ($30,000-$300,000+/year), Adobe Real-Time CDP, Oracle Unity, and Treasure Data include identity resolution but tie deeply to their platform. Informatica MDM ($100,000-$500,000+/year), Reltio, and Stibo Systems run customer + product MDM at enterprise scale with 6-12 month implementation. Identity-graph data brokers (LiveRamp IdentityLink, Tapad Device Graph, Experian Mosaic, Oracle Data Cloud) sell access to their graphs. DIY costs $80,000-$150,000/year per data-engineer FTE and requires ML expertise for probabilistic matching.
The gap is identity resolution combining deterministic + probabilistic matching at multi-location-operator scale with per-vertical and per-jurisdiction privacy constraints.
What success looks like
Deterministic matches (email, phone, login ID, loyalty ID, POS transaction key, payment method) resolve first with high confidence. Probabilistic-fallback matching (device fingerprint, browser cookie, behavioral signature) fills gaps with calibrated confidence scores. Operator-defined thresholds determine which matches merge into the canonical customer graph; borderline matches route to borderline-routing review.
Multi-location operators get cross-location resolution with cross-location LTV roll-up. Multi-brand portfolios get corporate base graph with per-brand sub-graphs; cross-brand resolution gated by privacy policies and operator-defined consent. In-store anonymous visits link to known identity at loyalty scan, payment, or email capture moment; behavioral-signal-ingestion feeds the resolution graph at runtime.
Per-vertical-compliance-overlay and per-jurisdiction-overlay-config condition resolution per applicable privacy regime. CPRA / CCPA / GDPR data-subject-access-request response surfaces every identifier linked to a subject; right-to-deletion propagates across the canonical graph. Behavioral-cohort-computation, customer-change-event-emission, ltv-math-primitives, churn-prediction, and save-flow-propensity-scoring all consume the resolved customer graph as their canonical input. LiveRamp, TransUnion Neustar, and other ad-tech graphs remain useful for off-platform addressability; in-house operations run on this resolved graph.
How most operators solve this today
Six tiers of incumbent tools — none combine deterministic + probabilistic resolution at multi-location-operator scale with per-vertical / per-jurisdiction privacy constraints.
Identity-resolution specialists (LiveRamp, TransUnion Neustar, Acxiom, Merkle Sonar, Throtle, Tapad)
$50,000-$500,000+/year
Built for ad-tech identity resolution at enterprise scale. Expensive plus complex 6-9 month integration. Multi-location-operator scale is uneconomic.
CDP-bundled identity (Segment, Tealium, mParticle, Klaviyo CDP, Bloomreach)
$120-$50,000+/year
Deterministic-rule-based identity stitching bundled. Probabilistic matching requires custom configuration plus ML expertise.
Enterprise customer 360 (Salesforce Customer 360, Adobe Real-Time CDP, Oracle Unity, Treasure Data)
$30,000-$500,000+/year
Identity resolution included but deeply tied to their platform. Over-built for multi-location-operator scale.
MDM platforms (Informatica MDM, Reltio, Stibo Systems, Profisee, TIBCO EBX)
$50,000-$500,000+/year
Customer plus product MDM at enterprise scale. 6-12 month implementation. Not optimized for marketing-operator identity.
Identity-graph data brokers (LiveRamp IdentityLink, Tapad Device Graph, Experian Mosaic, Oracle Data Cloud)
$30,000-$300,000+/year
Broker-maintained identity graphs. Operator brings their data and gets resolved-to-broker-graph results. Privacy-regime compatibility varies.
DIY (custom SQL deduplication + fuzzy matching + ML probabilistic matching)
$80,000-$150,000/year per data-engineer FTE
Requires ML expertise for probabilistic matching. API drift maintenance consumes ~1/3 FTE time.
What changes when this is an agent skill
The Completions identity-resolution skill combines deterministic-first + probabilistic-fallback matching at multi-location-operator scale ($2,000-$10,000/month target band).
Deterministic matches (email, phone, login ID, loyalty ID, POS transaction key, payment method) resolve first with high confidence (1.0). Probabilistic matches (device fingerprint, browser ID, behavioral signature) receive calibrated confidence scores (0.0 to 1.0). Operator-defined thresholds determine which matches merge into the canonical graph; borderline matches route to borderline-routing review.
Per-brand / per-vertical / per-jurisdiction privacy constraints compose with per-vertical-compliance-overlay (loop 001) and per-jurisdiction-overlay-config. CPRA / CCPA / GDPR data-subject-access-request response surfaces every identifier linked to a subject; right-to-deletion propagates across the graph.
Multi-brand portfolios get corporate base graph with per-brand sub-graphs; cross-brand resolution gated by privacy policies and operator-defined consent. Behavioral-cohort-computation, customer-change-event-emission, ltv-math-primitives, churn-prediction, and save-flow-propensity-scoring all consume the resolved customer graph as their canonical input.
Agents that include this skill
Skills live inside agent rentals. To get this skill in production, hire any of the agents below — context-tuning at onboarding is included in the first month.
Customer Data Graph Foundation Agent
Resolves DTC subscriber identity, computes LTV math, and emits the canonical customer-data-graph downstream subscription agents consume.
Early-adopter
$2,500–$4,500/mo
FAQ
- What is identity resolution software?
- Software that resolves cross-device, cross-channel, cross-location customer identifiers into one canonical customer record. This skill combines deterministic-first matching (email, phone, login) with probabilistic-fallback (device fingerprint, behavioral signature) and exposes the resolved graph as a runtime signal for every AI content and decisioning agent.
- How is this different from LiveRamp or TransUnion Neustar?
- Identity-resolution specialists are built for ad-tech identity resolution at enterprise scale ($50,000-$500,000+/year, 6-9 month integration). This skill is purpose-built for multi-location-operator scale ($2,000-$10,000/month).
- How is this different from CDP-bundled identity (Segment, Tealium, mParticle)?
- CDPs bundle deterministic-rule-based identity stitching; probabilistic matching requires custom configuration plus ML expertise. This skill combines deterministic + probabilistic with calibrated confidence scoring out of the box.
- How is this different from Salesforce Customer 360 or Adobe Real-Time CDP?
- Enterprise customer 360 platforms include identity resolution at $30,000-$500,000+/year. This skill is purpose-built for multi-location-operator scale.
- What identifiers does the skill resolve across?
- Email, phone, login ID, loyalty ID, POS transaction key, browser cookie, device ID, behavioral fingerprint, payment method, address (with privacy controls).
- How are deterministic and probabilistic matches calibrated?
- Deterministic matches receive high confidence (1.0). Probabilistic matches receive calibrated confidence scores (0.0 to 1.0). Operator-defined thresholds determine which matches merge into the canonical graph; borderline matches route to borderline-routing review.
- How does this compose with CPRA / CCPA / GDPR / per-state cannabis privacy constraints?
- Per-jurisdiction-overlay-config gates resolution per applicable privacy regime. Data-subject-access-request response surfaces every identifier linked to a subject; right-to-deletion propagates across the canonical graph.