Completions

For CDP architects + customer-data leadership + identity engineers

LiveRamp resolves identity across the broader publisher ecosystem. The first-party algorithm running across your POS, your booking system, and your loyalty platform is your wiring.

Third-party cookies are deprecated. iOS 14 ATT limited cross-app tracking. The probabilistic device- fingerprinting that worked five years ago has degraded. The first-party deterministic-plus-probabilistic resolution algorithm running on operator-owned signals with confidence-weighted merge decisioning + identity- conflict reconciliation + per-jurisdiction PII gating is what now produces the unified customer record at multi-location scale.

By Jay Christopher11 min read

What this gets you

  • Two-layer resolution algorithm— deterministic on explicit identifiers (email + phone + loyalty ID + payment instrument + hashed identifier with matching salt) at near-100 percent confidence; probabilistic on inferred signals (device + IP + behavior + geo + temporal) with confidence-varying scoring.
  • Confidence-weighted merge decisioning— high-confidence matches auto-merge; borderline matches route to analyst review with signal evidence surfaced; low-confidence matches stay separate. Reviewer decisions feed confidence- threshold tuning per signal type per cycle.
  • Identity-conflict reconciliation— when two confident matches disagree (signal stream A says identity-1 matches identity-2; stream B says identity-1 matches identity-3), the conflict- resolution layer evaluates strength and produces a winning merge with the losing match audit-logged.
  • Post-cookie + iOS 14 ATT first-party-data strategy — increased weight on deterministic identifiers, expanded first-party-data collection via offline POS plus call-tracking plus walk-in foot-traffic plus loyalty, probabilistic resolution focused on intra-session and intra-device signals where deprecation has less impact.
  • Per-jurisdiction PII policy gating + per-vertical reconciliation policies — GDPR consent state + CCPA opt-out + HIPAA BAA coverage + cannabis-state PHI restrictions gate which identifiers participate in matching; healthcare conflicts route more conservatively; financial services require explicit reviewer approval for multi-PII-anchor merges.

The third-party identity graph stopped working three years ago

A 200-location operator wired LiveRamp into the CDP in 2019 to resolve identity across publishers and ad platforms. The third-party identity graph linked the operator first-party email list to the third-party cross-publisher identifier set. The operator could target paid-media audiences against the resolved identity. The probabilistic device-fingerprinting layer ran underneath LiveRamp and produced ~40-50 percent additional cross-device matches against the pure-deterministic baseline.

Three years later the world changed. Third-party cookies are deprecated. iOS 14 App Tracking Transparency limited cross-app tracking. The cross- publisher third-party graph that LiveRamp maintains has thinned because the upstream identity signals fed into the graph have thinned. Probabilistic device- fingerprinting that worked at 40-50 percent additional match rate has dropped to 10-15 percent. The operator addressable audience has shrunk. The cross-channel attribution that depended on the resolved identity has degraded.

The new strategy is first-party-first. Increase the weight on deterministic identifiers — encourage more frequent login in the customer experience, push loyalty signup at every touchpoint, capture phone and email at booking, link loyalty to payment- instrument fingerprint at POS. Expand the deterministic anchor set across offline channels. Re-tune probabilistic resolution to focus on intra- session and intra-device signals where cookie deprecation has less impact. Run the first-party identity-resolution algorithm on the operator-owned substrate the customer-graph agent maintains.

The Resolve axis on the customer-graph agent runs the algorithm. Deterministic layer first — match on email + phone + loyalty ID + payment-instrument fingerprint + hashed identifier with matching salt at near-100 percent confidence. Probabilistic layer second — match on device + IP + behavior + geo + temporal with confidence-varying scoring. Confidence- weighted merge decisioning produces the canonical customer record across every device, every channel, every location.

What is in market — and what each category leaves to you

The third-party identity-graph layer is thinning. The first-party operator-owned resolution algorithm at multi-location scale is architecture.

Third-party identity graphs — LiveRamp, Acxiom, Throtle, Tapad (Experian), Neustar

Excellent at cross-publisher identity linking across the broader consumer ecosystem. Match rate has thinned with cookie deprecation and ATT. The third-party graph remains valuable for paid-media audience targeting and look-alike modeling; it is no longer sufficient on its own for resolving customer identity across the operator-owned data estate.

CDP identity layers — Adobe Identity, Treasure Data, Tealium AudienceStream, Salesforce CDP, mParticle, Segment, BlueConic

Strong at resolving identity within the operator data estate using the platform identity-graph primitive. Deterministic-plus-probabilistic algorithm tuning per identifier mix, confidence- weighted merge decisioning, identity-conflict reconciliation, and per-vertical regulatory reconciliation policies are operator-side architecture above the CDP identity primitive.

First-party identity specialists — Habu, Hightouch

Strong at first-party resolution and reverse-ETL identity-graph projection to downstream destinations. The customer-graph-integrated algorithm running on the operator substrate with the per-vertical regulatory reconciliation policies sits at the customer-graph layer.

Open-source identity-resolution — PostHog (group-aware identity), Snowplow, RudderStack

Capable foundations for operators with in-house data-engineering capacity. The two-layer deterministic-plus-probabilistic algorithm implementation, the confidence-weighted merge decisioning, the identity-conflict reconciliation, and the per-jurisdiction PII policy engine are the operator-side build above the open-source layer.

The CRM-deduplication script the data team runs every Saturday

The status quo at most multi-location operators below the enterprise-CDP tier. The data team runs a SQL script every Saturday that merges CRM contacts matching on email or phone exactly. The script handles deterministic matches. Probabilistic matches stay as separate contacts. Cross-device identity stays unresolved. The data team treats it as good enough until the next attribution audit surfaces the gap.

The pipeline, end to end

  1. Position in the 7-axis customer-graph pipeline. Ingest (behavioral-signal-ingestion) + Resolve (this skill) + Version (versioned-customer-history) + Compute-LTV (ltv-math-primitives) + Cohort (behavioral-cohort-computation) + Subscription-Ingest + Emit-Change (customer-change-event-emission). The Resolve axis reads from Ingest and writes to Version-and-downstream.
  2. Deterministic resolution layer. Match on explicit shared identifiers — email + phone + loyalty ID + payment-instrument fingerprint + hashed identifier with matching salt + government ID hash. Match confidence near-100 percent when identifiers match. Deterministic layer handles high-confidence cases first to constrain the probabilistic search space.
  3. Probabilistic resolution layer. Match on inferred signals — device fingerprint (canvas + audio + WebGL + user-agent + IP), behavioral pattern (page-sequence + dwell-time + interaction-pattern), location proximity (geo + time-window), temporal sequence (events fitting plausible customer-journey timing), household-level aggregation (shared IP plus shared device plus shared address signal).
  4. Confidence-weighted merge decisioning. Every candidate match carries a confidence score. High-confidence (deterministic match plus consistent probabilistic signals) auto-merge. Borderline (deterministic match but conflicting probabilistic signals, or strong probabilistic match without deterministic anchor) route to analyst-review queue with signal evidence surfaced. Low-confidence stay separate until additional signal arrives.
  5. Identity-conflict reconciliation. When two confident matches disagree (signal stream A says identity-1 matches identity-2; stream B says identity-1 matches identity-3), conflict-resolution layer evaluates strength (signal count + signal recency + signal type weight) and produces a winning merge with the losing match logged for audit. Persistent conflicts route to analyst review.
  6. Per-jurisdiction PII policy gating. GDPR consent state + CCPA opt-out + HIPAA BAA coverage + cannabis-state PHI restrictions gate which identifiers participate in matching. PHI under HIPAA cannot match cross-channel without explicit BAA-covered consent. The policy engine shares substrate with the marketing-compliance cross-agent overlay.
  7. Per-vertical reconciliation policies. Healthcare conflicts under HIPAA route more conservatively than non-regulated conflicts. Financial-services under FINRA require explicit reviewer approval for any merge involving multiple PII anchors. Cannabis state-by-state policies vary per state. Per-vertical policies tune the threshold and the escalation requirement per cycle.
  8. Post-cookie + iOS 14 ATT first-party strategy. Deterministic identifier weights increase. First- party-data collection expands through offline POS + call-tracking + walk-in foot-traffic + booking events + loyalty interactions. Probabilistic resolution focuses on intra-session and intra-device signals where cookie deprecation has less impact. The post-cookie architecture shift reshapes the algorithm rather than tunes its parameters.
  9. Reviewer decision feedback loop. Analyst-review queue decisions feed the confidence- threshold tuning per signal type per cycle. Decisions that consistently approve borderline matches loosen the auto-merge threshold; decisions that consistently reject borderline matches tighten it. The system self-improves at where the threshold should sit for each operator data mix.
  10. Match-rate, precision, and recall measurement. Match rate (events resolving to a known identity vs total events). Precision (merges that prove correct against periodic ground-truth audit). Recall (true matches the pipeline found vs estimated total true matches). Per-vertical metrics tracked separately for regulated-vertical reconciliation tuning.
  11. Right-to-deletion propagation. Deletion events propagate through the customer- data-orchestration emit-change pipeline. Affected identities tombstone in the version store. Subsequent resolution skips the tombstoned identity. The tombstone propagates to downstream consumers so they stop targeting or referencing the deleted identity.
  12. Cross-arc integration. The Resolve axis feeds the Version axis (versioned- history substrate), the Compute-LTV axis (LTV computed on resolved customer record), the Cohort axis (behavioral cohorts computed on resolved customer record), and the Emit-Change axis (changes to the resolved record broadcast to downstream consumers). The seven-axis customer-graph pipeline operates on the resolved-identity substrate the Resolve axis produces.
  13. ROI measurement. Match-rate trend per cycle. Cross-device attribution accuracy pre vs post deployment. CRM-duplicate rate pre vs post (target dropping). Marketing attribution accuracy on multi-touch journeys against ground- truth audit. Per-vertical attribution-completeness metrics. Signal feeds threshold-and-policy tuning per cycle.

Frequently asked

What is identity resolution software?

Identity resolution software links the multiple identifiers a single customer touches (anonymous device cookies, login emails, hashed phone numbers, loyalty IDs, payment-instrument fingerprints, household-level aggregations) into one canonical customer record. The category covers third-party-graph providers (LiveRamp, Acxiom, Throtle, Tapad now Experian, Neustar) that maintain cross-publisher identity graphs of the broader consumer ecosystem, enterprise CDPs with identity layers (Adobe Identity, Treasure Data, Tealium AudienceStream, Salesforce CDP, mParticle, Segment, BlueConic), and identity-resolution specialists with first-party data emphasis (Habu, Hightouch). The operator-owned first-party resolution algorithm that runs deterministic-plus-probabilistic matching with confidence-weighted merge decisioning on the customer-graph substrate at multi-location scale is operator-side architecture.

What is the difference between deterministic and probabilistic identity resolution?

Deterministic resolution matches on explicit shared identifiers — same email address, same phone number, same loyalty ID, same hashed identifier with matching salt, same payment-instrument fingerprint. Match confidence runs near-100 percent when identifiers match because the match relies on an unambiguous shared key. Probabilistic resolution matches on inferred signals — device fingerprint (canvas plus audio plus WebGL plus user-agent plus IP), behavioral pattern (page-sequence plus dwell-time plus interaction-pattern), location proximity (geo plus time-window), temporal sequence (events that fit a plausible customer-journey timing), household-level aggregation. Match confidence varies per signal mix. Real-world multi-location identity resolution needs both layers running in sequence.

How is this different from LiveRamp, Acxiom, Throtle, Adobe Identity, Treasure Data, or Tealium AudienceStream?

LiveRamp, Acxiom, Throtle, Tapad/Experian, Neustar ship third-party identity graphs that link identifiers across the broader consumer-data ecosystem (publishers, broadcasters, retailers, ad platforms). They are excellent at cross-publisher resolution. Adobe Identity, Treasure Data, Tealium AudienceStream, Salesforce CDP, mParticle, Segment, BlueConic ship CDP identity layers that resolve within the operator data estate. The operator-owned first-party resolution algorithm that runs deterministic-plus-probabilistic matching tuned per identifier mix, the confidence-weighted merge decisioning that handles borderline matches with analyst-review escalation, the identity-conflict reconciliation when two confident matches disagree, the per-jurisdiction PII policy that gates which identifiers can be used in matching under HIPAA-plus-cannabis-state-plus-GDPR, and the integration with the broader 7-axis customer-graph pipeline are operator-side architecture on top of the platform layer.

How does confidence-weighted merge decisioning work?

Every candidate match between two identity records carries a confidence score from the resolution algorithm. High-confidence matches (deterministic identifier match plus consistent probabilistic signals) auto-merge into the canonical customer record. Borderline matches (deterministic match but conflicting probabilistic signals, or strong probabilistic match without deterministic anchor) route to an analyst-review queue with the candidate match plus the signal evidence surfaced. Low-confidence matches stay separate until additional signal arrives. Reviewer decisions feed the confidence-threshold tuning per signal type per cycle. The system self-improves at where the auto-merge threshold should sit for each operator data mix.

How do you handle the post-cookie + iOS 14 ATT first-party-data strategy?

Third-party cookies are deprecated and iOS 14 App Tracking Transparency limits cross-app tracking. Probabilistic resolution that relied heavily on device fingerprinting has degraded coverage. The strategy compensates three ways. First, increase the weight on deterministic identifiers (logged-in email plus phone plus loyalty ID) by encouraging more frequent login in the customer experience. Second, expand first-party-data collection (POS receipts plus call-tracking plus walk-in foot-traffic plus booking events plus loyalty interactions) to provide deterministic anchors in offline channels. Third, focus probabilistic resolution on intra-session and intra-device signals where cookie deprecation has less impact. The post-cookie strategy is a substantive architecture shift, not a configuration tweak.

How do you handle identity-conflict reconciliation when two confident matches disagree?

A single customer can produce conflicting identity matches when one signal stream says identity-A matches identity-B and another stream says identity-A matches identity-C. The conflict-resolution layer evaluates the strength of each match (signal count + signal recency + signal type weight) and produces a winning merge with the losing match logged for audit. Persistent conflicts that resist the algorithm route to analyst review with the conflict evidence surfaced. Per-vertical reconciliation policies tune the weights — healthcare conflicts under HIPAA route more conservatively than non-regulated conflicts; financial-services under FINRA require explicit reviewer approval for any merge involving multiple PII anchors.

Hire the agent that runs the Resolve axis

The customer-data-graph agent owns the 7-axis pipeline — Ingest + Resolve + Version + Compute-LTV + Cohort + Subscription-Ingest + Emit-Change — sitting on top of whichever third-party identity graph (LiveRamp, Acxiom, Throtle, Tapad, Neustar), enterprise CDP (Adobe Identity, Treasure Data, Tealium AudienceStream, Salesforce CDP, mParticle, Segment, BlueConic), or open-source pipeline (PostHog, Snowplow, RudderStack) you license downstream. Two-layer deterministic-plus-probabilistic algorithm, confidence- weighted merge decisioning, identity-conflict reconciliation, per-jurisdiction PII policy gating, per-vertical reconciliation policies, post-cookie first-party-data strategy.

We scope on the call and send a private checkout link after.

Related reading: Behavioral-signal ingestion · Customer journey tracking · DSAR + versioned history