Most marketing teams rely on attribution models to understand which channels and campaigns drive revenue. But attribution alone leaves a critical question unanswered: would those conversions have happened anyway, even without the ad?
Think about it this way. A customer clicks a retargeting ad on Monday and buys on Thursday. Your attribution model lights up and credits that retargeting campaign. But what if that customer had already decided to buy before they saw the ad? What if they were heading to your website regardless, and the retargeting ad was just along for the ride?
This is the fundamental blind spot in attribution, and it is more common than most teams realize. Attribution models are excellent at mapping the customer journey and assigning credit to touchpoints. What they cannot do is distinguish between conversions that happened because of an ad and conversions that would have happened anyway, with or without the ad.
Incrementality testing solves this problem. Borrowed from the logic of randomized controlled trials, incrementality testing measures the true causal lift your marketing creates by comparing a group that sees your ads against a holdout group that does not. The difference in conversion rates between those two groups is your incremental lift: the conversions that genuinely would not have happened without your marketing.
When you combine incrementality testing with attribution data, you build a framework that tells you not just which touchpoints a customer interacted with, but which ones actually changed their behavior. That combination is where confident, efficient budget decisions come from.
This guide walks you through the full process, from forming a testable hypothesis to reading results and feeding insights back into your attribution models. Whether you are running campaigns across Meta, Google, TikTok, or multiple platforms simultaneously, this process will help you move beyond correlation and start measuring true causation in your marketing.
Before you design a single test, you need a clear picture of what you are actually measuring and why it matters. Incrementality is not a replacement for attribution. It is the missing piece that validates whether your attribution data reflects reality.
Here is the core definition: incrementality measures the conversions that would not have occurred without a specific marketing action. If 100 people in your test group converted while an equally sized holdout group produced 80 conversions, your ads drove 20 incremental conversions. Those 20 are the ones you actually caused.
Standard attribution models, whether last-click, first-click, multi-touch, or data-driven, are built to assign credit. They look at the touchpoints a customer interacted with before converting and distribute credit among them. The problem is that credit assignment is not the same as causal proof. Understanding types of attribution models in digital marketing is essential context for recognizing where each model's blind spots lie.
Consider a few scenarios where attribution commonly misleads:
Branded search cannibalizing organic: A customer types your brand name into Google, clicks a paid branded search ad, and converts. Your attribution model credits the paid search campaign. But that customer was already searching for your brand by name. They likely would have clicked the organic result if the ad had not been there. The ad did not create intent; it just intercepted it.
Retargeting claiming credit for already-decided buyers: A customer visits your pricing page, leaves, sees a retargeting ad, and then comes back to purchase. The retargeting ad looks like the hero in your attribution report. But many of these customers were already deep in their decision process. The ad showed up at a convenient moment, not a pivotal one.
Last-click overvaluing bottom-funnel: Last-click attribution consistently rewards the final touchpoint before conversion, which is often a direct visit or branded search. This creates a distorted picture that undervalues the awareness and consideration campaigns that actually moved the customer through the funnel.
Incrementality testing and attribution are complementary. Attribution maps the customer journey and shows you the sequence of touchpoints. Incrementality validates which parts of that journey truly changed the outcome. Together, they give you the full picture: what happened and whether it mattered. This is precisely why understanding the importance of attribution models goes hand in hand with learning to validate them through experimentation.
You cannot test everything at once, and you should not try to. The goal of your first incrementality test is to generate a clear, actionable insight about a specific channel or campaign. Scope matters here.
Start with the channel or campaign where you suspect the biggest gap between attributed credit and true incremental impact. Retargeting and branded search are two of the most common culprits, for the reasons described above. If your retargeting campaigns show strong attributed ROAS but you have a nagging sense that those customers were already on their way to converting, that is a strong candidate for your first test.
Use your attribution data to surface candidates. Look for channels that consistently show high attributed conversion volume but where you have low confidence in actual incremental value. Ask yourself: if I paused this channel entirely for a month, would total revenue actually drop, or would those customers just convert through a different path? If your attribution data doesn't match across platforms, that discrepancy itself is a signal pointing you toward the right test candidate.
Prioritize based on spend volume. Testing a channel that represents a meaningful portion of your total ad budget will yield the most actionable result. If the test reveals low incrementality, you have an opportunity to reallocate significant budget. If it confirms strong incrementality, you have data-backed justification to scale.
A few practical selection criteria to guide your decision:
High spend, uncertain value: Any channel where you are spending heavily but relying primarily on attributed conversions to justify the investment is a priority for testing.
High overlap with organic behavior: Channels that target people who are already familiar with your brand or who are actively searching for you are more likely to show low incrementality.
Sufficient conversion volume: Your test needs enough conversions to produce statistically meaningful results. A channel that generates only a handful of conversions per week will take a very long time to reach significance.
Keep your first test focused on a single variable. Do not attempt to test multiple channels simultaneously in your first experiment. Isolating one variable is what makes the results interpretable.
The design of your experiment determines the quality of your results. A well-structured holdout test is straightforward in concept but requires careful execution.
The core structure is simple: a test group that sees your ads and a control group (the holdout) that does not. Both groups should be as identical as possible in every other way. The only variable that differs between them is exposure to your ads. Any difference in conversion rates between the two groups at the end of the test is your incremental lift.
There are two primary approaches to structuring your holdout:
Geographic holdouts: You pause ads in specific markets, regions, or DMAs while continuing to run them everywhere else. The paused regions become your control group. This approach works well for cross-platform tests because it removes ad exposure across all channels simultaneously in the holdout geography. The main risk is that the holdout and test markets may have different baseline characteristics, so choose markets that are as similar as possible in terms of audience composition and historical conversion rates.
Audience-based holdouts: Within a single platform like Meta or Google, you randomly split your audience so that a portion never sees the ads. This is often statistically cleaner than a geographic holdout because the randomization happens at the individual level. The limitation is that it only measures incrementality for that specific platform, not across your full media mix.
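If you manage the split yourself, outside of a platform's built-in lift study, assigning users with a deterministic hash of a stable customer ID keeps group membership consistent across sessions and devices. Here is a minimal Python sketch of that idea; the salt, holdout share, and ID format are placeholders to replace with your own:

```python
import hashlib

HOLDOUT_SHARE = 0.10  # illustrative: hold out 10% of the audience

def assign_group(user_id: str, salt: str = "retargeting-test-q3") -> str:
    """Deterministically assign a user to 'holdout' or 'test'.

    Hashing the ID with a test-specific salt keeps the assignment stable
    across sessions and lets you rerun it later to audit group membership.
    """
    digest = hashlib.sha256(f"{salt}:{user_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0xFFFFFFFF  # map the hash to [0, 1]
    return "holdout" if bucket < HOLDOUT_SHARE else "test"

# Build an exclusion list of holdout users to upload to the ad platform
users = ["u_1001", "u_1002", "u_1003"]
holdout_list = [u for u in users if assign_group(u) == "holdout"]
```

Because the assignment depends only on the ID and the salt, you can regenerate the holdout list at analysis time and be confident you are comparing the same two groups you created at launch.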
Sizing your groups correctly is critical. The holdout needs to be large enough to produce statistically meaningful results, but not so large that you sacrifice significant revenue during the test period. A holdout of 10 to 20 percent of your target audience is a common starting point, though the right size depends on your conversion volume and the lift you expect to detect.
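If you want a ballpark figure before committing to a holdout size, a standard power calculation for comparing two proportions gets you there. Here is a sketch using Python and statsmodels; the 3% baseline rate and the 10% minimum detectable lift are assumptions to swap for your own numbers:

```python
from statsmodels.stats.proportion import proportion_effectsize
from statsmodels.stats.power import NormalIndPower

baseline_cr = 0.03     # assumed control-group conversion rate
min_lift = 0.10        # smallest relative lift worth detecting (10%)
test_cr = baseline_cr * (1 + min_lift)

# Cohen's h effect size for the difference between two proportions
effect = proportion_effectsize(test_cr, baseline_cr)

# Users needed per group for 80% power at a 95% confidence level
n_per_group = NormalIndPower().solve_power(
    effect_size=effect, alpha=0.05, power=0.80, ratio=1.0
)
print(f"Roughly {n_per_group:,.0f} users per group")
```

The smaller the lift you need to detect, the larger the groups you need, which is why low-volume channels take so long to reach significance.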
Set a clear test duration before you start and commit to it. For most campaigns, two to four weeks is a reasonable window, though this depends on your conversion cycle. Understanding your attribution window performance helps you determine the right test length, since your conversion cycle directly dictates how long you need to observe results.
Document your hypothesis in writing before launching. For example: "We believe our Meta retargeting campaigns drive incremental lift beyond what organic and direct traffic would capture, and we expect to see a meaningful difference in conversion rates between the test and control groups." Writing this down forces clarity and prevents you from adjusting your interpretation of results after the fact.
A well-designed experiment with poor data is worthless. Before you launch your incrementality test, you need to verify that your measurement infrastructure is solid enough to trust the results.
Start with your attribution platform. Verify that every touchpoint is being captured accurately across the channels involved in your test. Gaps in tracking will create noise in both your test and control measurements, making it impossible to calculate true incremental lift with confidence. If your attribution data has known blind spots, address them before the test begins, not after. A thorough approach to solving attribution data discrepancies before launching your test will save you from unreliable results later.
Set up server-side tracking if you have not already. Browser-based tracking faces real limitations today: ad blockers, browser privacy restrictions, and Apple's App Tracking Transparency all create gaps in the data. Server-side tracking bypasses these limitations by sending conversion data directly from your server rather than relying on browser cookies and pixels. For an incrementality test, where you need precise measurement of both groups, server-side tracking is not optional. It is the foundation of accurate results.
Establish baseline conversion rates for both your test and control groups using at least two weeks of pre-test data. This gives you a reliable comparison point and helps you confirm that the two groups are behaving similarly before the test begins. If your holdout region or audience segment shows a significantly different baseline conversion rate than your test group, your results will be difficult to interpret. Knowing how to track users without third-party cookies is increasingly important for maintaining accurate baseline measurements in today's privacy-first environment.
Ensure your CRM data and ad platform data are synced so you can track full-funnel outcomes. Top-of-funnel conversions like form fills or email signups are a starting point, but the real question is which group generated more pipeline and revenue. If you can only measure surface-level conversions, you will miss the full picture of incremental impact.
This is where a platform like Cometly becomes particularly valuable. Cometly connects your ad platforms, CRM, and website data into one unified view, so your baseline and test measurements are consistent and complete. When all of your data flows through a single attribution layer, you can track every touchpoint from ad click to closed revenue, giving your incrementality test the data quality it needs to produce trustworthy results.
Once your test is live, your primary job is to resist the urge to touch it. This is harder than it sounds, especially when you are watching daily metrics and seeing fluctuations.
Do not adjust bids, budgets, or creative during the test window. Any change you make mid-test introduces a new variable that contaminates your results. If you increase budget in week two because performance looks strong, you are no longer measuring the same thing you started measuring. Commit to the test parameters you set in advance and hold to them.
Monitor for external variables that could affect results. Seasonal events, competitor promotions, major news cycles, or platform algorithm updates can all create noise. If something significant happens during your test period, document it. You may need to account for it when analyzing results, or in extreme cases, you may need to restart the test entirely.
Track both groups daily to watch for anomalies. If your control group suddenly spikes in conversions mid-test, that is a signal worth investigating. Similarly, if your test group drops sharply, check for ad delivery issues, budget pacing problems, or tracking failures. Leveraging real-time attribution tracking during your test window makes it much easier to spot these anomalies as they happen rather than discovering them after the test concludes.
If you are using geographic holdouts, watch for spillover effects. Customers in your holdout region may still be exposed to your ads through travel, VPN usage, or cross-market media coverage. Some degree of spillover is usually unavoidable, but significant spillover will compress your measured lift and make your results an underestimate of true incrementality.
The discipline of not interfering is what separates a valid experiment from a compromised one. Let the test run its course.
When your test concludes, the analysis phase begins. This is where you translate raw data into actionable insight.
Start with the core calculation. Incremental lift is calculated as: (Test Group Conversion Rate minus Control Group Conversion Rate) divided by Control Group Conversion Rate. This gives you the percentage lift your ads created above the baseline. For example, if your test group converted at 4% and your control group converted at 3%, your incremental lift is roughly 33%: your ads lifted conversions by a third above what would have happened anyway. Put another way, about a quarter of the conversions in the test group (1 of every 4 percentage points) were caused by your ads.
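Here is the same calculation expressed in code, with placeholder group sizes chosen to match the 4% and 3% rates above:

```python
# Placeholder figures: swap in your own test results
test_users, test_conversions = 50_000, 2_000        # 4.0% conversion rate
control_users, control_conversions = 50_000, 1_500  # 3.0% conversion rate

test_cr = test_conversions / test_users
control_cr = control_conversions / control_users

# Relative lift above the baseline the control group establishes
incremental_lift = (test_cr - control_cr) / control_cr          # ~0.33

# Conversions in the test group your ads actually caused
incremental_conversions = (test_cr - control_cr) * test_users   # ~500

print(f"Incremental lift: {incremental_lift:.0%}")
print(f"Incremental conversions: {incremental_conversions:,.0f}")
```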
Before you draw conclusions, determine statistical significance. A difference between groups might be real, or it might be random noise. Use a standard significance threshold, typically 90 to 95 percent confidence, before treating the result as meaningful. Many marketers run tests that are too short or use holdout groups that are too small, which leads to inconclusive results. If your test did not reach significance, you need more data, not a different interpretation of the numbers you have.
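For conversion rates, a two-proportion z-test is a common way to run that check. A minimal sketch with statsmodels, reusing the placeholder counts from the lift example above:

```python
from statsmodels.stats.proportion import proportions_ztest

# Conversions and group sizes from the test period (placeholders from above)
conversions = [2_000, 1_500]    # test, control
group_sizes = [50_000, 50_000]

z_stat, p_value = proportions_ztest(conversions, group_sizes)

# At a 95% confidence threshold, p < 0.05 suggests the lift is real
# rather than random noise; otherwise, keep collecting data
print(f"z = {z_stat:.2f}, p = {p_value:.4f}")
print("Significant at 95%" if p_value < 0.05 else "Not yet significant")
```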
Compare your incrementality results against what your attribution model credited to that channel. This comparison is the most revealing part of the analysis. If your attribution model credited a channel with 500 conversions during the test period but your incrementality analysis shows only 200 truly incremental conversions, you have a 300-conversion gap. That gap represents the model's blind spot: conversions that were attributed to the channel but would have happened regardless. This is a common scenario explored in depth when examining the dilemma of attribution in marketing.
Segment your results to find where incremental impact is strongest. Break down results by audience type, funnel stage, and creative variant. You may find that your retargeting campaign drives strong incrementality among new visitors but very little among customers who are already deep in the purchase process. That kind of segmentation insight tells you exactly where to focus spend and where to pull back.
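If your results live in a table with one row per user, a simple group-by produces this breakdown. A sketch assuming a pandas DataFrame with hypothetical segment, group, and converted columns; the tiny inline data is purely illustrative:

```python
import pandas as pd

# One row per user: segment, test/holdout assignment, and a 0/1 conversion flag.
# In practice, export this from your attribution platform instead.
df = pd.DataFrame({
    "segment":   ["new_visitor"] * 4 + ["returning"] * 4,
    "group":     ["test", "test", "holdout", "holdout"] * 2,
    "converted": [1, 1, 1, 0, 1, 1, 1, 1],
})

def lift_by_segment(results: pd.DataFrame) -> pd.DataFrame:
    """Conversion rate per segment and group, plus relative lift of test over holdout."""
    rates = (
        results.groupby(["segment", "group"])["converted"]
               .mean()
               .unstack("group")   # columns become: holdout, test
    )
    rates["lift"] = (rates["test"] - rates["holdout"]) / rates["holdout"]
    return rates.sort_values("lift", ascending=False)

print(lift_by_segment(df))
```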
Calculate your true incremental cost per acquisition and incremental ROAS. These metrics replace the surface-level numbers your attribution model produces with figures that reflect actual causal impact. Dedicated revenue attribution tracking tools can help you connect incrementality findings to actual revenue data, giving you a clearer picture of true return on ad spend.
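Both figures follow directly from the incremental numbers your test produced. A minimal sketch with placeholder inputs; tie the revenue figure to your CRM rather than to platform-reported revenue:

```python
# Placeholder inputs: swap in your own test-period figures
ad_spend = 50_000.0              # spend on the tested channel during the test
incremental_conversions = 500    # from the lift calculation above
incremental_revenue = 125_000.0  # CRM revenue tied to those conversions

incremental_cpa = ad_spend / incremental_conversions   # cost per conversion you actually caused
incremental_roas = incremental_revenue / ad_spend       # return per dollar of true causal impact

print(f"Incremental CPA:  ${incremental_cpa:,.2f}")
print(f"Incremental ROAS: {incremental_roas:.2f}x")
```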
The real value of incrementality testing is not just in the test results themselves. It is in what you do with those results to improve your ongoing attribution and budget decisions.
Use your findings to recalibrate how you weight channels in your attribution model. If your retargeting test showed only modest incremental lift, reduce the credit weight assigned to retargeting in your model. If your prospecting campaigns showed strong incrementality, increase their weight. The goal is to make your attribution model reflect causal reality, not just touchpoint presence. Knowing when to switch attribution models entirely is another decision that incrementality data can inform.
Reallocate budget based on incremental ROAS rather than attributed ROAS. Shift spend away from low-incrementality channels and toward the channels that are genuinely creating new demand. This reallocation is often where significant efficiency gains are found. Channels that look like strong performers on attributed metrics but show weak incrementality are consuming budget that could be working harder elsewhere.
Build a recurring testing calendar. Channel effectiveness changes over time due to audience saturation, competitive dynamics, and platform algorithm updates. A retargeting campaign that showed strong incrementality a year ago may show much weaker results today as your audience has grown more familiar with your brand. Running one major incrementality test per quarter keeps your attribution model calibrated and your budget decisions grounded in current data.
Use AI-powered tools to automate ongoing optimization between test cycles. Platforms like Cometly provide AI-driven recommendations that identify high-performing ads and campaigns across channels, helping you scale the efforts that are genuinely driving new revenue. Rather than waiting for your next quarterly test to make adjustments, you can act on real-time signals while maintaining the discipline of periodic controlled experiments to validate those signals.
Feed enriched conversion data back to ad platforms. When you send accurate, conversion-ready events back to Meta, Google, and other platforms, their machine learning algorithms optimize toward the audiences that truly convert, not just the ones that appear to convert in surface-level reports. Cometly's Conversion Sync capability does exactly this, closing the loop between your attribution data and the ad platform algorithms that determine who sees your ads. Better data in means better targeting out, which improves the incrementality of your campaigns over time.
The combination of recurring incrementality tests, a recalibrated attribution model, and AI-powered optimization creates a compounding advantage. Each test makes your model more accurate. A more accurate model leads to better budget decisions. Better budget decisions improve campaign performance. And improved campaign performance generates richer data for the next test cycle.
Running incrementality testing alongside your attribution setup is one of the highest-leverage investments you can make in marketing efficiency. It moves you from guessing which channels work to knowing which ones create real, measurable lift.
Here is a quick checklist to keep you on track as you build this practice into your team's workflow:
1. Identify the channel or campaign you want to test based on spend volume and attribution uncertainty.
2. Design a clean holdout experiment with properly sized control and test groups, and document your hypothesis before launching.
3. Verify your tracking setup, implement server-side tracking, and establish baseline data for both groups before the test begins.
4. Run the test for the full duration without making mid-test changes, and monitor for external variables that could contaminate results.
5. Calculate incremental lift, assess statistical significance, and compare your results against what your attribution model credited to that channel.
6. Recalibrate your attribution model weights and reallocate budget based on incremental ROAS rather than attributed ROAS.
7. Repeat quarterly to keep your data and decisions sharp as market conditions evolve.
When your attribution data and incrementality results work together, you stop wasting budget on channels that just look good on paper and start investing in the ones that actually grow your business. That clarity is what separates teams that scale efficiently from those that keep optimizing toward the wrong signals.
Ready to build this kind of precision into your marketing measurement? Get your free demo of Cometly today and see how AI-driven attribution and conversion sync can help you capture every touchpoint, validate true impact, and make every dollar of ad spend work harder.