Message A/B tests

Experiment with up to 26 message variations to determine audience engagement.

View as Markdown

About A/B tests for messages

Create variants of message content by duplicating the initial variant or by starting from scratch. Each variant returns analytic data to help you determine the most effective way to engage your audience. Message A/B tests can include a control group, and you can adjust audience allocation across the message variants and control group.

Set up the test in two steps:

  1. Create two or more message variants — Just like in the Message composer, for each variant, select channels, configure content for each channel, and set up delivery.

  2. Allocate an audience — You can designate all users as eligible for the test or target specific users. Of that group, set the percentage that will participate in the test. Audience members are randomly selected.

    You can also include a control group, which is the portion of your audience that doesn’t receive messages. It’s disabled by default, and you can enable it when setting the test audience. When enabled, the control group appears alongside variants in the By Channel view of your test results.

    The overall audience percentage is automatically divided evenly between variants and the control group, but you can set your own values.

After creating variants and setting the audience, you can start the test and review its results.

To have Airship optimize your experiment in real time and maximize conversions, use an Intelligent Rollout instead.

When running a message experiment and a Holdout ExperimentMeasures the effects of excluding a group of audience members from all messages or messages with specific Campaign Categories. You can compare the performance of the two audience groups in reports for selected goal events. simultaneously, Airship prevents holdout group users from being included in the message experiment. This eliminates potentially skewed data in cases where there are overlapping experimentation audiences. It also ensures that the most critical experiments maintain integrity.

To prepare for your tests, see About A/B testing.

Supported channels and measuring engagement

These channels and message types are supported for message A/B tests:

  • App — Push notifications, in-app messages, and Message Center
  • Web
  • Email
  • SMS
  • Open channel

Airship uses the following engagement signals to determine the top-performing variant:

  • Push — Direct clicks on a push message
  • Message Center — Message reads
  • Email — Clicks on any link in the email, excluding unsubscribe links
  • SMS — Clicks on a link in the message

Create a message A/B test

First, select the Create dropdown menu (), then A/B Test. Or you can start from your list of all message experiments by going to Experiments, then Message Experiments, selecting Add experiment, and then A/B Test.

Next, select the test name and change it to something descriptive, then select the check mark to save it.

To finish setting up your test, you must add message variants and determine the audience. You can configure them in any order.

Tip

You can also create a message experiment from the Message composer, with the message as the first variant. In the Review step, select Create Experiment, then A/B Test or Intelligent Rollout.

Add message variants

You can add up to 26 variants to an A/B test:

  1. Select Add variant. After completing a step, select the next step in the header to move on.

  2. For Channels:

    First, select a Channel CoordinationA group of strategies for targeting users where they are most likely to receive your message. strategy:

    • Fan Out targets a Named User on all the channels they are opted in to, maximizing the chances they receive your message.
    • Last Active targets a Named User on the opted-in channel they used most recently.
    • Priority Channel targets a Named User on the first channel they are opted in to, in the priority order you set.

    Then, enable the channel types to include in your audience. For Mobile Apps, also select from the available platforms. For Priority Channel, also drag the channel types into priority order.

    Note

    For projects using the channel-level segmentation system, instead of Channel Coordination, enable the channels you want to send the message to.

    Use Channel conditions to filter which channels are included in the audience. A channel must meet the conditions to remain in the audience.

    For example, if your audience includes users with app, email, and SMS channels, and you set a channel condition requiring membership in an email Subscription List:

    • Only email channels that meet that condition would remain in the audience.
    • All app and SMS channels would be excluded.

    To set channel conditions, use the same process as when building a SegmentA reusable audience group you create by selecting unique or shared user data.. You can use the following data in your conditions:

    Selected Lifecycle, Subscription, and Uploaded Lists must contain Channel IDs or Named Users as the identifier, not a mix of the two.

    Note

    Setting channel conditions is not supported for projects using the channel-level segmentation system.

    Under Localization, enable the option if you want to provide different content to app and web users depending on their language and country.

  3. For Content, configure the message content per enabled channel. See the Content documentation per message type, Content options, and Localization.

  4. For Delivery, configure the message delivery timing and options. See Message delivery.

  5. In the Review step, review the device preview and message summary:

    When your review is complete, select Save Variant.

To add another variant from scratch, select Add variant. To duplicate an existing variant, select the more menu icon () at the end of a row and select Copy to variant.

Set the test audience

After creating an A/B test, select Audience and then set up your test audience:

  1. Choose and configure users:
    OptionDescriptionSteps
    All UsersThis option makes the test available to a percentage of your total audience.n/a
    Target Specific UsersThis option makes the test available to a percentage of users who meet specified conditions.Select and configure one or more conditions. Use the same process as when building a SegmentA reusable audience group you create by selecting unique or shared user data..
  2. (Optional) Under Audience allocation, limit the selected audience to your specified percentage.
  3. (Optional) Enable Control group.
  4. (Optional) To override the default variant distribution, enable Allow uneven allocations and then edit the percentage for each variant and the control group.
    Note

    If you later add more variants, also update your variant allocation settings.

  5. Select Save.

Start an A/B test

Once you’ve created your message variants and set the audience for your test, select Start and confirm. Airship then sends each variant according to its delivery settings.

After the test starts, an Overview section shows the test status, audience summary, and the Probability of Being the Best Variant chart from the Performance view.

View results

After starting an A/B test, discover which variant performed best and use the test- and message-level reports to determine the quality of each variant and strategies for increasing engagement.

To access test results, go to Experiments, then Message Experiments, select the more menu icon () for a test in the list, then View results. You can also select the name of a test from the list and then go to Results.

See also Implementing A/B tests, outcomes, and compliance in About A/B testing.

Note

Engagement data is sent to Airship as soon as it becomes available. Data may be delayed due to connectivity issues with a user’s carrier, Wi-Fi, power, etc. Wait at least 12 to 24 hours before acting on the data to allow for potential lags.

Performance

The Performance view is the initial state of test results. Its first component is a summary explaining the winning variant or if the results were inconclusive.

Airship declares a winning variant only after all of the following are true:

  • All variants in the test have been sent.
  • At least 90 minutes have passed since the last variant send.
  • The test reached at least 1,000 participants.
  • The winning variant’s Probability to Be Best (PTBB) is 95% or higher.
  • The winning variant’s expected loss is less than 5%.

Following the summary, a table lists this data for each variant:

DataDescription
Probability to Be BestThis is the likelihood that the variant is the top-performing option. Values of 95% or above indicate high confidence.
Expected LossThis is the potential risk of choosing a variant that is not actually the best option. Values below 5% indicate minimal risk.
Conversion Rate (vs Top)This is the percentage of users who converted and how that rate compares to the top-performing variant.
Conversion CountThis is the total number of users who converted within the variant group.
AudienceThis is the number and percentage of contacts assigned to the variant.

Select a variant name to open its message report.

Below the table, the following charts visualize variant performance:

  • Probability of Being the Best Variant — This chart shows the probability that each variant is the top performer in your experiment. Taller bars indicate a higher likelihood of success, and a variant typically needs to reach 95% or higher to be declared the winner. If the bars are roughly the same height, the variants are performing similarly.

  • Conversion Lift Compared to Others — This chart shows the potential uplift of each variant compared to the top performer, helping you distinguish meaningful improvements from minor fluctuations.

    • 0% uplift line — This line represents no difference between variants. Curves to the right indicate improvement. Curves to the left indicate decline.
    • Distribution spread — A narrow, tall curve indicates high confidence, while a wider curve suggests more uncertainty.
    • Bulk mass — If the majority of a variant’s curve lies above zero, the variant is highly likely to outperform the others.
  • Likelihood of Variant Success — This chart shows the estimated conversion rate for each variant as a curve.

    • Height — The higher the peak, the more likely that conversion rate represents the variant’s true performance.
    • Width — A narrow curve indicates high confidence, while a wider curve suggests more uncertainty.
    • Overlap — Significant overlap means the variants are performing similarly. Minimal overlap indicates a clearer performance gap between them.

By channel

Select By Channel to see how variant performance differed across each channel in the test. Select a variant name to open its message report.

Export results

Export options for experiment results vary by view:

RTDS events

Messages used as variants in an A/B test include experiment information in Real-Time Data StreamingA service that delivers user-level events in real time to your backend or third-party systems using the Data Streaming API. events.

The Send event includes an experiments object with the test details, including experiment_id, type, and variant_id. The experiment_id also appears in the body object.

The Control event includes an experiment_id at the top level and also in the body object.

Managing A/B tests

Go to Experiments, then Message Experiments to view and manage your message A/B tests. You can filter the list by experiment type and archive status. Each experiment is listed by name with its status and the date it was last modified. Your last modified experiment is listed first, and you can search by experiment name.

You can perform the following actions from the list:

OptionDescriptionSteps
ViewOpen the test to access its message variants, audience configuration, and results.Select its name. Or select its more menu icon () and then Edit.
DuplicateMake a draft copy of a test with its message variants and audience configuration.Select a test’s more menu icon () and then Duplicate.
View resultsOpen the test’s performance reports.Select its more menu icon () and then View results.
ArchiveStops and archives the test. Archiving a started/active test also cancels its undelivered messages.Select a test’s more menu icon () and then Archive.

Test and variant statuses

In your list of all message experiments, A/B tests display the following statuses:

StatusDescription
DraftThe test has not been started and can still be edited.
StartedOne or more variants are in progress or scheduled. You can edit the test name, targeted audience, and messages that have not yet been sent.
Action RequiredThe test has been started, but at least one variant failed to send. Select Resume Experiment to retry failed variants.
CompletedAll variants have been sent.

Within a test, the Variants list displays the following statuses:

StatusDescription
DraftThe variant has not yet been saved.
ReadyThe variant meets all requirements for sending and has been saved.
ScheduledThe variant is queued to send according to its delivery settings.
ActiveThe recurring variant is sending according to its delivery settings.
SentThe variant was sent.
FailedThe variant failed to send. See the test status Action Required.

Editing message variants and audience

You can edit variants and audience settings for any test with Draft status. After opening a test from the Message Experiments list, select the more menu icon () for a variant and select an option:

OptionDescription
EditModify the variant’s channels, content, or delivery settings.
DuplicateCreate a copy of the variant as a starting point for a new variant.
DeleteRemove the variant from the test.

To modify the test audience, select Audience and adjust targeting or allocation settings. See Set the test audience for configuration details.

For tests with Started status, you can manage variants configured for recurring delivery from the more menu icon ():

OptionDescription
PauseTemporarily stop sending the variant. Select Resume to continue sending.
StopCancel future sends of the variant. You cannot resume a stopped variant.