Message A/B tests

Experiment with up to 26 message variations to determine audience engagement.

About A/B tests for messages

Create variants of message content by duplicating the initial variant or by starting from scratch. Each variant returns analytic data to help you determine the most effective way to engage your audience. Message A/B tests can include a control group, and you can adjust audience allocation across the message variants and control group.

Set up the test in two steps:

Create two or more message variants — Just like in the Message composer, for each variant, select channels, configure content for each channel, and set up delivery.
Allocate an audience — You can designate all users as eligible for the test or target specific users. Of that group, set the percentage that will participate in the test. Audience members are randomly selected.
You can also include a control group, which is the portion of your audience that doesn’t receive messages. It’s disabled by default, and you can enable it when setting the test audience. When enabled, the control group appears alongside variants in the By Channel view of your test results.
The overall audience percentage is automatically divided evenly between variants and the control group, but you can set your own values.

After creating variants and setting the audience, you can start the test and review its results.

To have Airship optimize your experiment in real time and maximize conversions, use an Intelligent Rollout instead.

When running a message experiment and a Holdout ExperimentMeasures the effects of excluding a group of audience members from all messages or messages with specific campaign categories. You can compare the performance of the two audience groups in reports for selected goal events. simultaneously, Airship prevents holdout group users from being included in the message experiment. This eliminates potentially skewed data in cases where there are overlapping experimentation audiences. It also ensures that the most critical experiments maintain integrity.

To prepare for your tests, see About A/B testing.

Supported channels and measuring engagement

These channels and message types are supported for message A/B tests:

App — Push notifications, in-app messages, and Message Center
Web
Email
SMS
Open channel

Airship uses the following engagement signals to determine the top-performing variant:

Push — Direct clicks on a push message
Message Center — Message reads
Email — Clicks on any link in the email, excluding unsubscribe links
SMS — Clicks on a link in the message

Create a message A/B test

First, select the Create dropdown menu (), then A/B Test. Or you can start from your list of all message experiments by going to Experiments, then Message Experiments, selecting Add experiment, and then A/B Test.

Next, select the test name and change it to something descriptive, then select the check mark to save it.

To finish setting up your test, you must add message variants and determine the audience. You can configure them in any order.

Tip

You can also create a message experiment from the Message composer, with the message as the first variant. In the Review step, select Create Experiment, then A/B Test or Intelligent Rollout.

Add message variants

You can add up to 26 variants to an A/B test:

Select Add variant. After completing a step, select the next step in the header to move on.
For Channels:
First, select a Channel CoordinationA group of strategies for targeting users where they are most likely to receive your message. strategy:
- Fan Out targets a named user on all the channels they are opted in to, maximizing the chances they receive your message.
- Last Active targets a named user on the opted-in channel they used most recently.
- Priority Channel targets a named user on the first channel they are opted in to, in the priority order you set.
Then, enable the channel types to include in your audience. For Mobile Apps, also select from the available platforms. For Priority Channel, also drag the channel types into priority order.
Note
For projects using the channel-level segmentation system, instead of Channel Coordination, enable the channels you want to send the message to.
Use Channel conditions to filter which channels are included in the audience. A channel must meet the conditions to remain in the audience.
For example, if your audience includes users with app, email, and SMS channels, and you set a channel condition requiring membership in an email subscription list:
- Only email channels that meet that condition would remain in the audience.
- All app and SMS channels would be excluded.
To set channel conditions, use the same process as when building a segmentA reusable audience group you create by selecting unique or shared user data.. You can use the following data in your conditions:
- AutogroupA default tag group. When devices are registered in Airship, they are automatically assigned a tag within the autogroup tag group. The tag is a numeric value of 1 to 100. Some uses for autogroup tags are creating a control group, and to randomly segment your audience. Autogroup must be enabled for your account.
- ChannelA device or address registered with Airship to receive messages, such as a mobile app, web browser, email address, or SMS number. It stores opt-in status, device-specific information, and metadata used for targeting. Each channel has a unique channel ID. ID
- Device propertiesInformation about a channel, such as language and time zone settings, OS and browser versions, and notification opt-in status. Device properties apply to a channel only, not a named user. Values come from the user’s device when an SDK is present, or are set via the API for channels without one. Each property is automatically made available as an attribute, a tag, or both for audience segmentation.
- EventsA record of an action in your app, on your website, in the Airship system, or in an external system. Examples are a message send, an app open, or a purchase transaction.
- Lifecycle listsAn audience list Airship automatically populates based on user activity in your app, such as app opens, uninstalls, notifications received, and dormancy.
- Predicted to Churn statusAnalyzes your audience for users that exhibit behaviors indicating they are likely to become inactive, and tags the users as High, Medium, or Low Risk.
- Subscription listsAn audience list users opt in to for a specific messaging topic. Users can manage their opt-in status using a Preference Center.
- TagsMetadata that you can associate with a channel or named user for audience segmentation. Generally, a tag is a descriptive term indicating a user preference or other categorization, such as wine_enthusiast or weather_alerts_los_angeles. in the device tag groupAn array of tags that you can associate with both channels and named users.
- Uploaded listsAn audience list you populate by uploading a CSV of channel or named user IDs. In the API, uploaded lists are called static lists.
Selected lifecycle, subscription, and uploaded lists must contain channel IDs or named users as the identifier, not a mix of the two.
Note
Setting channel conditions is not supported for projects using the channel-level segmentation system.
Under Localization, enable the option if you want to provide different content to app and web users depending on their language and country.
For Content, configure the message content per enabled channel. See the Content documentation per message type, Content options, and Localization.
For Delivery, configure the message delivery timing and options. See Message delivery.
In the Review step, review the device preview and message summary:
- Use the arrows to page through the various previews. The channel and display type dynamically update in the dropdown menu above. You can also select a preview directly from the menu.
- If you want to make changes, select the associated step in the header, make your changes, then return to Review.
- Select Send Test to send a test message to verify its appearance and behavior on each configured channel. The message is sent to your selected recipients immediately, and it appears as a test in Messages Overview. Follow the same steps as in the Review step for the Message composer.
When your review is complete, select Save Variant.

To add another variant from scratch, select Add variant. To duplicate an existing variant, select the more menu icon () at the end of a row and select Copy to variant.

Set the test audience

After creating an A/B test, select Audience and then set up your test audience:

Choose and configure users:

Option	Description	Steps
All Users	This option makes the test available to a percentage of your total audience.	n/a
Target Specific Users	This option makes the test available to a percentage of users who meet specified conditions.	Select and configure one or more conditions. Use the same process as when building a segmentA reusable audience group you create by selecting unique or shared user data..

(Optional) Under Audience allocation, limit the selected audience to your specified percentage.
(Optional) Enable Control group.
(Optional) To override the default variant distribution, enable Allow uneven allocations and then edit the percentage for each variant and the control group.
Note
If you later add more variants, also update your variant allocation settings.
Select Save.

Start an A/B test

Once you’ve created your message variants and set the audience for your test, select Start and confirm. Airship then sends each variant according to its delivery settings.

After the test starts, an Overview section shows the test status, audience summary, and the Probability of Being the Best Variant chart from the Performance view.

View results

After starting an A/B test, discover which variant performed best and use the test- and message-level reports to determine the quality of each variant and strategies for increasing engagement.

To access test results, go to Experiments, then Message Experiments, select the more menu icon () for a test in the list, then View results. You can also select the name of a test from the list and then go to Results.

See also Implementing A/B tests, outcomes, and compliance in About A/B testing.

Note

Engagement data is sent to Airship as soon as it becomes available. Data may be delayed due to connectivity issues with a user’s carrier, Wi-Fi, power, etc. Wait at least 12 to 24 hours before acting on the data to allow for potential lags.

Performance

The Performance view is the initial state of test results. Its first component is a summary explaining the winning variant or if the results were inconclusive.

Airship declares a winning variant only after all of the following are true:

All variants in the test have been sent.
The test reached at least 1,000 participants.
The winning variant’s Probability to Be Best (PTBB) is 95% or higher.
The winning variant’s expected loss is less than 5%.

Following the summary, a table lists this data for each variant:

Data	Description
Probability to Be Best	This is the likelihood that the variant is the top-performing option. Values of 95% or above indicate high confidence.
Expected Loss	This is the potential risk of choosing a variant that is not actually the best option. Values below 5% indicate minimal risk.
Conversion Rate (vs Top)	This is the percentage of users who converted and how that rate compares to the top-performing variant.
Conversion Count	This is the total number of users who converted within the variant group.
Audience	This is the number and percentage of contacts assigned to the variant.

Select a variant name to open its message report.

Below the table, the following charts visualize variant performance:

Probability of Being the Best Variant — This chart shows the probability that each variant is the top performer in your experiment. Taller bars indicate a higher likelihood of success, and a variant typically needs to reach 95% or higher to be declared the winner. If the bars are roughly the same height, the variants are performing similarly.
Conversion Lift Compared to Others — This chart shows the potential uplift of each variant compared to the top performer, helping you distinguish meaningful improvements from minor fluctuations.
- 0% uplift line — This line represents no difference between variants. Curves to the right indicate improvement. Curves to the left indicate decline.
- Distribution spread — A narrow, tall curve indicates high confidence, while a wider curve suggests more uncertainty.
- Bulk mass — If the majority of a variant’s curve lies above zero, the variant is highly likely to outperform the others.
Likelihood of Variant Success — This chart shows the estimated conversion rate for each variant as a curve.
- Height — The higher the peak, the more likely that conversion rate represents the variant’s true performance.
- Width — A narrow curve indicates high confidence, while a wider curve suggests more uncertainty.
- Overlap — Significant overlap means the variants are performing similarly. Minimal overlap indicates a clearer performance gap between them.

By channel

Select By Channel to see how variant performance differed across each channel in the test. Select a variant name to open its message report.

Export results

Export options for experiment results vary by view:

In the Performance view, select Download.
In the By Channel view, select Download Results, then Performance Data. If your experiment included custom eventsAn event you define to track an action in your app, on your website, or in an external system. Examples are a sign-up, a video watch, or a wish list addition., you will also have the Variant Event Data option, which is a report of event conversions and associated values, broken out by variant.

RTDS events

Messages used as variants in an A/B test include experiment information in Real-Time Data StreamingA service that delivers user-level events in real time to your backend or third-party systems using the Data Streaming API. events.

The Send event includes an experiments object with the test details, including experiment_id, type, and variant_id. The experiment_id also appears in the body object.

The Control event includes an experiment_id at the top level and also in the body object.

Managing A/B tests

Go to Experiments, then Message Experiments to view and manage your message A/B tests. You can filter the list by experiment type and archive status. Each experiment is listed by name with its status and the date it was last modified. Your last modified experiment is listed first, and you can search by experiment name.

You can perform the following actions from the list:

Option	Description	Steps
View	Open the test to access its message variants, audience configuration, and results.	Select its name. Or select its more menu icon () and then Edit.
Duplicate	Make a draft copy of a test with its message variants and audience configuration.	Select a test’s more menu icon () and then Duplicate.
View results	Open the test’s performance reports.	Select its more menu icon () and then View results.
Archive	Stops and archives the test. Archiving a started/active test also cancels its undelivered messages.	Select a test’s more menu icon () and then Archive.

Test and variant statuses

In your list of all message experiments, A/B tests display the following statuses:

Status	Description
Draft	The test has not been started and can still be edited.
Started	One or more variants are in progress or scheduled. You can edit the test name, targeted audience, and messages that have not yet been sent.
Action Required	The test has been started, but at least one variant failed to send. Select Resume Experiment to retry failed variants.
Completed	All variants have been sent.

Within a test, the Variants list displays the following statuses:

Status	Description
Draft	The variant has not yet been saved.
Ready	The variant meets all requirements for sending and has been saved.
Scheduled	The variant is queued to send according to its delivery settings.
Active	The recurring variant is sending according to its delivery settings.
Sent	The variant was sent.
Failed	The variant failed to send. See the test status Action Required.

Editing message variants and audience

You can edit variants and audience settings for any test with Draft status. After opening a test from the Message Experiments list, select the more menu icon () for a variant and select an option:

Option	Description
Edit	Modify the variant’s channels, content, or delivery settings.
Duplicate	Create a copy of the variant as a starting point for a new variant.
Delete	Remove the variant from the test.

To modify the test audience, select Audience and adjust targeting or allocation settings. See Set the test audience for configuration details.

For tests with Started status, you can manage variants configured for recurring delivery from the more menu icon ():

Option	Description
Pause	Temporarily stop sending the variant. Select Resume to continue sending.
Stop	Cancel future sends of the variant. You cannot resume a stopped variant.