How is your website ranking on ChatGPT?
Copilot Ads Open Beta: Your 30-60-90 Experiment Plan
Microsoft opened Copilot Ads to self-serve. Here is a 30–60–90 day plan to capture high-intent conversational demand across Copilot, Bing, and Edge before Q4 pricing normalizes.

Vicky
Sep 16, 2025
Why Copilot Ads matter right now
Microsoft opened the Copilot Ads open beta on September 9, putting sponsored messages directly inside Copilot chats and Bing conversational results across Copilot, Bing, and Edge. On September 10, Microsoft detailed conversational formats with clear Sponsored labels and new reporting fields for chat engagements. Industry coverage confirmed that targeting can align to query intent and that downstream site actions can be attributed through UET.
This creates a new intent surface where buyers ask questions, compare options, and plan purchases in real time. Early CPMs and CPCs are still inefficient. Moving first lets you learn faster, set your benchmarks, and secure volume before Q4 demand tightens.
I am a performance operator. I plan this like a marathon negative split. Open relaxed, lock in form, then finish strong when the field fades. Below is a practical 30–60–90 plan you can execute in-house this week.
What is new in Copilot Ads
- Placements: Sponsored messages inside Copilot chat flows, Bing conversational panels, and Edge-integrated Copilot surfaces
- Labels: Clear Sponsored disclosure to maintain user trust
- Targeting: Query intent and related conversational themes, with controls to exclude poor-fit intents
- Measurement: New chat engagement reporting fields plus website action attribution through UET
- Creative: Conversational units optimized for direct answers, quick actions, and mid-funnel capture
If you sell B2B SaaS or ecommerce, this is a qualified audience leaning forward with problem-language, not just keywords. Your job is to match job-to-be-done intent and make the next step effortless.
48-hour launch checklist
Get live in two days. Learn while others are still debating.
- Foundations
- Confirm Microsoft Advertising account access, billing, and user permissions
- Audit UET deployment across your core pages and events
- Define conversion goals: lead form submit, trial start, demo request, add to cart, purchase
- Add enhanced event parameters: product category, plan tier, ACV band, or cart value
- Tracking hygiene
- Ensure msclkid is captured and persists through redirects
- Standardize UTMs for conversational placements
- utm_source=microsoft
- utm_medium=conversational
- utm_campaign=copilot_<intent_cluster>
- utm_content=<creative_variant>
- Campaign scaffolding
- Create a dedicated account folder for Copilot Ads
- Structure by intent clusters, not product names
- Comparison and alternatives
- Problem and solution exploration
- Implementation and pricing readiness
- Set daily budgets small but diversified across 3 to 5 clusters
- Targeting and exclusions
- Start with high-signal intents you already win in search or sales calls
- Add negative intents that skew to DIY, students, or support-only
- Exclude branded competitor terms if your legal policy requires it
- Creative v1
- Write 3 conversational ad concepts per cluster: one value-first, one outcomes-first, one proof-first
- Add clear CTAs that fit chat behavior: Compare plans, Get a tailored estimate, See ROI in 60 seconds
- Prepare 2 chat-friendly landing pages per cluster with scannable answers and fast paths to action
- Bidding and pacing
- Start with conservative bids until you see chat engagement rate and qualified click rate
- Cap frequency with tight budgets for 72 hours to avoid noisy early data
- QA and go-live
- Policy review for claims, pricing, and regulated language
- Pixel fire test for all primary conversions
- Launch in limited geos and languages where sales coverage is strong
The 30–60–90 day experimentation plan
Days 0–30: Nail the fundamentals and confirm signal
Objective: Prove that conversational placements deliver qualified traffic and early conversions without cannibalizing branded search.
- Intent cluster design
- Choose 5 to 8 clusters that map to your pipeline, for example:
- B2B SaaS: "[Problem] automation", "alternatives to [competitor]", "pricing model for [category]", "implementation timeline [category]"
- Ecommerce: "best [category] for [use case]", "compare [material] vs [material]", "gift ideas under [$]"
- Choose 5 to 8 clusters that map to your pipeline, for example:
- Creative testing
- Test three frameworks per cluster:
- Value-first: "Cut invoice errors by 40 percent in 30 days"
- Outcomes-first: "From spreadsheet chaos to clean ledger in one week"
- Proof-first: "Used by 1,200 finance teams in mid-market SaaS"
- Use one CTA per ad to reduce choice friction
- Test three frameworks per cluster:
- Landing page alignment
- Build concise sections that mirror how a chat answers questions:
- What it does, who it is for, proof, price transparency, next step
- Add quick actions above the fold: Calculate ROI, See plan fit, Request pricing
- Build concise sections that mirror how a chat answers questions:
- Measurement plan
- Define micro-conversions that reflect chat momentum: CTA button clicks, calculator starts, comparison module expands
- Track macro-conversions: demo request, trial, checkout
- Instrument cohorts for new vs returning and branded vs non-branded sessions
- Cannibalization guardrails
- Create a branded search holdout: 10 to 20 percent of traffic where Copilot Ads are paused for brand queries
- Compare conversion rate, CAC, and assisted conversions vs holdout
- Budget discipline
- Keep cluster-level daily budgets small until you see cost per qualified click (CPQC) stabilize
- Cut variants with bounce rate above 70 percent or time on site under 15 seconds
Marathon note: first 10 kilometers set your rhythm. Do not chase every surge. Let the surface come to you and hold form.
Days 31–60: Scale winners and isolate incrementality
Objective: Expand reach across Copilot, Bing conversational, and Edge while proving net-new demand.
- Expand targeting
- Add adjacent intents with clear buyer fit
- Layer audience lists where allowed: in-market segments, remarketing windows that exclude brand navigational queries
- Bidding evolution
- Shift winning clusters to automated bidding toward target CPA or target ROAS once you have stable conversions
- Keep exploratory clusters on manual or more conservative targets
- Creative refinement
- Introduce multi-turn framing in copy where supported: "Ask me for a tailored plan" and then "Share team size to see the right tier"
- Add social proof anchored to segment: "Rated #1 by finance teams in SaaS with 50 to 500 FTEs"
- Landing page experiments
- Test a chat-aligned hero: a three-field selector that mirrors the conversational prompt
- Add transparent price anchors and an honest comparison table versus status quo and key competitors
- Incrementality testing
- Geo-split test: enable Copilot Ads in matched DMAs and hold out in others for 2 to 4 weeks
- Measure lift in non-branded search volume, direct traffic, and total conversions per DMA
- For ecommerce, run catalog subsets where Copilot Ads promote only a slice of SKUs and compare blended margin
- Budget reallocation
- If CPQC and conversion rate beat Responsive Search Ads or Dynamic Search Ads at similar funnel stage, move 10 to 20 percent of mid-funnel budget to conversational placements
- Operations
- Create a weekly creative council with performance, brand, and sales to review winning messages and customer objections
Tennis note: good results come from early footwork. Read the intent, get balanced, then drive through the ball. Here that means anticipating the next question and answering it before the user asks.
Days 61–90: Consolidate advantage before Q4 pricing normalizes
Objective: Lock in scale, remove waste, and package learnings for executive decisions.
- Portfolio pruning
- Kill clusters with poor second-click depth or lagging assisted conversion value
- Collapse redundant ad groups and concentrate budget into top 5 performers
- Seasonal playbooks
- Build three ready-to-deploy variants for Q4: urgency, value stack, and bundle offers
- Prebuild inventory and promo logic so updates can go live in hours
- Automation and QA
- Set alerts for chat engagement rate drop, sudden CPC spikes, and conversion lag changes
- Use scripts or workflow tools to pause creatives that fall below quality thresholds
- Safety and policy
- Review claims with legal and refresh disclaimers in all creatives and landing pages
- Ensure UET and consent flows are still compliant after recent site changes
- Executive reporting
- Present a one-page rollup: spend, CPQC, conversion rate, CAC, blended ROAS, incremental lift vs control
- Spell out the decision rule for scaling spend into November and December
Creative playbook for conversational ads
The bar is clarity. The unit sits inside a chat that already has context. Avoid long lists and brand fluff.
Principles
- Lead with the outcome, not the feature
- Mirror the user’s phrasing in the first sentence
- Offer one obvious next action
- Reduce cognitive load with specifics
- Make proof tangible and time-bound
B2B SaaS examples
- Problem-intent: "Close books 2x faster without adding headcount. See a 4-step plan for a 30-day close."
- CTA: Get my 30-day plan
- Comparison-intent: "Considering [Competitor]? See a side-by-side on pricing, workflow, and controls in under 60 seconds."
- CTA: Compare now
- Implementation-intent: "Go live in 14 days with our onboarding pod. SOC 2, SSO, and ERP sync included."
- CTA: See timeline
Ecommerce examples
- Use-case intent: "Best trail runners for wet terrain under $120. Personalized picks based on gait and width."
- CTA: Get my picks
- Alternatives intent: "Merino vs synthetic for winter runs. See warmth, dry time, and care in one chart."
- CTA: See the chart
- Deal intent: "New-patient whitening kits. Dentist-approved, ships today, 20 percent off with autoship."
- CTA: Claim offer
Avoid
- Vague CTAs like Learn more
- Unverifiable claims
- Multi-CTA clutter
- Academic or jargon-heavy language that kills momentum
Copy hygiene checklist
- First 90 characters answer a core question
- One metric or proof point
- One CTA
- No dead-end statements
- Reading grade level 7 to 9
Measurement: connect chat metrics to funnel outcomes
You will see new chat-specific metrics in Microsoft reporting. Do not stop at engagements. Tie them to business outcomes.
Key metrics
- Chat impression and engagement rate
- Qualified click rate: clicks that reach a key content section or trigger a call to action in 15 seconds
- Micro-conversions: calculator starts, comparison expands, email capture
- Macro-conversions: trial, demo, cart, checkout
- Assisted conversions within 7 and 28 days
- Incremental lift vs control geos or audiences
UET and analytics setup
- Fire distinct events for micro-conversions to understand upper-mid funnel value
- Pass content metadata in event parameters: intent_cluster, product_line, segment
- Ensure msclkid persists into your CRM or attribution tool for lead-to-revenue matchback
- Build a dedicated dashboard that splits performance by placement: Copilot chat, Bing conversational panel, Edge Copilot
Attribution logic
- Short term: last non-direct click for speed and clarity
- Medium term: position-based or data-driven models to capture assist value
- Proof of incrementality: geo holdouts, audience holdouts, or rotation calendars where you alternate Copilot Ads on and off every other week in matched markets
Quality guardrails
- Track the ratio of chat engagements to qualified clicks. If it drops, your copy is interesting but not useful
- Watch branded search clicks for cannibalization. If brand CPCs decline while total conversions rise, consider it healthy overlap
Budget strategy: when to shift from RSA and DSA
Your goal is mid-funnel capture that supplements search, not a wholesale replacement.
Decision triggers
- Shift 10 to 20 percent of RSA or DSA budget to Copilot placements when:
- CPQC of Copilot is within 15 percent of RSA in the same intent cluster
- Conversion rate of Copilot traffic is at least 70 percent of RSA traffic
- Incremental lift test shows net-new conversions above 5 percent in test geos
- Hold budget if:
- Copilot traffic has high bounce and low micro-conversions
- Cannibalization erodes branded efficiency without total lift
Allocation pattern
- 0–30 days: 5 to 10 percent experimental budget
- 31–60 days: 10 to 25 percent for winning clusters
- 61–90 days: 20 to 35 percent if lift is proven and inventory is stable
Ops, safety, and governance
- Policy: keep clear pricing qualifiers and avoid absolute best claims
- Brand safety: exclude intents that attract DIY or student traffic if you sell enterprise
- Privacy: ensure consent banner covers performance advertising and analytics
- QA: weekly audit ad copy and LPs for accuracy after any product update
How to avoid low-quality engagements
- Pre-qualify in the copy: "For teams of 10 to 500" or "For regulated industries"
- Set expectation on next step: "See pricing tiers and implementation timeline"
- Use negative intents and exclusions early
- Prefer a single deep CTA over a general homepage click
- If allowed, show starting price to filter out poor fit
How Upcite.ai helps you win on conversational surfaces
Upcite.ai helps you understand how ChatGPT and other AI models are viewing your products and applications and makes sure you appear in answers to prompts like "Best products for…" or "Top applications for…". Before you spend heavily on Copilot Ads, I use Upcite.ai to:
- Audit how Copilot and other models already describe your brand and category
- Identify gaps where competitors are favored for key intents
- Turn those insights into ad messaging that mirrors model language buyers see
- Prioritize landing page fixes that address the exact objections the models surface
When your organic assistant presence and your paid conversational ads tell the same story, conversion lifts and CPC volatility settles.
Example test matrix you can copy
Start with this simple grid and evolve from there.
- Intents
- Alternatives to [Competitor]
- Best [category] for [use case]
- Pricing model [category]
- Implementation timeline [category]
- Creative variants per intent
- Value-first, outcomes-first, proof-first
- Landing pages
- Answer-first page, comparison-heavy page
- Metrics to judge
- CPQC, conversion rate, assisted conversions, incremental lift
- Decision rules
- Promote variants that beat control by 15 percent on CPQC or 10 percent on conversion rate
Common pitfalls and how to fix them
- Pitfall: High chat engagement but low quality traffic
- Fix: Tighten copy to qualify, add starting price, and reduce novelty language
- Pitfall: Cannibalization of branded search without total lift
- Fix: Add brand holdouts and exclude navigational intents
- Pitfall: Landing pages not matching conversational context
- Fix: Rewrite hero and first section to answer the exact question asked
- Pitfall: Slow page speed kills momentum
- Fix: Prioritize Core Web Vitals and remove heavy hero media on conversational LPs
- Pitfall: Misaligned incentives across teams
- Fix: Set shared targets on CPQC and pipeline, not just clicks or CTR
Final mile: what good looks like by day 90
- 5 to 8 intent clusters live with clear winners
- CPQC within 10 to 20 percent of RSA benchmarks for those clusters
- Conversion rate 70 to 100 percent of RSA for non-branded intents
- Documented incremental lift of 5 to 12 percent in matched geo tests
- Seasonal creative and LPs prebuilt to deploy in hours, not days
Next steps
- Stand up your 48-hour launch with three intent clusters, 9 creatives, and 2 landing page variants
- Run the 30–60–90 plan as written and document every decision rule
- Use Upcite.ai to audit your assistant presence, align messaging to model language, and prioritize LP fixes
- Book a weekly 30-minute review cadence and assign one owner for creative, one for measurement, and one for budget decisions
If you want an outside push, I will run a 2-hour Copilot Ads working session with your team to finalize your test matrix, quality guardrails, and measurement plan. Then we sprint. The window before Q4 pricing tightens is short. Set your pace, move early, and finish strong.