top of page

How to Steal the Product Copy Formula of Winning Brands: A Simple 4-Step Data-Driven Framework

  • Writer: tony56024
    tony56024
  • 1 minute ago
  • 11 min read

Do you know what actually differentiates great product copy from the good?


Great product copy is rarely the result of creative inspiration. It's the outcome of a data-driven process.


Good copy is often just creative writing. Bad copy is what happens when someone simply asks ChatGPT to generate it.


Product copy formula: Stop treating product copy like an art project


In this blog, we break down the exact 4-step process you can use to write great product content by reverse-engineering the content generation process of category leaders.


Let's say you want to launch a new product in a category where companies like Olay and Cetaphil are dominating. What is the best way to write product content?


The brands that dominate their category didn't get there by chance. Their listings are the result of:



When a brand like Olay sells millions of products across marketplaces, every part of their product pages has been refined over time: titles, bullet points, images, ingredient claims, and product positioning were all battle-tested.


When you study an Olay listing, you aren't just looking at a description — you are looking at a compressed history of optimisation. Every ingredient claim and benefit earned its spot through testing. By reverse-engineering these category leaders, you can bypass years of trial and error and start with a proven formula.


The 4-Step Framework to write great product copy that sells


The most important thing is to understand the patterns that category champions use. This gives you a clear picture of what works and what doesn't. The best way to find those patterns is to analyze the data of the leaders. That's exactly what this framework helps you do. Let's unlock the best way to find your best product copy formula.


Step 1 - Data Collection


The first step is to collect product listings from the category you want to compete in.

For this demonstration, we are using 44 Olay product listings from the Body Wash category on Amazon.


The data was extracted using the Datahut web scraping platform. We are demonstrating how to write the product title, but you can follow the same process for bullet points, descriptions, and more.


To make this easier to understand, imagine you are launching a new body wash product on Amazon and want to learn how leading brands structure their product titles.

Here is an example of a typical title from the dataset:


Olay Body Wash for Women, Fresh Radiance, 24/7 Skin-Loving Freshness, Visibly Radiant, Plant Based Cleansers, Vitamin B3 & Antioxidant Blend, For All Skin Types, Strawberry & Mint Scent, 29 fl oz


To an amateur, this looks like a long, cluttered sentence. To a data scientist, it is a highly structured sequence of high-value variables:


[Brand] + [Product Type] + [Target User] + [Benefit Claims] + [Key Ingredients] + [Skin Type] + [Fragrance] + [Size]


Once you collect titles across a category, you can start identifying patterns that successful brands consistently use.


You can collect this data yourself using web scraping, or access ready-to-use datasets through Datahut.


Step 2: From Messy Text to Analytics Ready Data


When you scrape product listings, you end up with messy, unstructured text. To a human, the structure is obvious but to a computer, it's just a string of words. Annotation converts this into structured product intelligence.


For example, this title:


Olay Face Moisturizer, Regenerist Micro-Sculpting Cream for Women, Fragrance-Free – Anti-Aging – Peptide – 1.7oz

Looks like one long sentence. But an annotation tool turns it into structured information like this:


Part of Title

Entity

Brand

Olay

Product Category

Face Moisturiser

Product Line

Regenerist

Target User

Women

Attributes

Fragrance-Free

Benefits

Anti-Aging

Ingredients

Peptide

Size

1.7oz


This process is called data annotation. Annotated data looks like this


Product Copy Formula of Winning Brands

Each title is broken down into attributes such as Brand, Category, Variant, Target User, Key Attribute, Benefit, Key Ingredient, Size/Weight, Scent, Skin Type, Technology, Bundle, and Free-of claims.


By labeling titles across these dimensions, we convert unstructured product text into structured data. This makes it possible to analyse patterns in how leading brands construct their listings what benefits they emphasise, which ingredients they highlight, and how they position variants and claims.


This structured annotation allows us to reverse-engineer the formula behind effective product titles and identify the signals that marketplaces and shoppers respond to.

You can use any annotation tool available.


We also built a lightweight browser-based annotation tool that works with any LLM API to speed up the process. You can download it and run it directly in your browser. You can get it here: Get the annotation tool


You can also see the demo video of annotation here : Demo Video


Data annotation tool for better product copy
Demo of Data Annotation

After annotation the data looks like this


[
  {
    "text": "Olay Super Serum Body Wash for Extra Dry Skin, 24hr Long Lasting Hydration, 5+ Ingredient Complex for Bright Even Firm Luminous Skin, 18.5 fl oz",
    "annotations": [
      {
        "text": "Olay",
        "label": "Brand"
      },
      {
        "text": "Body Wash",
        "label": "Category"
      },
      {
        "text": "Extra Dry Skin",
        "label": "Skin Type"
      },
      {
        "text": "Hydration",
        "label": "Benefit"
      },
      {
        "text": "Bright",
        "label": "Benefit"
      },
      {
        "text": "Even",
        "label": "Benefit"
      },
      {
        "text": "Firm",
        "label": "Benefit"
      },
      {
        "text": "Luminous Skin",
        "label": "Benefit"
      },
      {
        "text": "Super Serum",
        "label": "Variant"
      },
      {
        "text": "18.5 fl oz",
        "label": "Variant"
      },
      {
        "text": "Serum",
        "label": "Key Ingredient"
      }
    ]
  },

Step 3: Finding the Winning Formula


After you finish annotating all the titles, export this entire data as a json file. You can now convert this to a csv file with the labels as headers. After annotating all 44 listings, the patterns become undeniable. Olay titles aren't just descriptions; they are modular stacks of information.


The anatomy of Olay product titles and the patterns you can clearly see.


The annotated Olay product titles show a very consistent structure.


  • Brand, Category, Benefit, and Size/Weight appear in all 44 listings, making them the core components of every title.

  • Skin Type, Key Ingredient, and Target User appear in most listings (42), indicating that Olay frequently highlights who the product is for and what key ingredients drive the benefit.

  • Scent appears in about three-quarters of the titles (32) as an additional descriptive element, while attributes like


Overall, the data suggests that Olay titles prioritize clear product identification, benefit communication, and ingredient signalling. Additional attributes used selectively to enhance differentiation.


From the position analysis of the annotated fields, the typical title structure looks like this:


Brand + Category + Target User + Variant + Benefit + Key Ingredient + Skin Type + Scent + Technology + Free Of + Size/Weight + Bundle


Step 4: Ingredient Signalling (The SEO Secret)


Analysis of the annotated data reveals which "Power Words" Olay uses to trigger customer trust and search relevance.


In this category, Vitamin B3 / Niacinamide was mentioned 41 times across 44 titles. This tells a new competitor two things:


  1. Market Expectation: If you don't mention Niacinamide or a similar "Serum Complex," you are invisible to the category's top shoppers.

  2. SEO Priority: These ingredients are likely the highest-volume search terms for the category.


The Power of High-Frequency Signaling


By counting ingredient mentions in annotated titles, we can see the signals brands believe resonate most with shoppers. Beyond the dominance of Vitamin B3, the data shows a clear hierarchy of secondary ingredients:


  • Serum Complex (18 mentions): This suggests Olay is moving the category toward a "skincare-as-bodycare" trend.

  • Plant Based Cleansers (16 mentions): This indicates a strategic pivot toward "clean beauty" claims to capture health-conscious segments.

  • Hyaluronic Acid (15 mentions): Highlighting a well-known "hero ingredient" provides an instant mental shortcut for "hydration" to the consumer.

  • Antioxidant Blend (11 mentions) and Vitamin C (8 mentions): These are used regularly to reinforce specific skin-brightening or anti-aging benefits.


Product Copy Formula of Winning Brands

Why Ingredient Data Matters


This type of analysis allows you to understand ingredient trends within the category before you even write your first draft. Instead of choosing ingredients based on what sounds "nice," you choose them based on proven search volume and consumer trust signals established by the market leaders.


When you align your product's key ingredients with these "Top Ingredients in Titles," you aren't just describing a product—you are aligning your listing with the established mental models of your target audience.


Is it just for the titles ?


Not at all. The same reverse-engineering process applies to every element of a product listing — and each one is worth studying carefully.


Bullet Points 


Bullet points are where brands do their heaviest persuasion work. By annotating bullet points across category leaders, you can identify which benefit claims appear most frequently, how brands sequence their messaging (does hydration come before brightening, or after?), and what proof points — dermatologist-tested, clinically proven, allergy-tested — the top sellers consistently include. These aren't random choices. They reflect what shoppers actually respond to.


Product Descriptions 


Descriptions give brands more room to tell a story, but the best ones still follow a recognizable structure. Analyzing descriptions across a category reveals how leaders handle objections, which emotional triggers they lean on, and how they transition from ingredient claims to lifestyle outcomes. This is where you find the deeper narrative formula that drives conversions.


Image


Alt Text and Backend Keywords The same ingredient and benefit signals that dominate titles often show up in backend search terms and image metadata. Studying what category leaders emphasize visually — and how they describe those images — gives you a fuller picture of their SEO strategy beyond just the visible copy.


A+ Content and Brand Stories 


For brands that invest in enhanced content, the patterns are even more revealing. Layout choices, headline structures, comparison charts, and lifestyle imagery all follow repeatable formulas that can be decoded and adapted.


Titles are just the easiest place to start because the patterns are most visible there. Once you've built the habit of reverse engineering category leaders at the title level, applying the same logic to the rest of the listing becomes natural. The compounding effect on your content quality will be significant.


One important caveat

Reverse-engineering category leaders gives you a strong starting point, not a final answer. What worked for Olay an established brand with years of trust built in and may not land identically for a new entrant with no brand recognition yet.


Treat this analysis as your first draft hypothesis. Use it to build listings that are informed by evidence, then run your own A/B tests to validate what actually converts for your specific product and audience. The framework removes the blank page problem; your own testing is what refines the result.


Closing Thoughts


When you look at a product listing on Amazon, it might appear to be just a piece of marketing copy. But behind every high performing listing is a long history of experimentation, optimisation, and data-driven decisions.


The brands dominating a category rarely guess their way to the perfect title. Over time, they test different ingredients, benefits, claims, and positioning until they discover what resonates with both marketplace algorithms and customers.


By collecting category data, annotating it, and analysing the patterns, you can uncover the same signals that experienced brands and consulting teams rely on. Instead of starting with a blank page, you start with evidence of what already works in the market.

In other words, writing product content is not about creativity alone. It’s about understanding the structure behind successful listings and using data to guide your decisions.


And once you approach product content this way, every category becomes something you can study, decode, and systematically improve.


How We Can Help You Write Better Product Copy


Writing great product copy starts with great data — and that's exactly where Datahut comes in.


1. Get the Data You Need, Fast


Manually collecting product listings from Amazon, Walmart, or any other marketplace is slow, inconsistent, and prone to gaps. Datahut's web scraping platform extracts clean, structured product data at scale — titles, bullet points, descriptions, ingredient claims, pricing, and more — so you can start your analysis immediately instead of spending weeks gathering raw inputs.


2. Learn From What's Already Working


We've worked across dozens of product categories and marketplaces, and we've seen firsthand what separates listings that convert from those that don't. Our team can consult with you on the best practices we've observed across categories — from title structure and ingredient signalling to benefit hierarchy and SEO keyword positioning. You get the benefit of patterns we've identified across thousands of listings, without having to build that knowledge from scratch.


If you're launching a new product or optimising an existing catalog, we can help you go from a blank page to a data-backed content strategy — faster than you'd expect.



Frequently Asked Questions


Frequently Asked Questions


1. What is the best way to write product titles for Amazon?


The most reliable approach is to reverse-engineer titles from the top-selling products already in your category rather than writing from scratch. Collect 20–50 titles from category leaders, break each one down into its components (brand, product type, target user, key benefit, ingredients, skin type, scent, size), and look for the patterns that appear most consistently. Whatever shows up in 80%+ of titles is effectively a category requirement .


Your listing needs it to be considered relevant by both the algorithm and the shopper. What appears in 50–70% of titles is where differentiation happens. Start with the required elements, then use the variable ones to position your specific product. Amazon's character limit (typically 200 characters) means you'll need to prioritize, so build your title around the highest-frequency signals first.



2. How do category leaders like Olay write their product listings?


The honest answer is that no one at Olay sits down and writes a listing the way a copywriter would. At scale, listings are the output of a testing infrastructure . Product teams run A/B experiments on titles, change one variable at a time (swap an ingredient claim, test a different benefit phrase, add or remove a scent descriptor), and measure the impact on click-through rate and conversion.


What you see on the page today is the version that survived that process. For a new entrant without that testing history, reverse-engineering gives you a shortcut: you're essentially inheriting the conclusions of experiments you didn't have to run yourself.


The important thing to understand is that you're reading the end state, not the starting point Olay's current listings look nothing like their first drafts.


3. What is data annotation in the context of product content?


Annotation is the process of turning a product title which is just a string of text into a structured record where every component has a label.


So instead of "Olay Body Wash for Women, Vitamin B3, 29 fl oz" as a single sentence, you end up with a row of data: Brand = Olay, Category = Body Wash, Target User = Women, Key Ingredient = Vitamin B3, Size = 29 fl oz. Once you have 40 or 50 titles annotated this way, you can run simple analysis — what percentage of titles include a skin type claim? Which ingredients appear most often, and in what position? Annotation is what converts a pile of scraped text into something you can actually learn from.


It doesn't require specialist tools; a spreadsheet works fine for small datasets, though LLM-assisted annotation tools speed things up significantly at scale.


4. How do I find the right ingredients to include in my product title?


If i give you an example from the data set we just analysed, start by collecting titles from the top 20–30 products in your category. List every ingredient or ingredient-adjacent claim that appears (Niacinamide, Hyaluronic Acid, Peptide, Plant-Based Cleansers, etc.). Count how many titles each one appears in. Ingredients that show up in more than 70% of titles are effectively category table stakes — shoppers in that category are already searching for them, and excluding them signals that your product doesn't belong.


For ingredients that appear in 30–50% of titles, you have a choice: including them helps you capture a specific search segment, but only if your product actually contains them. Never include an ingredient in a title purely for SEO if it's not meaningfully present in the formula — this generates returns, negative reviews, and potential compliance issues on Amazon. Use the frequency data to prioritize, but let your actual product formulation make the final call.


5. How many competitor listings do I need to analyse for this to be useful?


For most product categories on Amazon, 30–50 listings from the top sellers gives you enough signal to identify reliable patterns. Below 20, you risk mistaking one brand's idiosyncratic style for a category norm. Above 100, you hit diminishing returns unless the category is unusually fragmented with many distinct sub-segments.


The key is to focus on the right listings: pull from the top 20–30 organic search results for your primary category keyword, not just one brand. If a single brand like Olay dominates the category, include their full catalog but balance it with 10–15 listings from other top sellers so you're reading category patterns rather than one company's house style.


Rerun the analysis every 2–3 months ingredient trends and title conventions shift, and what worked two years ago may no longer reflect how the algorithm or the shopper has evolved.


Do you want to offload the dull, complex, and labour-intensive web scraping task to an expert?

bottom of page