3 Ways Popular Price Comparison Websites Gather Data
Updated: Feb 5, 2021
A few years back, a typical online buyer looking for apparel or an electronic gadget would scour the web, comparing prices across e-commerce sites to find the best deal available. Prices for the same product varied widely from site to site, which in turn prompted retailers to have analysts monitor price changes in real time. In today’s hyper-competitive e-commerce scenario, retail players grab every opportunity to attract potential customers, and price comparison websites are one of the opportunities they leverage.
Price comparison websites are a win-win for both a business owner and a customer. Customers benefit from good deals, a convenient shopping experience, broader coverage of major e-commerce sites, and a greater variety of listings for the same product. Business owners, on the other hand, benefit from a greater number of leads, better conversion rates, and enhanced customer service.
How do Price Comparison Websites work?
A price comparison app or website lets you compare prices for a product across retailers. You type in the product name, and the tool shows you a list of retailers selling that product along with the price each one charges.
Price comparison engines skim through multiple e-commerce sites to gather data about products and services, such as prices, descriptions, features, and reviews. This information is then combined on the price comparison website, and the results are tailored to each visitor’s search request.
This way, when a buyer searches for a product on a price comparison website, the site compares and displays listings of the same product from multiple sellers. The buyer can then compare the listings based on price, features, shipping costs, etc. to find the best deal.
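To make the comparison step concrete, here is a minimal sketch of how listings for one product might be ranked once they have been collected. The Offer class, the seller names, and the prices are all made up for illustration, and ranking by price plus shipping is just one reasonable choice, not how any particular site does it.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class Offer:
    """One seller's listing for a product (hypothetical structure)."""
    seller: str
    price: Decimal
    shipping: Decimal

    @property
    def total(self) -> Decimal:
        # What the buyer actually pays: item price plus shipping.
        return self.price + self.shipping


def rank_offers(offers):
    """Return the offers sorted by total cost, cheapest first."""
    return sorted(offers, key=lambda offer: offer.total)


# Illustrative data only.
offers = [
    Offer("Store A", Decimal("199.99"), Decimal("0.00")),
    Offer("Store B", Decimal("189.99"), Decimal("12.50")),
    Offer("Store C", Decimal("194.00"), Decimal("4.99")),
]

ranked = rank_offers(offers)
for offer in ranked:
    print(f"{offer.seller}: {offer.total}")
```

Note that Store B has the lowest sticker price but ends up last once shipping is included, which is why comparing on total cost tends to be more useful to the buyer than comparing on price alone.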
The algorithms governing the price comparison functions use this data as input. Extracting and updating the data in real time is a challenging task for price comparison engines. Moreover, e-commerce sites use dynamic pricing, so prices change frequently and each change needs to be reflected in the price comparison engine as soon as possible. For example, Amazon reportedly takes two minutes to adapt to a price change, which is 43,000 minutes (roughly 30 days) faster than the industry average.
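One simple way to reason about freshness is to store the time each price was last fetched and flag anything older than a chosen threshold for re-collection. The sketch below assumes a hypothetical listing record with a fetched_at timestamp; the 15-minute threshold is an arbitrary example value, not a recommendation.

```python
from datetime import datetime, timedelta, timezone


def find_stale(listings, now, max_age):
    """Return the listings whose price was fetched more than max_age ago."""
    return [item for item in listings if now - item["fetched_at"] > max_age]


# Fixed "current time" so the example is reproducible.
now = datetime(2021, 2, 5, 12, 0, tzinfo=timezone.utc)

# Illustrative records; field names are assumptions.
listings = [
    {"sku": "A-1", "price": "9.99", "fetched_at": now - timedelta(minutes=3)},
    {"sku": "A-2", "price": "24.50", "fetched_at": now - timedelta(hours=2)},
]

stale = find_stale(listings, now, max_age=timedelta(minutes=15))
print([item["sku"] for item in stale])
```

In practice the threshold would probably differ by category, since prices for fast-moving products change far more often than others.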
Google Shopping, PriceGrabber, Shopping.com, Shopzilla, and various other price comparison sites offer an easy and fast way to locate the best prices.
Challenges faced by Price Comparison Websites
While price comparison websites are constantly updating their comparison algorithms to cope with the dynamic web changes that occur across various e-commerce sites, these engines face some challenges when it comes to building and retaining a dynamic comparison tool.
The challenges in getting and using data for price comparison include:
Comparison technology: One major issue we often see is that different retailers structure their pricing data differently. This makes it tough to build a comparison engine that handles every structure. Such challenges often lead to a discrepancy between a product’s price on the retailer’s website and the price shown in a comparison shopping engine (CSE). In addition, data updates may take too long in some CSEs, resulting in inaccurate information.
The volume of data: If a price comparison site targets a larger product range and more e-commerce sites, it has a humongous amount of data to handle. This makes the task even more complex.
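The structural mismatch described above is usually handled by mapping every retailer's record onto one common schema before comparing anything. Here is a minimal sketch of that idea. The retailer names, field names, and sample records are hypothetical, and the price parser assumes "." is the decimal separator and "," only groups thousands, which will not hold for every locale.

```python
import re
from decimal import Decimal


def parse_price(text: str) -> Decimal:
    """Parse strings such as '$1,299.00' or '1299 USD' into a Decimal.

    Assumption: '.' is the decimal separator and ',' only groups thousands.
    """
    cleaned = re.sub(r"[^\d.]", "", text)
    if not cleaned:
        raise ValueError(f"no numeric price found in {text!r}")
    return Decimal(cleaned)


# Hypothetical per-retailer maps: common field name -> retailer's field name.
FIELD_MAPS = {
    "retailer_a": {"sku": "id", "title": "name", "price": "price"},
    "retailer_b": {"sku": "productCode", "title": "productTitle", "price": "displayPrice"},
}


def normalize(retailer: str, raw: dict) -> dict:
    """Convert one raw retailer record into the common schema."""
    fields = FIELD_MAPS[retailer]
    return {
        "retailer": retailer,
        "sku": raw[fields["sku"]],
        "title": raw[fields["title"]].strip(),
        "price": parse_price(raw[fields["price"]]),
    }


record_a = normalize(
    "retailer_a", {"id": "A-100", "name": "4K Monitor ", "price": "$1,299.00"}
)
record_b = normalize(
    "retailer_b",
    {"productCode": "9931", "productTitle": "4K Monitor", "displayPrice": "1249 USD"},
)
print(record_a)
print(record_b)
```

Keeping the per-retailer differences in a small mapping table means that adding a retailer is mostly a data change rather than new parsing code, which helps as the number of sources grows.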
How Price Comparison Websites Grab Data
A lot of startups work with Datahut to obtain data for their price comparison engines at a large scale. Product data from e-commerce sites is collected in the following ways:
1. Feeds from merchants
Price comparison websites partner with stores or companies to get data directly from a retailer’s feeds. Traffic from comparison shopping websites is a major source of revenue for internet retailers, so if you can build a partnership with them, they can give you access to their API. However, you will need to pay a premium to use the API at higher volumes. Merchants either set up an API or use FTP to deliver the data directly.
While this method of data collection provides direct access to data, it has its cons. Oftentimes getting real-time updates to changes becomes difficult. Moreover, different merchants provide data in multiple formats, making it difficult to integrate into one platform.
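As a rough illustration of the format problem, suppose one merchant delivers a CSV file and another delivers JSON. The sketch below reads both into the same record shape using only the Python standard library. The feed contents, column names, and merchant names are invented for the example; real feeds will have their own layouts.

```python
import csv
import io
import json
from decimal import Decimal

# Illustrative feed contents; real merchants define their own columns and keys.
CSV_FEED = "sku,title,price\nA-1,USB-C Cable,9.99\nA-2,Wireless Mouse,24.50\n"
JSON_FEED = '{"items": [{"code": "B-7", "name": "USB-C Cable", "cost": "8.75"}]}'


def read_csv_feed(text, merchant):
    """Read a CSV feed with sku/title/price columns into common records."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {
            "merchant": merchant,
            "sku": row["sku"],
            "title": row["title"],
            "price": Decimal(row["price"]),
        }
        for row in reader
    ]


def read_json_feed(text, merchant):
    """Read a JSON feed with an 'items' array into the same record shape."""
    data = json.loads(text)
    return [
        {
            "merchant": merchant,
            "sku": item["code"],
            "title": item["name"],
            "price": Decimal(item["cost"]),
        }
        for item in data["items"]
    ]


catalog = read_csv_feed(CSV_FEED, "Merchant A") + read_json_feed(JSON_FEED, "Merchant B")

# With one record shape, finding the cheapest listing for a title is a one-liner.
cables = [record for record in catalog if record["title"] == "USB-C Cable"]
cheapest = min(cables, key=lambda record: record["price"])
print(cheapest["merchant"], cheapest["price"])
```

Each new feed format needs its own small reader, but everything downstream of the readers only ever sees one record shape.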
Note: One of the common ways of monetizing a price comparison site is by becoming an affiliate partner and earning a referral commission.
2. Product feeds from third-party API
Retail product data can also be obtained through third-party APIs. When integrated with shopping carts, some services provide e-commerce data via an API request. In this case, third-party services charge for the volume of data extracted. These companies actively crawl the most common retailer websites and provide this data in a database for easy access by price comparison websites.
This method saves a lot of development time and allows easy integration of price comparison engines with multiple shopping carts. However, the biggest challenge in API integration is that building and maintaining a separate integration for each shopping platform’s specifications is difficult and expensive. Shopping platforms like Magento, WooCommerce, OpenCart, Shopify, etc. can be integrated via API2Cart, which provides a unified API to connect to all of these shopping carts.
3. Web Scraping
In most cases, options 1 and 2 might not be available when you are just starting up and have limited funds. In such cases, web scraping is a reliable and efficient way of getting product data from target sites according to your requirements.
You can acquire pricing data either by building your own in-house web scraping setup or by employing a data-as-a-service (DaaS) provider who’ll provide the requisite data as per your requirement. For efficient results in your price comparison site, the quality of the data has to be top-grade, and having up-to-date datasets with low latency becomes an essential requirement. Since managing a price comparison website is itself a cumbersome task, we recommend you go with a web scraping service provider for pricing data extraction.
Web scraping services employ crawling bots to regularly gather information from websites for further processing. This provides the price comparison engine with accurate and up-to-date information from target sites without having to depend on merchants for data. However, many websites have anti-crawling mechanisms that block users who make too many requests.
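To give a feel for what a crawling bot does, here is a minimal sketch using only the Python standard library. It pulls the price out of a product page and enforces a minimum delay between requests so the crawler stays polite. Everything site-specific here is an assumption: the "price" CSS class, the sample HTML, and the URLs are invented, and the pages are in-memory strings rather than real HTTP responses. A production scraper would also need to respect robots.txt and each site's terms of use, and would typically use a more robust HTML parser.

```python
import time
from html.parser import HTMLParser


class PriceParser(HTMLParser):
    """Collect the first text chunk inside each element with a given CSS class."""

    def __init__(self, css_class):
        super().__init__()
        self.css_class = css_class
        self._capture = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if self.css_class in classes:
            self._capture = True

    def handle_data(self, data):
        if self._capture and data.strip():
            self.prices.append(data.strip())
            self._capture = False


def extract_prices(html, css_class="price"):
    """Return the price strings found in one page of HTML."""
    parser = PriceParser(css_class)
    parser.feed(html)
    parser.close()
    return parser.prices


class Throttle:
    """Ensure at least min_interval seconds pass between consecutive calls."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self._min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self._min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


# In-memory stand-ins for fetched pages; URLs and markup are hypothetical.
PAGES = {
    "https://shop-a.example/cable": '<div><h1>USB-C Cable</h1><span class="price">$9.99</span></div>',
    "https://shop-b.example/cable": '<p class="price sale">$8.75</p><p class="old">$10.00</p>',
}

throttle = Throttle(min_interval=0.05)
results = {}
for url, html in PAGES.items():
    throttle.wait()  # a real crawler would issue its HTTP request after this
    results[url] = extract_prices(html)
print(results)
```

The throttle takes its clock and sleep functions as parameters so the delay logic can be tested without actually waiting. Spacing out requests like this reduces the chance of tripping anti-crawling mechanisms, though it does not guarantee a site will not block the crawler.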
We recommend you read our post on How to Choose the right Web Scraping Service for your organization.