S2 - Full Website Collection

This document defines the Product Page Full Site Scrape service.

Return To Product List

Return To Packages
 

Scope

This product sheet defines web scraping in the context of an e-commerce website.

The target website is defined by the client.

A feasibility check will be carried out to ensure the website can be scraped.

Definition

The Product Page Full Site Scrape product:

  • Navigates the target website category structure
  • Collects products from the lowest level category in the target website category structure
  • Product information is collected from the product page level
  • Product information collected where available:
    • Product Code
    • Product Title
    • Price
    • Product Description
    • Category Structure
    • Manufacturer
    • Manufacturer Part Number/EAN/ISBN
    • Promotional Information
    • Stock Availability
    • Regular Price
    • URL
    • Image URL
  • The following information is additionally recorded:
    • Collection date/time
 

Input Required From Client

  • Target Website
  • Data Collection Frequency
    • Daily
    • Weekly (specify day of week)
    • Monthly (specify day of month)
    • One-off
    • Other - as specified
 

Notes

The scrape will collect the product code, product title and price on every visit. More static data, such as the full product description is collected the first time a product is identified and is refreshed on a periodic basis thereafter
Back to List