S1 - Full Website Collection

This document defines the List Page Full Site Scrape service.

Scope

This product sheet defines web scraping in the context of an e-commerce website.

The target website is defined by the client.

A feasibility check will be carried out to confirm that the website can be scraped.

Definition

The List Page Full Site Scrape product:

  • Navigates the target website's category structure
  • Collects products from the lowest-level categories in that structure
  • Collects product information from the product list page, defined as a website page that displays multiple separate products in a list
  • Product information collected includes, but is not necessarily limited to, the following where available:
    • Part Number
    • Product Title
    • Manufacturer Part Number/EAN/ISBN
    • Price
    • Short Description
    • URL
    • Image URL
    • Stock Availability
  • The following information is additionally recorded:
    • Collection date/time
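
As an illustration only, one collected product could be represented by a record like the sketch below. The class name `ListPageProduct`, the field names, the use of text for price and stock, and the UTC timestamp are all assumptions made for this example; they are not part of the service definition. Every product field is optional because the information is collected only where available.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class ListPageProduct:
    """Hypothetical record for one product found on a product list page."""

    part_number: Optional[str] = None
    product_title: Optional[str] = None
    # Manufacturer Part Number, EAN or ISBN, whichever the site displays
    manufacturer_identifier: Optional[str] = None
    # Kept as displayed text (assumption): currency and format vary by site
    price: Optional[str] = None
    short_description: Optional[str] = None
    url: Optional[str] = None
    image_url: Optional[str] = None
    # Kept as displayed text (assumption), e.g. "In stock"
    stock_availability: Optional[str] = None
    # Recorded in addition to the product information; UTC is an assumption
    collected_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )


# Example: a list page that shows only a title and a price
record = ListPageProduct(product_title="Example Widget", price="9.99")
```

Fields that the list page does not show simply stay empty (`None`) in this sketch.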
 

Input Required From Client

  • Target Website
  • Data Collection Frequency
    • Daily
    • Weekly (specify day of week)
    • Monthly (specify day of month)
    • One-off
    • Other - as specified
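
The client input above could be captured in a small structure such as the following sketch. The names `Frequency` and `CollectionRequest`, the field names, and the example URL are hypothetical. The checks only mirror the notes in the list: a weekly schedule needs a day of week, a monthly schedule needs a day of month, and "Other" needs a description.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Frequency(Enum):
    DAILY = "daily"
    WEEKLY = "weekly"
    MONTHLY = "monthly"
    ONE_OFF = "one-off"
    OTHER = "other"


@dataclass
class CollectionRequest:
    """Hypothetical container for the input required from the client."""

    target_website: str
    frequency: Frequency
    day_of_week: Optional[str] = None      # required for weekly
    day_of_month: Optional[int] = None     # required for monthly
    other_schedule: Optional[str] = None   # required for "other"

    def __post_init__(self) -> None:
        # Reject requests that omit the detail their frequency calls for
        if self.frequency is Frequency.WEEKLY and not self.day_of_week:
            raise ValueError("Weekly collection requires a day of week")
        if self.frequency is Frequency.MONTHLY and self.day_of_month is None:
            raise ValueError("Monthly collection requires a day of month")
        if self.frequency is Frequency.OTHER and not self.other_schedule:
            raise ValueError("Other frequency requires a description")


# Example: weekly collection every Monday (placeholder URL)
request = CollectionRequest(
    target_website="https://www.example.com",
    frequency=Frequency.WEEKLY,
    day_of_week="Monday",
)
```

Validating at construction time means an incomplete request is rejected before any collection is scheduled.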
 