S3 - Website Collection Listed Categories

This document defines the List Page Listed Categories Scrape service.

 

Scope

This product sheet defines web scraping in the context of an e-commerce website.

The target website is defined by the client.

The target categories are defined by the client.

A feasibility check will be carried out to confirm that the website can be scraped.

Definition

The List Page Listed Categories Site Scrape product:

  • Navigates the target website category structure using the target categories defined by the client
  • Collects products from the lowest level category in the target website category structure
  • Collects product information from the product list page, defined as a website page which displays multiple separate products in a list
  • Product information collected includes, but is not necessarily limited to, the following where available:
    • Part Number
    • Product Title
    • Manufacturer Part Number/EAN/ISBN
    • Price
    • Short Description
    • URL
    • Image URL
    • Stock Availability
  • The following information is additionally recorded:
    • Collection date/time
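
As an illustration only, the definition above can be sketched in Python. The record type mirrors the listed fields, and the helper finds the lowest level categories beneath the client's target categories. All names here (ListPageRecord, lowest_level_categories, the sample category tree) are hypothetical and not part of the service; storing prices as text and timestamps in UTC are assumptions.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class ListPageRecord:
    """One product as shown on a product list page (hypothetical field names)."""
    # Every field is optional because data is collected "where available".
    part_number: Optional[str] = None
    product_title: Optional[str] = None
    manufacturer_code: Optional[str] = None  # Manufacturer Part Number/EAN/ISBN
    price: Optional[str] = None              # kept as text; an assumption
    short_description: Optional[str] = None
    url: Optional[str] = None
    image_url: Optional[str] = None
    stock_availability: Optional[str] = None
    # Additionally recorded: collection date/time (UTC is an assumption).
    collected_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )


def lowest_level_categories(tree, targets):
    """Return the lowest level categories under each target category.

    `tree` maps a category name to a list of its child category names.
    A category with no children counts as a lowest level category.
    """
    leaves = []
    stack = list(reversed(targets))
    while stack:
        category = stack.pop()
        children = tree.get(category, [])
        if children:
            # Visit children in their listed order.
            stack.extend(reversed(children))
        else:
            leaves.append(category)
    return leaves


# Made-up category structure for demonstration.
sample_tree = {
    "Tools": ["Hand Tools", "Power Tools"],
    "Power Tools": ["Drills", "Saws"],
}
print(lowest_level_categories(sample_tree, ["Tools"]))
```

In this sketch, products would be collected only from the list pages of "Hand Tools", "Drills" and "Saws", since those are the categories with no further subcategories.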
 

Input Required From Client

  • Target Website
  • Target Categories
  • Data Collection Frequency
    • Daily
    • Weekly (specify day of week)
    • Monthly (specify day of month)
    • One-off
    • Other - as specified
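
The inputs above could be captured in a simple request structure, sketched below in Python. This is a hypothetical illustration, not a defined interface: the class name CollectionRequest, the field names and the validation messages are all assumptions. The check only enforces what the list states, namely that weekly, monthly and other frequencies need an accompanying detail.

```python
from dataclasses import dataclass
from typing import List, Optional

# Frequency options from the list above; True means a detail must accompany it
# (day of week, day of month, or the client's own specification).
FREQUENCY_NEEDS_DETAIL = {
    "daily": False,
    "weekly": True,
    "monthly": True,
    "one-off": False,
    "other": True,
}


@dataclass
class CollectionRequest:
    """Client input for one scrape (hypothetical structure)."""
    target_website: str
    target_categories: List[str]
    frequency: str
    frequency_detail: Optional[str] = None

    def validate(self):
        """Return a list of problems; an empty list means the input is complete."""
        problems = []
        if not self.target_website:
            problems.append("target website is missing")
        if not self.target_categories:
            problems.append("at least one target category is required")
        if self.frequency not in FREQUENCY_NEEDS_DETAIL:
            problems.append(f"unknown frequency: {self.frequency}")
        elif FREQUENCY_NEEDS_DETAIL[self.frequency] and not self.frequency_detail:
            problems.append(f"{self.frequency} frequency needs a detail")
        return problems


# Example with made-up values.
request = CollectionRequest(
    target_website="https://www.example.com",
    target_categories=["Power Tools"],
    frequency="weekly",
    frequency_detail="Monday",
)
print(request.validate())
```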
 

Notes

This product may not be suitable if the client wishes to match products using the Standard, Enhanced or TOTAL Matching applications.