Scope
This product sheet defines web scraping in the context of an e-commerce website.
The target website is defined by the client.
The target categories are defined by the client.
A feasibility check will be carried out to ensure the website can be scraped.
Definition
The Product Page Listed Categories Site Scrape product:
- Navigates the target website's category structure for the target categories defined by the client.
- Collects products from the lowest-level category in the target website's category structure.
- Collects product information at the product page level.
- Product information collected where available:
  - Product Code
  - Product Title
  - Price
  - Product Description
  - Category Structure
  - Manufacturer
  - Manufacturer Part Number/EAN/ISBN
  - Promotional Information
  - Stock Availability
  - Regular Price
  - URL
  - Image URL
- The following information is additionally recorded:
  - Collection date/time
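For illustration only, the collected fields listed above could be represented as a single record per product. The sketch below is not a delivered schema: the class name, field names, types and the example URL are all assumptions. Every product field defaults to empty because the sheet collects information only "where available".

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import List, Optional


@dataclass
class ProductRecord:
    """One scraped product. All names and types here are hypothetical."""

    url: str
    product_code: Optional[str] = None
    product_title: Optional[str] = None
    price: Optional[str] = None
    product_description: Optional[str] = None
    # Category path from top level down to the lowest-level category.
    category_structure: List[str] = field(default_factory=list)
    manufacturer: Optional[str] = None
    # Manufacturer Part Number, EAN or ISBN, whichever the site exposes.
    manufacturer_part_number: Optional[str] = None
    promotional_information: Optional[str] = None
    stock_availability: Optional[str] = None
    regular_price: Optional[str] = None
    image_url: Optional[str] = None
    # Recorded by the scrape itself rather than read from the page.
    collected_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )


# Example: a product page that exposes only a code, title and price.
record = ProductRecord(
    url="https://shop.example/p/123",  # hypothetical URL
    product_code="123",
    product_title="Example Widget",
    price="9.99",
)
```

Prices are kept as text in this sketch so that currency symbols and site-specific formatting survive unchanged; a real delivery format may differ.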
Input Required From Client
- Target Website
- Target Categories
- Data Collection Frequency
  - Daily
  - Weekly (specify day of week)
  - Monthly (specify day of month)
  - One-off
  - Other - as specified
Notes
The scrape will collect the product code, product title and price on every visit. More static data, such as the full product description, is collected the first time a product is identified and is refreshed periodically thereafter.
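The collection rule in this note can be sketched as a small decision function. This is a minimal illustration under stated assumptions, not the actual scraper logic: the function and constant names are hypothetical, the 30-day refresh interval is an assumed value (the sheet says only "periodic"), and the full product description is the only static field the sheet names.

```python
from datetime import datetime, timedelta
from typing import Optional, Tuple

# Collected on every visit, per the note above.
VOLATILE_FIELDS = ("product_code", "product_title", "price")
# The sheet names only the full description as an example of static data.
STATIC_FIELDS = ("product_description",)
# Assumed value; the sheet does not state the refresh period.
REFRESH_INTERVAL = timedelta(days=30)


def fields_to_collect(
    last_static_refresh: Optional[datetime],
    now: datetime,
    refresh_interval: timedelta = REFRESH_INTERVAL,
) -> Tuple[str, ...]:
    """Return the field names to collect on this visit.

    last_static_refresh is None when the product has just been identified
    for the first time.
    """
    first_time = last_static_refresh is None
    if first_time or now - last_static_refresh >= refresh_interval:
        return VOLATILE_FIELDS + STATIC_FIELDS
    return VOLATILE_FIELDS


# Example visits, all evaluated on the same (arbitrary) date.
now = datetime(2024, 1, 31)
new_product = fields_to_collect(None, now)
recently_refreshed = fields_to_collect(datetime(2024, 1, 20), now)
due_for_refresh = fields_to_collect(datetime(2024, 1, 1), now)
```

In this sketch a newly identified product and a product whose static data is at least one interval old both get the full field set; any other visit collects only the code, title and price.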