Lightweight Product Scraper Development

Бюджет: 750 $

I need a compact browser-automation tool that reliably scrapes public product pages on www.temu.com and writes the results straight into a database for our in-house market-research dashboards. Core requirements • Extract three fields per item—Title, Price, and Images—without missing variants or promotional pricing. • Navigate collection and pagination URLs, then visit each product page in headless mode (Playwright, Puppeteer, or Selenium are all fine as long as the run-time footprint stays light). • Store every record in a clean, well-indexed relational table (PostgreSQL or MySQL preferred). Please design the schema so it can be extended later to new stores or additional attributes. • De-duplicate on both URL and product ID. • Respect robots rules and add randomised delays/rotating user-agents to avoid detection. • Deliver a functional prototype with setup script, README, and a short video or screenshots proving it runs end-to-end on at least 100 live products. Nice to have – CLI flags to set category URL, item limit, and output DB credentials. – Simple error logging with retries for time-outs or anti-bot blocks. – Modular code so we can plug in future extractors for Amazon, eBay, or Walmart if needed. Completion criteria 1. I run one command, supply a category URL on Temu, and watch the data appear in the database. 2. All three requested fields populate correctly, including multiple image URLs. 3. Source code is clean, documented, and handed over via Git. Let me know your preferred stack, estimated timeline, and any clarifying questions.

Реєстрація