E-Commerce Price Comparison Web Scraper - Samsung Products UAE Project Overview Build a Python web scraper that collects Samsung product data from Amazon.ae and Noon.com/UAE, stores it in Google Sheets, and sends email alerts when prices change by 3% or more. Project Scope Data Collection Requirements Source Websites: Amazon.ae and Noon.com/UAE Product Category: Samsung products only Quantity: 50 identical products from each platform Data Points to Capture: Product name Price (AED) SKU (or ASIN for Amazon) Product Image URL Rating (stars/score) Number of Reviews URL Stock status Last updated timestamp Core Features Python Web Scraper Scrape 50 matching Samsung products from each website using Crawl.ai or BeautifulSoup/Selenium Identify the same products across both platforms Use rotating proxies to avoid blocking Data Processing Use Deep-Seek to clean and standardize scraped data Match products across platforms Validate data quality Google Sheets Integration Push scraped data to Google Sheet via Google Sheets API Schedule regular updates (hourly/daily) Maintain comparison data across platforms Price Change Alerts Trigger email when price changes ≥ 3% Include: product name, old price, new price, % change, link Use Gmail API or SendGrid for emails Automation Use n8n workflow or AppScript to schedule Python script execution Alternative: Python APScheduler for cron jobs Technical Stack Language: Python 3.x Web Scraping: Crawl.ai OR BeautifulSoup + Selenium Data Processing: Deep-Seek API for data refinement Rotating Proxies: BrightData, Oxylabs, or similar Google Sheets: Google Sheets API Emails: Gmail API or SendGrid Automation: n8n (webhook/scheduler) OR AppScript OR Python APScheduler Deliverables Python scripts (well-commented) Setup guide with configuration instructions Google Sheet template n8n/AppScript automation workflow (or cron setup) Test results showing working scraping + alerts Documentation on how to modify product list/websites Ideal Candidate Python web scraping experience Familiar with Crawl.ai or BeautifulSoup/Selenium Google Sheets API knowledge Email API integration experience Rotating proxy experience Familiar with n8n or AppScript scheduling (or APScheduler) Can start immediately