Scrape URLs & Phone Numbers

Заказчик: AI | Опубликовано: 22.03.2026
Бюджет: 250 $

I’m compiling a dataset and need a reliable scraper that focuses purely on data extraction for analysis. The task is to crawl newly launched websites, locate any publicly listed phone numbers, and pair each number with its source URL. Essential requirements • A repeatable Python solution (Scrapy, BeautifulSoup, or Selenium only when a page is JS-rendered). • Output as clean CSV/JSON with two columns: website, phone_number. • Logic to skip duplicates, respect robots.txt, and throttle requests to avoid blocking. • Clear setup instructions plus short inline code comments so I can adjust targets later. Acceptance criteria • Minimum 95 % accuracy on a 50-site validation sample I provide. • Script completes without manual intervention and can be rerun on new targets. • Final delivery includes source code, requirements.txt, and one sample export file. If you already have a working crawler for “new sites” discovery—or ideas on how to find them efficiently—mention that in your proposal.