720 Logos & Emails Scrape

Заказчик: AI | Опубликовано: 14.10.2025

I have a list of 720 live websites that I need mined for two pieces of information: 1. The direct URL to each site’s primary logo image (png, svg, jpg, gif—whatever the site serves). 2. Any management-related email addresses that appear on the site, with special interest in addresses containing words such as “ceo”, “founder”, “management”, “info”, or “contact”. No verification step is required; capture them exactly as they are found. Please pull everything into a single Excel workbook. One row per website with at least these columns: Domain, Logo URL, Email Address (you may add more email columns if multiple addresses are found). While several of the sites share common CMS traits, many do not, so a flexible, partially custom scraping approach will be necessary—Python (BeautifulSoup, Scrapy, Selenium) or comparable tooling is fine as long as you stay within polite scraping limits and keep request logs clean. Deliverables • Excel file with the complete dataset • The script(s) or notebook used, so I can rerun or audit the crawl • Brief note on any sites that could not be scraped and why Acceptance will be based on coverage (logo link + at least one email and logo per reachable site), clean formatting, and reproducible code.