Law Firm Web Data Scraping

Бюджет: 5000 $

I have a list of more than fifty specific law-firm websites and I need their publicly available contact details alongside complete lawyer profiles pulled into a clean, structured dataset. For each firm the essentials are the office address, phone, general email, plus every attorney’s name, title, practice areas, education, bar admissions, and direct contact information when it is published. The sites are all publicly accessible, but several use pagination, JavaScript rendering, or “load-more” buttons, so the solution has to cope with dynamic content without tripping rate limits or bot blocks. I do not need data from legal directories—only the firms I provide. Deliverables • A single CSV or Excel file containing one row per lawyer and separate columns for every requested data point. • Any scripts or notebooks used to gather the information, written in Python (BeautifulSoup, Scrapy or Selenium are fine—use what you’re most comfortable with). • A brief read-me so I can reproduce the scrape later if I update the list of firms. • Status checkpoints every 10–15 sites so we can correct course quickly if layout quirks appear. I will share the URL list once we start; please let me know your preferred toolchain and how long you expect the full run to take.

Python

Регистрация