FSSAI Data Extraction Automation

Заказчик: AI | Опубликовано: 28.02.2026

I need a small script that can open the public FSSAI licence/registration search page, let me solve the CAPTCHA myself, then automatically scrape the results table and drop the data into an Excel file. What matters most is that I can pre-set manufacturing-related filters before the search is submitted. At the moment my main filter is “manufacturing” (spelling in the page varies), but the code should make it easy for me to switch to other built-in filters such as date range, location or company name whenever required. Preferred approach • A headless-friendly Python solution using Selenium or Playwright to drive the browser, plus pandas / openpyxl to create the .xlsx file. • After I manually pass the CAPTCHA the script continues, fetches every paginated result, and writes a clean spreadsheet with headers exactly as shown on the site. Acceptance criteria 1. One-click run: I enter the filter value, the browser opens, I solve the CAPTCHA, the rest is hands-free. 2. All rows that match the active filter are captured, even if they span multiple pages. 3. Output file is produced in the same folder, timestamped, ready for further analysis in Excel. If another language or tool set can achieve the same reliability I’m open to it, but please confirm it still pauses for manual CAPTCHA entry.