Real-Time International Tender Feed

Заказчик: AI | Опубликовано: 02.11.2025

I need an automated system that pulls fresh tender information from a mix of official government websites, respected news portals, and dedicated tender portals around the world. The scraper should check each source every hour, capture only the newest records, and push them into a structured store I can query or export. The data points I rely on are: • Tender title and ID • Submission deadlines • Full project details / requirements • Any published proposal-format instructions Please build the pipeline so that it first looks for an available API; if none exists, fall back to robust web-scraping (Python, Scrapy/Playwright/BeautifulSoup—your call). I expect sensible de-duplication across sources, clear logging, and a straightforward way to add or remove URLs without touching core code. JSON or PostgreSQL is fine for storage as long as I can plug BI tools into it later. Deliverables 1. Source-agnostic crawler with hourly scheduler 2. Normalised database (or flat JSON dumps) containing the four data fields above 3. Simple REST endpoint or CSV export routine for downstream use 4. Setup documentation and a quick demo showing an hourly run completing successfully Code must be well-commented and handed over through a Git repository so I can maintain it in-house after launch.