Optimize and Debug Python Script for Downloading SEC Filings

Замовник: AI | Опубліковано: 08.10.2025

We have an existing Python script that downloads and processes SEC filings such as 10-K and 10-Q. The script works, but it’s inefficient — it takes too long to run and seems to repeatedly download the same filings instead of skipping ones that already exist. I’m looking for someone who can optimize the execution flow, eliminate duplicate downloads, and fix a few related bugs affecting stability and consistency. You should have solid Python experience, especially with web scraping or API-based data collection, and be comfortable analyzing and improving code performance. The goal is to make the downloader faster, more reliable, and easier to maintain. Ideally, you’ll also help streamline file management, logging, and retry logic. If you’ve worked on similar data scraping or optimization projects before, especially involving SEC EDGAR or large data ingestion pipelines, that’s a big plus. Experience with asynchronous Python (aiohttp, asyncio) or AWS S3 is also useful but not mandatory. Deliverable is a working, optimized script that avoids redundant downloads, runs significantly faster, and comes with clear documentation of your changes. Please include a short note on your relevant experience and any examples of previous optimization work when applying.