Python Deep Web Scraping

Замовник: AI | Опубліковано: 08.11.2025

I have two product pages that hide their outbound links inside several layers of pop-ups and dynamic elements. I need standalone Python scripts that dig through those layers, capture every link I specify, and export the results to neatly structured JSON. Here is the scope I have in mind: • Build two separate scrapers—one per URL I’ll share in chat. • Each script must launch on a manual command from the terminal. No schedulers are required. • Navigate through all pop-ups, modals, or infinite-scroll sections until every target link is revealed and harvested. • Save the final set of links to a JSON file, preserving any hierarchy or grouping that appears on the page. • Keep the code clean—functions, comments, and a brief README. Provide a requirements.txt so I can create a virtual environment and run python main.py without surprises on my own Docker. • Handle edge cases such as missing pop-ups, site timeouts, or updated selectors with graceful error messages rather than silent failures. You may use Selenium, Playwright, BeautifulSoup, or a headless browser—whatever ensures reliable deep scraping. Just avoid solutions that depend on paid APIs. I’ll test by running each script locally, comparing the JSON output against the live site, and checking that all links are captured. If everything matches, we’re done; if not, we’ll iterate once for fixes. Show me past work demonstrating you have already dealt with JavaScript-heavy sites or multi-popup navigation, and let me know which library you prefer so we can kick this off quickly.