Amazon Books Data Extraction

Замовник: AI | Опубліковано: 02.04.2026
Бюджет: 200 $

This project is to extact ISBNs and ASINs from Amazon. Target Delivery of 4/10/26 (or sooner) We are looking for someone with experience pulling data from Amazon (ideally books data) and most important, familiarity with Amazon’s captcha. We need systematically extract a list of all ISBNs (i.e. ASINs) for the top 100 Pages of all Book categories on Amazon. We estimate there will be about 150k to 250k pages to scrape. Using the below URL (see below), we need to pull the following info for each search result for EACH category and for each subcategory (up to five subcategories deep). You can review up to 100 Pages for each subcategory. In each instance, the data will need to be sorted by “Most Reviews”. This is very important. Also, prior to going through the subcategories, please cycle through the first 100 pages of the top level category as well. So for example, prior to proceeding with the subcategories of Arts & Photography (such as Architecture etc), please extract the ASINs from the first 100 Pages of the Arts & Photography List. 1. Book Title: Where the Crawdads Sing: Reese's Book Club 2. Primary ASIN: 0735219109 3. Star Rating 4.7 4. Review Count: 639,789 5. Format 1 Type: Paperback 6. Format 1 ASIN: 0735219109 7. Format 1 Price: 9.41 8. Format 1 List Price: 18.00 9. Format 2 Type: Kindle 10. Format 2 ASIN: B078GD3DRG 11. Format 2 Price: 9.41 12. Etc (for each Format shown) 13. Search Category String: Eg: History > Military > World War II (i.e. this is the category you are extracting from) 14. Search URL Note that the only info that is truly essential is the ASINs and Search Category String, so if the other bits are tricky to pull then feel free to ignore them. Deliver the resulting files of up to 1 Million ASINs, if possible. CSV or Txt file is fine. Happy to have a quick call to clarify anything. https://www.amazon.com/s?i=stripbooks&rh=n%3A283155&s=review-count-rank&dc&ds=v1%3AiKJJFGEeqaBPPE1WAoTv6DVpHfLJYqD7XMz9FgbPIRQ&crid=257D171XQRMUR&qid=1775146245&sprefix=b0gvbf1hlw%2Caps%2C117&xpid=rMvVQCXKbYUDo&ref=sr_ex_n_1