Project: Australia Sports Clubs Master Database + Matching System Objective Build an automated Python scraping system to collect all sports clubs across Australia and create a clean, de-duplicated, matched Excel master database. Primary sources may include: PlayHQ (AFL and other sports) NRL club directories Cricket associations Soccer (Football Australia directories) Other public sports governing body directories Required Data Fields For each club: Club Name Sport Type (AFL, NRL, Cricket, Soccer, etc.) League / Association State Suburb / City Postcode Contact Person Name (if available) Contact Role Email Phone Home Ground Ground Address Website (if available) Source URL Matching Requirement (Important) The freelancer must: Remove duplicates across sports and directories Implement fuzzy matching logic (e.g. using RapidFuzz or similar) Match clubs against an existing Excel file (provided by me) Add a new column: Match Score % Matched Club Name (from my file) Matching should compare: Club Name Suburb / City State Technical Requirements Python only Use Selenium or API method for dynamic websites Handle pagination Avoid IP blocking Clean null values Structured pandas DataFrame Export final file as: Australia_Sports_Clubs_Master.xlsx Deliverables Fully working Python script Matching logic included Clean Excel output Instructions to run Commented code Skills Required Advanced web scraping Selenium (dynamic websites) Fuzzy matching (RapidFuzz) Data cleaning & structuring Handling large datasets