Maintain Web Scrapers

Client: AI | Published: 17.03.2026

Job Posting: Senior Full-Stack Developer (Python/TypeScript), Legal Automation Specialist

Project Overview

We are seeking a highly skilled Senior Full-Stack Developer to maintain and scale a unified legal case tracking and AI-driven document review platform. The system orchestrates complex data pipelines: reverse-engineering official eCourts mobile APIs (AES-256), scraping Indian Tribunals (NCLT, NCLAT, SCI), and delivering automated "Legal Review" workflows via a TypeScript/React frontend.

Technical Stack

Backend: Python 3.8+ (FastAPI), PyMuPDF (fitz), BeautifulSoup, pycryptodome (AES-256), ddddocr.
Frontend: TypeScript, React (Next.js App Router), Tailwind CSS.
Database & Infra: Supabase (PostgreSQL with JSONB), Real-time, Storage, RLS.
Communication: WhatsApp Business API (Meta), SMTP (Email).

Key Responsibilities

1. Backend: Scraper & Sync Engineering

Maintain Scrapers: Monitor and repair scrapers for NCLT, NCLAT, SCI, and High Courts (Bombay, Delhi, Gujarat).
Encryption & API: Maintain AES-256 encryption/decryption logic for communication with official app.ecourts.gov.in endpoints.
Sync Orchestration: Optimize case_hearing_sync.py to handle "stale" case recovery and denormalized data mirroring (_mirror_cause_list_entry_to_case) for frontend performance.
AI Agents: Refine legal_review_agent.py logic, including complex data structures like TableComparisonResult.

2. Automation & Reporting

Reporting Pipeline: Manage daily_case_reports.py to generate multi-variant PDFs ("Matters Listed" vs. "Orders Found").
Delivery Logic: Debug and optimize automated Email/WhatsApp delivery triggers for generated reports.

3. Quality Assurance (QA) & Testing

Scraper Validation: Build automated test suites to detect breaking changes on live court websites before data corruption occurs.
Frontend Testing: Conduct regression testing on TypeScript components to ensure legal document rendering remains pixel-perfect.
Integration QA: Verify end-to-end data flow: Scrape → Supabase Sync → Agent Processing → Frontend Notification.

Required Skills

Python Mastery: Advanced scraping (POST/GET flows, session persistence, OCR CAPTCHA solving).
TypeScript/React Pro: Strong experience with Next.js App Router and building clean, data-intensive UIs.
Supabase/PostgreSQL: Expertise in JSONB manipulation, complex SQL migrations, and Row Level Security (RLS).
PDF Processing: Experience extracting structured data from unstructured PDFs using fitz (PyMuPDF).
QA Mindset: Proven ability to build testing frameworks for unstable third-party data sources.

Budget: 15,000/mo for 15h/week