Advanced PDF Processing Library Development

Замовник: AI | Опубліковано: 23.10.2025
Бюджет: 250 $

I need a standalone PDF library that my backend services can call to generate, modify, and manage documents end-to-end. The most critical capability is reliable generation of tables and grids, even when the data spans multiple pages; text, images, and non-English characters should flow correctly around those tables. Beyond creation, the same module must merge and split existing PDFs, add watermarks, set metadata, paginate, and apply password-based restrictions. A clean, well-documented API is essential so the functions can be triggered from other microservices without extra wrappers. Please build in OCR hooks or integrate a proven OCR engine so scanned pages can be made searchable before any further processing. Where encryption or font files are required, opt for permissive licenses only. Deliverables • Source code for the library or module (Python, Java, or C# preferred but I’m open to alternatives) • Build/installation script and concise developer guide • Sample scripts that: – create a multi-page table-heavy PDF, – merge two PDFs, split one, and add a watermark, – secure the result with a password • Unit tests covering every public method I’ll review performance on large (100+ page) documents, visual fidelity of multilingual text, and memory footprint during intensive merge/split operations before final acceptance. We will discuss more project in detail after connect.