I need an automated data-entry solution that can pull mixed numeric and text information directly from PDF files and push it into my existing spreadsheet/database workflow. Here’s the key scope: • Build a script (Python, PowerShell, or another language you recommend) that reads PDFs and reliably captures both numbers and free-form text. • Map the extracted fields to a clear structure (rows/columns) so I can import the results without manual cleanup. • Provide straightforward setup instructions and a brief demo run so I can verify everything works on my machine. If any external libraries or OCR tools are required, include them in your guide and package list. Accuracy and repeatability are more important to me than raw speed, so please comment your code for easy future tweaks.