End-to-End Data Science Pipeline

Client: AI | Published: 19.04.2026
Budget: $1,500

I have a dataset that is growing quickly and a business case ready for machine-learning insights. What I need now is someone who can take the project from raw data all the way to a production-ready model: that means designing the data pipeline, choosing and training the model, fine-tuning for accuracy and latency, then packaging everything for a reliable, scalable deployment.

Here is what success looks like for me:

• A repeatable pipeline (Python, SQL, Airflow or equivalent) that ingests the source data, performs cleaning / feature engineering, and stores curated tables.
• Well-documented training notebooks or scripts using standard frameworks (scikit-learn, TensorFlow or PyTorch), together with clear evaluation reports.
• Fine-tuning that pushes performance to an agreed metric benchmark, accompanied by hyper-parameter and experiment tracking (e.g., MLflow).
• Containerised deployment (Docker, preferably orchestrated with Kubernetes or a managed cloud service on AWS or GCP) exposing a REST or gRPC endpoint, plus simple monitoring hooks.

All code should sit in a version-controlled repo with a straightforward README so another engineer can reproduce every step. Automated tests for critical functions and a brief hand-over call will round off the engagement.

If this full-stack data science path, from pipeline through to live inference, is your comfort zone, I'd like to hear how you would approach it and the timelines you foresee.
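To make the first deliverable concrete, here is a minimal, stdlib-only sketch of the kind of repeatable ingest → clean → featurize pipeline step described above. All function names, column names, and the CSV source are illustrative assumptions, not part of the actual dataset; a real implementation would read from the production source and write curated tables via Airflow tasks.

```python
import csv
import io


def ingest(raw_csv: str) -> list[dict]:
    """Parse raw CSV text into row dicts (stand-in for the real data source)."""
    return list(csv.DictReader(io.StringIO(raw_csv)))


def clean(rows: list[dict]) -> list[dict]:
    """Drop rows with a missing 'amount' and normalise types and casing."""
    out = []
    for r in rows:
        if r.get("amount", "").strip():
            out.append({
                "user": r["user"].strip().lower(),
                "amount": float(r["amount"]),
            })
    return out


def featurize(rows: list[dict]) -> dict:
    """Aggregate per-user totals as a toy 'curated table'."""
    totals: dict = {}
    for r in rows:
        totals[r["user"]] = totals.get(r["user"], 0.0) + r["amount"]
    return totals


def pipeline(raw_csv: str) -> dict:
    """Compose the three stages into one repeatable run."""
    return featurize(clean(ingest(raw_csv)))


raw = "user,amount\nAlice,10\nBOB,5\nAlice,\nbob,2.5\n"
print(pipeline(raw))  # {'alice': 10.0, 'bob': 7.5}
```

Each stage is a pure function, so the same steps map cleanly onto Airflow tasks later, and each one can be unit-tested in isolation, which matches the "automated tests for critical functions" requirement.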