Advanced Kubernetes SRE: Microservices Deployment & Observability

Client: AI | Published: 10.11.2025
Budget: $250

I am looking for a freelance expert DevOps/SRE engineer to build a resilient, secure, and highly available platform for three microservices (Node.js, Go, Python) on Kubernetes. The project must cover the full SRE lifecycle, from architecture design to failure simulation.

Key Project Deliverables & Requirements:

Architecture Design: Provide a clear architecture diagram detailing service communication (API, Auth, Image Store) and external dependencies (DB, S3, etc.).

Deployment & Security: Create Dockerfiles and push the images to a private registry. Implement robust Network Policies for pod isolation. Manage credentials securely with a secrets management solution. Set up Ingress with TLS (Let's Encrypt or self-signed).

Advanced Monitoring (Observability): Implement the Prometheus & Grafana stack. Configure custom metrics for each service. Integrate Alertmanager with an external notification tool (e.g., Slack).

High Availability & Scaling: Configure the Horizontal Pod Autoscaler (HPA) based on appropriate metrics (CPU, requests). Define realistic liveness and readiness probes. Use a PodDisruptionBudget (PDB) to guarantee minimum availability during maintenance.

Failure Simulation & Documentation: Design and execute failure scenarios (e.g., database downtime, sudden traffic spike). Document the system's response (logs, alerts, recovery).

Final Delivery: A comprehensive README, all YAML/code files, the architecture diagram, and a video or report documenting the failure tests.
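
To illustrate what "realistic" probes and custom metrics could look like for the Python service, here is a minimal, standard-library-only sketch. The names ServiceState, liveness, readiness, render_metrics, and the metric http_requests_total are hypothetical and not part of the requirements. A real implementation would more likely use the official prometheus_client library and an HTTP framework. The idea is that liveness ignores dependencies (so a database outage does not cause restart loops), readiness fails while the database is unreachable, and metrics are rendered in the Prometheus text exposition format.

```python
from dataclasses import dataclass, field


@dataclass
class ServiceState:
    """In-memory state for one hypothetical service (illustrative only)."""
    db_reachable: bool = True
    requests_total: dict = field(default_factory=dict)  # path -> request count

    def record_request(self, path: str) -> None:
        # Increment the per-path request counter.
        self.requests_total[path] = self.requests_total.get(path, 0) + 1


def liveness(state: ServiceState) -> int:
    # Liveness only answers "can the process respond at all?".
    # It deliberately ignores the database, so an outage there
    # does not make Kubernetes restart an otherwise healthy pod.
    return 200


def readiness(state: ServiceState) -> int:
    # Readiness reflects dependencies: while the database is down the pod
    # reports 503 and is taken out of the Service endpoints.
    return 200 if state.db_reachable else 503


def render_metrics(state: ServiceState) -> str:
    # Render counters in the Prometheus text exposition format.
    lines = [
        "# HELP http_requests_total Total HTTP requests handled.",
        "# TYPE http_requests_total counter",
    ]
    for path in sorted(state.requests_total):
        count = state.requests_total[path]
        lines.append(f'http_requests_total{{path="{path}"}} {count}')
    return "\n".join(lines) + "\n"
```

In the database-downtime scenario, the expected behaviour under these assumptions is that readiness returns 503 while liveness stays at 200, and the request counter exposed by render_metrics could serve as the basis for a request-based HPA metric.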