Function Description / Professional Tasks
We’re seeking a skilled MLOps Engineer to build and sustain a platform that turns data‑science ideas into production‑ready services. You’ll be the bridge between experimentation, deployment, and operations, ensuring models run reliably, securely, and cost‑effectively at scale.
What You Can Do:
− Design & evolve a reusable ML pipeline architecture that supports training, validation, and serving across teams.
− Automate model delivery pipelines, from code commit to production endpoint, leveraging continuous − integration / continuous deployment (CI/CD) practices with GitLab CI/CD and Kubernetes‑centric orchestration via ArgoCD for declarative application management.
− Collaborate with data scientists to translate prototypes into reproducible, production‑grade artifacts.
− Maintain a robust model lifecycle system: versioning, performance monitoring, drift detection, and rollback procedures.
− Optimize resource utilization and cost across the organization’s cloud footprint while meeting SLAs.
− Enforce data governance, privacy, and security standards throughout the ML platform stack.
− Build tooling for feature management, preprocessing, and evaluation that can be reused by multiple projects.
− Diagnose and resolve production incidents, working cross‑functionally to root‑cause and prevent recurrence.
What You Can Do:
− Design & evolve a reusable ML pipeline architecture that supports training, validation, and serving across teams.
− Automate model delivery pipelines, from code commit to production endpoint, leveraging continuous − integration / continuous deployment (CI/CD) practices with GitLab CI/CD and Kubernetes‑centric orchestration via ArgoCD for declarative application management.
− Collaborate with data scientists to translate prototypes into reproducible, production‑grade artifacts.
− Maintain a robust model lifecycle system: versioning, performance monitoring, drift detection, and rollback procedures.
− Optimize resource utilization and cost across the organization’s cloud footprint while meeting SLAs.
− Enforce data governance, privacy, and security standards throughout the ML platform stack.
− Build tooling for feature management, preprocessing, and evaluation that can be reused by multiple projects.
− Diagnose and resolve production incidents, working cross‑functionally to root‑cause and prevent recurrence.