Principal DevOps Engineer

Bengaluru, India
Full-time
Experience: 6–10 years

About iMerit

iMerit is a global leader in AI data solutions, trusted by the world's most innovative companies to power mission-critical AI initiatives. Our platforms, Ango Workflow Automation and 3D Point Cloud Multi-Sensor Fusion, power data pipelines for some of the world's most advanced Autonomous Vehicle (AV), Robotics, and Mobility programs. We are building a next-generation annotation platform designed for AV and ADAS applications. It unifies large-scale data ingestion, workflow orchestration, AI-assisted annotation, and high-performance visualization into a seamless, real-time web environment that handles complex multi-sensor and multi-modal data sources.

About the role

As Principal DevOps Engineer, you will own the infrastructure, reliability, and automation strategy for our distributed, real-time data annotation platform. You will architect and implement multi-tenant Kubernetes environments optimized for GPU workloads, with full CI/CD, observability, and DR readiness, ensuring secure, high-availability deployments for enterprise-grade real-time data and ML pipelines.

Key Responsibilities

  • Architectural Ownership: Design and lead the implementation of highly available, scalable, and secure cloud infrastructure (preferably AWS) underpinning our microservices.

  • Kubernetes & IaC Mastery: Define the strategy for multi-cluster Kubernetes (K8s) environments and manage them day to day. Lead the adoption of Infrastructure-as-Code (IaC) tools like Terraform or CloudFormation to manage the infrastructure lifecycle.

  • Distributed Systems Reliability: Implement and manage Kafka brokers and streams, ensuring high throughput, fault tolerance, and low latency for the workflow engine and data pipelines.

  • CI/CD & Automation: Develop, optimize, and enforce zero-downtime deployment strategies via advanced CI/CD pipelines (e.g., ArgoCD, Jenkins, GitLab CI).

  • Observability & SRE: Establish a comprehensive SRE culture. Define, track, and report on Service Level Objectives (SLOs) and FinOps metrics. Implement advanced monitoring, logging, and alerting with tools like Prometheus, Grafana, and the ELK stack, aligned with p95 and p99 latency targets.

  • Security & Compliance: Ensure compliance (SOC 2/ISO) and security hardening across the entire infrastructure, including network policies, tenant isolation, RBAC within K8s, and secrets management and rotation (e.g., Vault).

  • Resilience: Design and rigorously test the Disaster Recovery (DR) and business continuity plan for the final production environment.

Qualifications

  • 6–10 years of professional experience in DevOps, SRE, or Cloud Infrastructure Engineering.

  • Expert-level proficiency with Kubernetes (core components, networking, security, scaling, custom controllers).

  • Mastery of Infrastructure-as-Code (IaC), specifically Terraform.

  • Deep, hands-on experience with distributed message brokers, primarily Kafka.

  • Strong background in cloud platforms (preferably AWS) and networking concepts.

  • Excellent scripting skills (e.g., Python, Go, Bash) and experience managing Git repositories and applying GitOps principles.

  • Proven ability to lead projects, mentor junior engineers, and drive technical decision-making within the DevOps domain.

Why Join Us

At iMerit, you will architect and own the infrastructure foundation that powers mission-critical AI data pipelines for the world's most advanced autonomous systems. You will design multi-tenant, GPU-optimized Kubernetes environments that enable real-time data annotation at enterprise scale. Your work will directly impact the reliability, security, and performance of platforms trusted by global industry leaders. Join us to build the infrastructure backbone that supports the next generation of autonomy and intelligent perception.