This global AI manufacturing platform is hiring a full time Site Reliability Engineer in Chicago. You'll design and operate secure, highly available infrastructure in GCP (with multi-cloud exposure), lead Terraform-based Infrastructure as Code, manage Kubernetes/Docker environments, and build CI/CD pipelines (GitHub Actions, ArgoCD) supporting AI/ML workloads. The role focuses on DevSecOps, compliance in regulated industries (Aerospace & Defense), observability (Prometheus/Grafana/ELK), automation in Python or Go, and incident response ownership for U.S. operations.
This is a high-ownership role for a security-first engineer who wants real impact. You'll help define U.S. infrastructure strategy, influence compliance and reliability standards, and collaborate with a global engineering team in a fast-scaling environment. Strong growth opportunity, meaningful technical scope, and clear visibility into leadership.
Required Skills & Experience