Principal Engineer – Platform Engineering, Site Reliability
Job Description:
• Design, deploy, and manage scalable cloud solutions on AWS public cloud platform via Infrastructure as Code.
• Manage infrastructure as code (IaC) leveraging Terraform, CloudFormation and GoLang.
• Design and implement Kubernetes-based platform solutions with focus on scalability, reliability, and security.
• Support and maintain large Kubernetes clusters in production environments.
• Implement security best practices and ensure compliance with industry standards and regulations.
• Work closely with development, operations, and security teams to integrate infrastructure as code practices.
• Develop automation to build and deploy Docker Containers through CI/CD pipelines for engineering teams deploy and test services.
• Write policy & standard validation tests and integrations with Security Scanning software to ensure compliance.
• Implement and support Observability solutions to ensure platform performance, reliability, and scalability.
• Create Dashboards and integrate into Backstage IDP for visibility into system health.
• Provide guidance and mentorship to team members on best practices in GitOps, CI/CD, and infrastructure management.
• Work closely with development, operations, and security teams to integrate infrastructure as code practices across the organization.
Requirements:
• 10+ years of experience in cloud & platform engineering, DevOps, developer enablement, or infrastructure roles with increasing responsibility.
• Expert-level proficiency with AWS cloud services especially in compute, storage, networking, implementing cloud native services, and developing resilient infrastructure architecture.
• Strong software engineering skills and methodologies.
• Deep Knowledge of AWS EKS or running multiple Kubernetes instances, VPC Configuration, AWS Waf, S3, RDS, and other cloud services.
• Experience with ArgoCD, Argo Workflows, CircleCI, or GitHub Actions.
• Experience with Crossplane a plus.
• Strong hands-on experience with developing golden paths for engineers and managing CI/CD pipelines leveraging GitOps workflows for Kubernetes application delivery.
• Deep expertise in Infrastructure as Code using Terraform for multi-cloud resource provisioning and full stack deployments.
• Proven experience with Kubernetes infrastructure management and service mesh technologies like Istio and API gateway solutions.
• Strong understanding of Kubernetes architecture, operators, custom resources, and ecosystem tools.
• Advanced programming skills in GoLang and Python for building platform tooling and automation.
• Experience with Grafanna, OTEL, and Backstage.
• Demonstrated ability to lead technical initiatives and influence architectural decisions across engineering teams.
• Excellent communication and collaboration skills with both technical and non-technical.
• Background in SRE or Software Engineering practices and principles.
• Bachelor's Degree in Computer Science or related discipline, or relevant experience.
• AWS certifications (Solutions Architect Professional, DevOps Engineer Professional).
Benefits:
• An inclusive culture strongly reflecting our core values: Act Like an Owner, Delight Our Customers and Earn the Respect of Others.
• The opportunity to make an impact and develop professionally by leveraging your unique strengths and participating in valuable learning experiences.
• Highly competitive compensation, benefits and rewards programs that encourage you to bring your best every day and be recognized for doing so.
• An engaging, people-first work environment offering work/life balance, employee resource groups, and social events to promote interaction and camaraderie.
Apply tot his job
Apply To this Job