Responsibilities
- Manage and maintain AWS cloud environments, including provisioning, configuration, and ongoing optimization of resources
- Develop and manage Infrastructure as Code (IaC) using Terraform to enable scalable and automated cloud operations
- Design and enforce resource tagging strategies to support cost allocation, governance, and operational visibility
- Partner with cross-functional teams to optimize cloud infrastructure for performance, security, and cost efficiency
- Monitor cloud usage and spending, identify cost optimization opportunities, and implement FinOps best practices
- Ensure cloud environments comply with organizational policies and relevant industry standards
- Apply cloud security best practices, including identity and access management, encryption, and network security controls
Requirements
- Hands-on experience with AWS services, including EC2, S3, RDS, SQS, Lambda, EKS, MSK, and IAM
- Strong experience with Kubernetes, particularly Amazon EKS
- Familiarity with observability and monitoring tools such as CloudWatch, Prometheus, and Datadog
- Proficiency in Infrastructure as Code (IaC), especially Terraform, including writing and maintaining reusable configurations
- Experience working with Kafka, preferably AWS MSK
- Solid understanding of cloud cost management and FinOps principles, with a track record of implementing cost optimization strategies
- Experience with on-call rotations, incident response, and troubleshooting in production cloud environments
- Strong communication and collaboration skills, with experience working in distributed or remote teams
- Experience building and maintaining CI/CD pipelines for applications written in TypeScript, Kotlin, and Ruby
- Awareness of compliance frameworks (e.g., HIPAA, SOC 2) and experience applying security and compliance controls in cloud environments
