- Proficiency in Infrastructure as Code (IaC) tools like Terraform, Ansible, or CloudFormation to automate infrastructure provisioning and management.
- Strong experience with major cloud platforms like Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP).
- Expertise in container technologies such as Docker and container orchestration tools like Kubernetes.
- Knowledge of CI/CD pipelines and tools like Jenkins, GitLab CI/CD, Travis CI, etc.
- Familiarity with monitoring tools such as Prometheus, Grafana, ELK stack (Elasticsearch, Logstash, Kibana), and APM solutions.
- Ability to identify performance bottlenecks, optimize resource utilization, and enhance application response times.
- Proficiency in scripting languages (Python, Bash, etc.) and programming languages (Java, Go, etc.).
- Familiarity with chaos engineering, fault injection, and other resilience testing methodologies.
- Proficiency in using version control systems like Git for managing code.
- Experience with automating routine operational tasks to increase efficiency and reduce manual intervention.
- Ability to manage relationships with third-party vendors providing SRE-related tools and services.
- Skill in creating comprehensive technical documentation for systems, processes, and procedures.
- Excellent problem-solving and troubleshooting skills.
- Excellent communication and collaboration skills to work effectively with cross-functional teams.
- Proven leadership and team management skills.
- Relevant certifications in cloud platforms and automation tools are a plus.
- Well-versed in interpreting business requirements and translating them into security architecture decisions.
- Experience with various enterprise applications and IT services, as well as software development, compliance and security, and IT operations disciplines.
- Experience proposing competitive, innovative technical decisions and driving value through the adoption of cloud solutions.