
Cloud Engineer
- Batam, Kepulauan Riau
- Permanen
- kerja tetap
- Manage infrastructure as Code: Develop and maintain automation scripts (Python, Bash, Shell, Go, JavaScript etc.) and tools (GitLab, Hashicorp Terraform, Hashicorp Vault, etc.) to streamline & improve infrastructure deployment, monitoring, and management processes, using Infrastructure as Code (IaC).
- Monitoring & Observability: Establish robust monitoring and logging foundations collaborating with the wider Site Reliability Engineering Function and ensure platform reliability and performance using the standard application performance management tools.
- Security: Implement standard methodologies for security and ensure the platform adheres to industry standards.
- Incident Management: Participate in out of hours on-call rota, help resolve platform related incidents and implement post incident improvements to prevent recurrent issues.
- Collaborate with multi-functional teams in an Agile/Scrum environment to deliver high-quality solutions.
- Automate the deployment, configuration, and maintenance of our OpenShift clusters through creation and deployment of custom pipelines
- Participate in the review of new cluster configurations and pipelines to ensure they achieve their intended result in a non-disruptive manner
- Identify, recommend, and implement upgrades and enhancements required to maintain system functionality and optimal performance
- Document any new services or infrastructure being deployed as well as ensuring that current documentation remains accurate and follows standard methodologies.
- Fix and conduct root cause analysis for Linux and Windows servers.
- Enhance monitoring and logging with Prometheus and ELK stack.
- Ensure compliance with security standards and implement robust monitoring and logging practices.
- 5+ years of hands-on experience working with public cloud platforms (AWS, Azure, or GCP), containerisation technologies (Docker, Kubernetes, Rancher, Fleet, and Cloudbees, etc.), and monitoring solutions (Prometheus, Grafana, ELK, OpenTelemetry, etc.).
- Knowledge of at least one of these programing languages: Python, Go, JavaScript
- Knowledge of at least one of these scripting languages: Bash, PowerShell
- Interest in designing, developing, deploying, and improving microservice infrastructure and related services
- Ability to pick up new software, frameworks, and APIs
- Excellent software design, problem solving, and debugging skills
- Experience using a Linux operating system
- Knowledge of TCP/IP network fundamentals
- Experience with CI/CD tools and methodologies such as: Jenkins, ArgoCD, AWX etc
- Proficiency with IaC tools like Terraform, Ansible, GitLab runners, etc.
- Proficient in fixing issues and experienced in Linux and Windows server environments.
- Strong communication and collaboration skills.
- Problem-solving and critical thinking abilities.
- Develop scripts for automating routine tasks (e.g., Bash, Python, PowerShell).
- Knowledge or experience with CI/CD and tools like: Kubernetes, Ansible, Docker, Git, Artifactory, Jenkins, Puppet, CF Engine, Terraform, Heroku, GitHub Actions
- Knowledge or experience with security tools like: NeuVector, Twistlock, Nessus, Kali, Coverity/Fortify, AppSpider/Netsparker/Burp, metasploit, nmap, CIS Hardening
- Knowledge or experience with microservice platforms like: Rancher Labs, AKS, GKS, EKS
- Knowledge or experience with cloud platforms like: AWS, OpenStack, Azure, Google Cloud
- Knowledge or experience with databases like: MySQL, MongoDB, MSSQL, MariaDB, Oracle, PostgreSQL, Cassandra, HBase, Hive, Amazon RDS, Impala
- Knowledge or experience with development frameworks like: Flask, Django, React, Ionic, Bootstrap, Semantic UI, Foundation, Angular, Polymer, VueJS
- Experience with Software Development Process: including planning, system integration, verification, and validation for a variety of technical solutions
- Experience working with the Agile/Scrum methodology
- Experience administering Windows or Linux operating systems