Informations de base
Ref Number
Site principal
D'autres sites
Pays
Types d'emploi
Work Style
Description et exigences
Key Responsibilities
Kubernetes Cluster Management & Design:
Lead the design, deployment, and ongoing maintenance of highly available and scalable Kubernetes clusters in both cloud (e.g., AWS, GCP, Azure) and/or on-premises environments.
Develop, manage, and optimize Helm charts, Kubernetes manifests, and automation scripts to support the efficient deployment and operation of microservices architectures.
Performance & Optimization:
Continuously monitor, analyze, and optimize cluster performance, scalability, and availability to ensure peak operational efficiency.
Conduct capacity planning and resource management to support evolving application needs.
CI/CD & Automation:
Implement, maintain, and enhance robust CI/CD pipelines utilizing industry-leading tools such as Jenkins, GitLab CI, ArgoCD, or Tekton, driving automation from code commit to production deployment.
Collaborate closely with development teams to containerize applications effectively and ensure smooth, automated deployments.
Manage and automate core infrastructure components to improve reliability and reduce manual effort.
Security & Best Practices:
Implement and enforce stringent security best practices within container environments, including RBAC (Role-Based Access Control), secrets management, network policies, and vulnerability scanning.
Champion DevSecOps principles throughout the container lifecycle.
Troubleshooting & Observability:
Proactively troubleshoot complex production issues within Kubernetes environments and perform thorough root cause analysis to prevent recurrence.
Keep abreast of the latest advancements in the Kubernetes ecosystem (e.g., service mesh, ingress controllers, observability tools like Prometheus, Grafana, ELK stack, Jaeger/Zipkin) and advocate for their adoption where beneficial.
Collaboration & Mentorship:
Collaborate effectively with cross-functional teams, including developers, QA, and security, to foster a culture of shared responsibility and operational excellence.
Provide technical guidance and mentorship to junior team members on Kubernetes and containerization best practices.
Required Skills and Experience
Overall Experience: 7+ years of progressive experience in DevOps, Site Reliability Engineering (SRE), Infrastructure Engineering, or similar roles.
Kubernetes Production Experience: 3+ years of hands-on experience managing and operating Kubernetes clusters in production environments.
Containerization Mastery: Strong expertise in Docker, containerization principles, and container orchestration concepts.
Cloud Platforms: Proficiency with major cloud platforms and their managed Kubernetes services (e.g., AWS EKS, GCP GKE, Azure AKS).
CI/CD & Automation: Demonstrated experience implementing and optimizing CI/CD pipelines and integrating automated testing frameworks.
Networking Fundamentals: Deep understanding of core networking concepts, including load balancing, DNS, firewalls, and network policies within cloud-native and Kubernetes environments.
DevSecOps & Security: Solid grasp of DevSecOps principles and proven experience in implementing security best practices and ensuring compliance within containerized infrastructure.
Scripting & Automation: Strong scripting skills (e.g., Python, Bash, Go) for infrastructure automation and managing Kubernetes.
Problem-Solving: Excellent troubleshooting, debugging, and root cause analysis skills for complex distributed systems.
Description supplémentaire du poste
EEO Statement