Site Reliability Engineer (SRE) Responsibilities Design, implement, and maintain scalable and highly available infrastructures. Monitor and ensure the performance and reliability of production systems. Implement automation for recurring tasks and operational processes. Collaborate with development teams to improve continuous delivery and codedeployment. Respond to incidents and conduct post-mortem analysis to prevent future issues. Optimize resource usage and manage system capacity. Requirements Experience in a similar role. Knowledge of Unix/Linux operating systems. Experience with monitoring and log management tools (Prometheus, Grafana, Splunk, ELK stack). Scripting and automation skills (Python, Bash, Go, Shell). Experience with cloud platforms (AWS, GCP, Azure). Knowledge of containers and orchestration (Docker, Kubernetes). Familiarity with CI/CD tools (Jenkins, GitLab CI/CD, CircleCI). Experience in configuration management (Ansible, Puppet, Chef). Knowledge of SQL and NoSQL databases (MySQL, PostgreSQL, MongoDB). Experience with cloud storage (S3, Google Cloud Storage). Familiarity with security tools (Vault, OSSEC, or any SIEM). Experience with infrastructure as code (Terraform, CloudFormation). Knowledge of networking and load balancing (Nginx, HAProxy, F5). Experience with messaging and data flow systems (Apache Kafka). Problem-solving skills and ability to work under pressure. Excellent communication and teamwork skills. Good written and oral English language skills. Benefits A dynamic and collaborative work environment. Opportunities for professional growth and development. Flexible work arrangements and remote work possibilities. Competitive salary. #J-18808-Ljbffr
Site Reliability Engineer (Sre)
VIRTUALENT
pueblo santiago acahualtepec, pueblo santiago acahualtepec
Publicado hace 15 días
Denunciar empleo