We are seeking a Site Reliability Engineer II (Cloud Engineer) with the right combination of software engineering skills, experience with cloud services at scale, and passion for quality. The person in this role will support the engineering team at all stages of product development, from design and architecture to deployment and production support, to ensure the level of reliability, availability, and performance required by the customers of our advanced physical security solutions.

Responsibilities

Triage, debug, and fix issues in our production system, which may involve the full stack from front-end and application microservices to databases and networking.
Use monitoring tools to identify performance, availability, and cost concerns.
Automate monitoring, alerting, and scaling to proactively identify and resolve issues.
Collaborate with engineering teams to review the architecture and implementation of new features to ensure resilience and scalability.
Design and implement solutions to eliminate technical debt related to reliability, complexity, and observability.
Coordinate incident response and root cause analyses.
Participate in the engineering support on-call rotation.
Develop troubleshooting processes, tools, and documentation for production support.
Operate in a Continuous Delivery environment.
Build operational maturity through automation.

Qualifications

BS or MS degree in Computer Science, IT, or a related field (required).
3+ years of experience managing and maintaining large-scale SaaS platforms.
Experience developing or maintaining software in at least one language (JavaScript, C#, Python, Java, etc.).
Familiarity with shell scripting (PowerShell, Bash, etc.).
Experience with log aggregation and analysis frameworks (ELK, Azure Log Analytics, Splunk, etc.).
Understanding of Git or similar version management tools.
Proficiency in English.
Passion for automation.
Intrinsic curiosity to identify patterns and to understand the inner workings of complex systems and how issues emerge in production environments.
Excellent problem-solving and software troubleshooting skills; able to diagnose problems quickly and accurately in a live, continuous-delivery environment.
Enjoys working on a collaborative team.

What We Value

Experience developing and maintaining applications on Microsoft Azure, AWS, or similar cloud-based platforms.
Experience with PowerShell, Node.js, Kusto, or Groovy.
Experience with NoSQL databases (MongoDB, Cosmos DB, DynamoDB, Redis, or similar).
Knowledge of both serverless and container-based application deployment models.
Experience with CI/CD tools.
Site Reliability Engineer II (Cloud Engineer)
HONEYWELL
Mexico, Mexico
Posted 7 days ago