Designs and maintains cloud infrastructure for healthcare applications, ensuring availability, security, and compliance.
Lead the team's response to critical incidents and drive improvements for cloud infrastructure stability and reliability.
Build and orchestrate large, distributed infrastructure with a focus on automation.
Build and orchestrate Modern OTEL-based Observability Platform
Building, changing, and maintaining cloud infrastructures in Azure – compute, data, and networking.
Supporting large scale cloud services and distributed systems
Deploy and operate observability platforms for logging, metrics, and distributed tracing.
Create a hybrid infrastructure integrating edge devices, on-premises, and cloud resources.
Exercise state-of-the-art SRE practices throughout the company.
Build out and maintain an observability platform to ensure the reliability, availability, and performance of critical systems.
Lead technical initiatives for automating system engineering efforts to guarantee the reliability of global Elastic infrastructure.
You'll help build our Integration Partner Ecosystem - building an API driven ecosystem that provides easy connectivity to our customers
Build, scale, and monitor reliable systems using Java, Python, MySQL, NSQ, Hbase, AWS, and Kubernetes.
Build and scale reliable systems, collaborate with cross-functional teams.
Work on a tech-giant scale with smaller, supportive teams where every engineer has the chance to make an impact.
Collaborate, drive, and execute architectural discussions with cross-functional teams.
Design, develop, deploy, maintain, and optimize the platform that powers an application.
Scale Kubernetes clusters, automate bare-metal bring-up, and build software abstractions.
Operate distributed LLM inference and large GPU clusters worldwide.