Staff Site Reliability Engineer

8-11 years

Job Description

Staff - DevOps / Site Reliability Engineering
VMware is looking for talented candidates to fill a Site Reliability Engineer (SRE) position in the End-User Computing (EUC) R&D Organization.
VMware End-User Computing runs the world's largest Digital Workspace Platform - Workspace ONE. We are in the business of helping customers make Business Mobility a reality: transforming their high-demand applications, building and leading the next-generation desktop, and helping industries shift core business processes and operations to achieve what is only possible in today's mobile environment. With over 60,000 customers around the globe, our End-User Computing team is helping companies deliver work at the speed of life, and our technologies are leading what happens next for users across the enterprise.
Team Responsibility
The SRE team drives the delivery of the core applications, services, and infrastructure that make up our global SaaS offering. The team improves service reliability and efficiency through automation, Infrastructure as Code, observability, and other software solutions, in collaboration with other R&D groups within VMware. The Workspace ONE Cloud Services team uses these tools, among others, to ensure the continuity and reliability of our enterprise service offerings and operates 24/7/365.
Role Responsibility
The Staff Engineer will be responsible for automating and maintaining the delivery of SaaS services and monitoring solutions hosted and consumed by the Cloud Services team and our customers. This includes the automation of core application services and associated infrastructure services. You will also be required to enable other product engineering teams to drive towards automated problem resolution. On the observability side, we help service owners define and instrument SLOs and alerts that follow best practices, build tools and dashboards, facilitate postmortems, and continuously enhance our existing systems and processes to improve the reliability of the SaaS offering.
You should have a strong DevOps-oriented mindset, be willing to take on challenges, maintain a high degree of ownership and transparency, and work effectively both on a team and independently.
The SRE role is a great fit for engineers who want to own production solutions while getting hands-on with a wide variety of the latest open-source technologies, and who love to push the boundaries of what cloud infrastructure software, observability, and tooling can achieve.
The responsibilities will include but not be limited to:
  • Provide technical leadership and direction for the team.
  • Drive the improvement of development practices and tooling, and review designs and improvements to our code base.
  • Mentor and coach engineers, seek opportunities for continuous improvement, and champion engineering best practices, tooling, and efficiency.
  • Develop, configure, and maintain CI/CD pipelines and tools to facilitate rapid deployment of code and faster testing of new features and products.
  • Ensure fast and reliable delivery of the automation code to production datacenters, with an emphasis on code quality through code reviews, a CI process for the automation code, and unit tests where applicable.
  • Work closely with internal software engineering teams to improve the availability and observability of services, and design and develop deployment automation pipelines for new cloud service offerings.
  • Manage multiple types of infrastructure services efficiently, leveraging practices for configuration management, Infrastructure as Code, and auto-remediation.
  • Work under pressure in production environments running customer workloads and services, following escalation, notification, and incident management practices.
  • Drive the product towards higher availability and reliability, and assist with on-call support on a rotating schedule for incident escalations.

Required Skills
  • 8+ years of industry experience and 5+ years of relevant experience in a Development, SaaS Operations, Site Reliability, or comparable Cloud Engineering position, with a demonstrated track record of execution and delivery
  • Excellent system design and development skills
  • Proven development and scripting background with experience in the following languages: Python, PowerShell, Bash
  • Proficiency with Ansible or similar configuration management tools such as Chef or Puppet
  • Experience deploying, provisioning, and administering production SaaS systems with Windows and Linux based servers
  • Experience with Jenkins or similar build automation tools and CI/CD orchestration tools
  • Experience with observability solutions such as third-party monitoring, logging, and remediation services (e.g., Wavefront, Nagios, SolarWinds, Elasticsearch, StackStorm)
  • Experience with microservice architecture and container technologies such as Kubernetes or Docker Swarm
  • Experience with on-premises and cloud-based infrastructure and services such as vSphere products, VMware Cloud on AWS, and AWS, and with network software such as BIG-IP, DYN, and Route53
  • Proven ability to handle multiple complex technical projects, and the flexibility to work in a very dynamic environment
  • BS in Computer Science or a related technical field

Preferred Skills
  • Experience with deployment frameworks such as Terraform or CloudFormation
  • Experience with basic networking and network security
  • Working knowledge of relational database systems (RDBMS), including SQL Server

VMware Company Overview: At VMware, we believe that software has the power to unlock new opportunities for people and our planet. We look beyond the barriers of compromise to engineer new ways to make technologies work together seamlessly. Our cloud, mobility, and security software form a flexible, consistent digital foundation for securely delivering the apps, services and experiences that are transforming business innovation around the globe. At the core of what we do are our people who deeply value execution, passion, integrity, customers, and community. Shape what's possible today at
Equal Employment Opportunity Statement: VMware is an Equal Opportunity Employer and Prohibits Discrimination and Harassment of Any Kind: VMware is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at VMware are based on business needs, job requirements and individual qualifications, without regard to race, color, religion or belief, national, social or ethnic origin, sex (including pregnancy), age, physical, mental or sensory disability, HIV status, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, past or present military service, family medical history or genetic information, family or parental status, or any other status protected by the laws or regulations in the locations where we operate. VMware will not tolerate discrimination or harassment based on any of these characteristics. VMware encourages applicants of all ages. VMware will provide reasonable accommodation to employees who have protected disabilities consistent with local law.
