Operations Engineer I

Operations Engineer I

1-4 years
Not Specified

Job Description

Job Description :
Operations Engineer (Service Reliability Engineer (SRE)) 1
About the Role
Operations Engineers also called as Service Reliability Engineer at Flipkart are developers with excellent operations mindset. As a Platform OE you will be responsible for
availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for the platforms and services provided by Flipkart
You will be responsible for making sure that our platforms and applications are highly available and Service Level Agreements (SLA) are met. You will own all the SLIs and SLOs of the services. You will work directly onscrum teams with our Software Development Engineers using their interest in operationsand development skills to ensure new features follow SRE best practices and are
supportable. You will be responsible for solving greenfield problems in automation and
benchmarking at scale.
What You'll Do
.Keep the Platforms up and running to meet their availability and reliability SLAs.
.Build and improve configuration and automation tools to remove manual steps in deploying, upgrading etc..
.Monitor and resolve issues in all environments. Ensure SLA/SLO and uptime are met.
.Alert appropriately, help build self-healing capabilities in the platforms, involve people when needed, and log tickets.
.Participate in a 24x7 on-call rotation.
.Cover availability, reliability, security etc. considerations being imbibed and reviewed and adhered to at every stage of product development.
.Contribute to the RCA lifecycle for the platform issues, be answerable to the internal stakeholders on most of the service internals.
What You'll Need
.BTech or MTech in CS or equivalent with 2+ years working w/ highly available platforms in
web-scale organizations. Experience of at up to one year as a developer is good to have .Good troubleshooting skills of always available and high scale systems.
.Should have the ability to effectively collect all the relevant data-points and debugging artefacts/snapshots so that the debugging at a later stage by other SREs or devOps engineers can be aided.
.Intermediate to expert level knowledge of at least one configuration management system (Ansible, Puppet, etc.). Mid to advance level knowledge of Python, Go etc.
.Understanding of standard networking basics such as: HTTP, DNS, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing, DB basics etc. Understand and use CI/CD effectively to drive agility. .
.Good written and verbal communication skills.

Job Details

Employment Types:




About Flipkart

Flipkart is India’s largest e-commerce marketplace with a registered customer base of over 150 million. In the 10 years since we started, Flipkart has come to offer over 100 million products across 120+ categories including Smartphones, Books, Media, Consumer Electronics, Furniture, Fashion and Lifestyle. Launched in October 2007, Flipkart is known for its path-breaking services like Cash-onDelivery, No-Cost-EMI and 10-day replacement policy. Flipkart was the pioneer in offering services like In-a-Day Guarantee (65 cities) and Same-Day-Guarantee (13 cities) at scale. With over 1,20,000 registered sellers, Flipkart has redefined the way brands and MSME’s do business online.

Similar Jobs

People Also Considered

Data Not Available

Career Advice to Find Better

Simple body text this will replace with orginal content