Lead Site Reliability Engineer


Lead Site Reliability Engineer

Be part of something bigger

Nextuple is a technology company with deep domain experience in Retail and Supply Chain. Nextuple brings an innovative approach and solution to Ecommerce operations and fulfillments, focusing on quality, efficiency, and consistency for customers and partners. Nextuple is rapidly expanding in its digital offerings and is in need of Lead Site Reliability Engineer to join the fast growing, customer facing digital applications space. Learn more at www.nextuple.com.

Location: Bangalore

Job requirements

● Gather and analyze metrics from both infra and applications perspective to assist in performance tuning and fault finding
● Partner with development teams to improve services through smoke testing and release procedures
● Ensure service availability and system health through automation and scripting wherever applicable. Monitor production uptime.
● Define and collect metrics to track SLO and SLAs
● Ensure proactive identification of issue in production through log analysis from functional perspective
● Define and automate Production Performance metrics gathering
● Monitor and Review production system access control and logs
● Assist in DR Setup and Own the periodic DR testing
● Ensure health of automatic backup / restore of production database
● Use best practices in monitoring and reporting production health
● Plan and execute system availability and health during customer peak seasons
● Ensure Certificate rotation on time.

● Mentor and guide SRE team members

Essential Skills & Qualifications

● minimum 8 years of SRE experience, preferably in a Software product company
● Expertise in any one scripting language such as Shell, Python
● Expertise in one NoSql database preferably Mongodb
● Expertise on Production monitoring and SRE process
● A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
● Expertise in log analysis using leading tools preferably ELK/Elastic stack
● Good knowledge of Linux
● Expertise on monitoring tools preferably Grafana/Prometheus
● Exposure to DevOps practices and technologies will be an added advantage
● Exposure performance engineering processes and tools will an added advantage

Desirable Skills

● Has familiarity with the retail industry domain.
● Demonstrates attention to details.
● Has the ability to work in a lightweight process environment.
● Is self-motivated and takes ownership of tasks and executes.

Soft Skills

● Strong interpersonal communications skills.
● Mentor junior team members
● Demonstrates attention to details.
● Has the ability to work in a lightweight process environment.
● Is self-motivated and capable of high performance in a periodic iteration cadence.
● Takes ownership of tasks and executes.
● Collaborate effectively with a distributed team.
● Mentor and develop junior talent
● Identify skill gaps for the junior team members and recommend certifications/trainings to address them

To apply for this opportunity, Kindly share your resume to careers@nextuple.com

Apply Now