Job Description
• Develop and maintain tooling used for environment monitoring and task automation
• Identify application reliability and availability improvements and build solutions to drive an improved experience
• Analyze and establish efficient configurations for software and servers, DB connections, indexes, drivers, etc.
• Coordinate with development teams, technical and non-technical Partners and clients to maintain wide knowledge on dependencies of the critical business transaction including platform, services and tools
• Monitor internal and vendor service level objectives (SLOs) and agreements (SLAs); identifies and resolves SLO / SLA gaps
• Serve as technical subject matter expert (SME) for cross-functional engineering Teams;
• Assist with and troubleshoot systems-related issues and maintenance
• Collaborate on maintaining services once they are live; measures and monitors availability, latency, and overall system health
• Develop run book and build automation
• Develop and maintain E2E monitoring dashboards to support critical business transaction
• Develop and maintain synthetic monitoring for critical business transaction using tools such as ThousandEyes
• Practice sustainable incident response and blameless postmortems
• Document and promote SRE standards and procedures
• Develop and assist in deployment and rollback automation
• Review Release and deployments requirements
• Build and setup automation tests.
• Incident communication to impacted stakeholders
• Coach and mentor junior engineers and fellow practitioners
Role Summary:
Experienced SRE Engineer with 5+ years in designing, managing and supporting distributed systems across multi-cloud environments.
Key Skills & Expertise:
- CI/CD: GitHub, Harness
- Cloud Platforms: GCP, PCF, AWS
- Monitoring & Observability: Splunk, Grafana, AppDynamics, Thousand Eyes
- Containers & Orchestration: Docker, Kubernetes, Cloud Foundry
- Messaging & Streaming: Kafka, MQ
- Protocols & Web Services: HTTP, DNS, TCP/UDP, REST, SOAP, JSON
Core Competencies:
- Strong troubleshooting and debugging in microservices architecture
- Incident management, issue resolution and RCA creation
- Multi-cloud platform management (SRE practices)
- Enterprise cloud infrastructure handling
- Agile development practices with tools like Git, Jira, Confluence
Experience: 5-8 Years .
The expected compensation for this role ranges from $60,000 to $135,000 .
Final compensation will depend on various factors, including your geographical location, minimum wage obligations, skills, and relevant experience. Based on the position, the role is also eligible for Wipro's standard benefits including a full range of medical and dental benefits options, disability insurance, paid time off (inclusive of sick leave), other paid and unpaid leave options.
Applicants are advised that employment in some roles may be conditioned on successful completion of a post-offer drug screening, subject to applicable state law.
Wipro provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Applications from veterans and people with disabilities are explicitly welcome.
Reinvent your world. We are building a modern Wipro. We are an end-to-end digital transformation partner with the boldest ambitions. To realize them, we need people inspired by reinvention. Of yourself, your career, and your skills. We want to see the constant evolution of our business and our industry. It has always been in our DNA - as the world around us changes, so do we. Join a business powered by purpose and a place that empowers you to design your own reinvention.