IrvineRecruiter Since 2001
the smart solution for Irvine jobs

Sr. SRE Engineer, Resilience (Remote)

Company: Enova International
Location: Irvine
Posted on: June 25, 2022

Job Description:

The health and safety of Enova's employees is our number one priority. Proof of vaccination will be required regardless of work location, unless prohibited by applicable state law. Employees may request an exemption to the vaccination policy due to medical reasons, sincerely-held religious beliefs, or as otherwise permitted by applicable state law.

Enova is currently accepting candidates for remote positions in the following eligible states: AZ, CT, ID, IL, IN, ME, MI, MN, NE, NV, NJ, NM, NY, UT, WI.

What you'll be doing:

In this role, you will help improve the resiliency of our services through technology, incident analysis, and process refinement.

You will work on optimizing how we deal with unexpected complex failures, including facilitating our incident response process, running post-incident blameless retrospectives, analyzing for and learning from consistent high-level trends, and integrating technology to reduce the effort needed to maintain these functions.

You will be responsible for learning how our systems and applications relate holistically in order to appropriately react during outages and work alongside Subject Matter Experts to drive resolution. You will develop improvements to how we collect and analyze data around failures, adjusting to the ever-advancing environment as progress is made.

You will collaborate with IT, Software Engineering, and product teams to foster a culture of quality where resilience is woven into our technology stack. You will show what different failure modes look like by running experiments (Mock Incidents, Disaster Recovery) and share learnings across the organization.

Your core priorities will be to:

  • Own Enova's Production Incident Process end-to-end.
  • Develop processes and technology to sustainably test and improve the resiliency of our services on an ongoing basis, balancing tech and business needs.
  • Manage process refactoring initiatives to ensure risk mitigation is considered, improving customer experience.
  • Collect data, perform trend analysis, and identify patterns of risks and vulnerabilities.
  • Work with leading teams to address vulnerabilities, particularly principal engineers and production managers.
  • Socialize lessons learned among all teams to bolster the culture of operational ownership.
  • Be part of our PI PIC (Incident Commander) rotation following training, leading incidents to completion, and driving post-incident analysis (including interviews, contributing factor analysis, incident response analysis, and remediation plans).

What you should have:

  • 3+ years of professional work experience in a technology role: Software Engineering, Systems, Ops, SRE, Product Management, or others.
  • Interest in complex distributed systems - how they work, how they can work better, how to know if they are working correctly.
  • Superior analytical, problem solving, and critical thinking skills.
  • Understanding of infrastructure as code (Terraform, Chef, etc.).
  • Experience with query languages (Postgres, SQL, Kafka, etc.).
  • Ability to handle, analyze, and present data.
  • Comfortable with ambiguity; able to translate ambiguous problems into strong solutions.
  • Demonstrates maturity, good judgment, negotiation, leadership, and project management skills.
  • Excellent written and verbal communication skills, including the ability to communicate to different levels of an organization (i.e. on a technical vs. non-technical level).

Nice to have:

  • Experience with full stack development.
  • Experience with handling and leading resolution of major failures of critical systems.
  • Experience driving large-scale changes.

About Resilience Engineering:

Resilience Engineering is a subset of the Site Reliability Engineering team that strives to drive a culture of continuous resiliency improvement in our systems. We do this by focusing on our incident response process, incident analysis and learnings, and creatively solving systemic hurdles to resiliency. We work closely with other Tech, Operations, and Business teams to resolve complex failures and to continuously learn.

Our goal at Enova is to recruit, hire, develop and maintain a diverse workforce. It is our policy to provide equal employment opportunity for all persons and not discriminate in employment decisions by placing the most qualified person in each job, without regard to any other classification protected by federal, state, or local law.

About Enova:

Enova is a leading financial technology company providing online financial services through its AI and machine learning powered lending platform. Enova serves the needs of non-prime consumers and small businesses, who are frequently underserved by traditional banks. Enova has provided more than 7 million customers with over $40 billion in loans and financing with market leading products that provide a path for them to improve their financial health. Want to learn more? Just ask any of our almost 1,500 employees.

At Enova, we believe that diversity and inclusion among our teammates is critical to our success as a global company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool.

California Applicants: Click here to review our California Privacy Policy for Job Applicants.

Keywords: Enova International, Irvine, Sr. SRE Engineer, Resilience (Remote), Engineering, Irvine, California

Click here to apply!
