Site Reliability Engineer (SRE) - Denver, CO / Waltham, MA / Remote

Engineering · Denver, Colorado
Department Engineering
Employment Type Full-Time
Minimum Experience Mid-level

At Randori, we help defenders continuously assess their real-world security. Our automated attack platform provides a red team experience designed to mirror today’s adversaries, not yesterday’s threats. We partner with security teams to help them understand the art of the possible by delivering an unrivaled attack experience, at scale. 

As a Randori Site Reliability Engineer, (SRE) you will become a SME on our platform.  You will create, fix, extend and scale the code to keep it working and to harden the resilience and security posture.  A focus to automate and build instrumentation throughput of the system is a key objective. You will build out alerting and tuning of those alerts. You will also build out graphing capabilities and outputs for consumption and use by your Engineering colleagues and other cross functional teams within Randori.

As a SRE, you will be an owner of and be responsible for overall health, performance, and capacity of the Randori infrastructure and services of the platform. In addition to helping engineering teams with resolving operational issues, you will also design and implement solutions, tools and practices that help us improve operational efficiency and meet product SLA’s.

The work is frenetic but has a tremendous impact on our customers and the security market as a whole. We are looking for individuals that are driven, proactive, thorough and forward looking. 

Responsibilities of the Role:

  • Become a Subject Matter Expert, (SME) on the platform
  • Work with technical leads, engineers, designers, business analysts, and operations teams to ensure we have visibility into the health of our systems
  • Help define, break down and prioritize work in partnership with the team
  • Propose and deliver improvements to our production monitoring systems
  • Drive our on-call program, being the first line of defense during the day for internal platform support
  • Work with engineering and infrastructure to surface more metrics
  • Define metrics to provide visibility into system availability
  • Automate, Communicate, Automate

What’s Required:

  • BA/BS in Computer Science, Computer Engineering, or commensurate practical experience
  • 2-5 years of experience working in a support, development, testing or Linux admin role where scripting to automate, simplify and scale for cross team use was your approach
  • Experience with cloud computing technologies such as GCP, AWS, Azure or Rackspace 
  • Experience working with and supporting site reliability for SaaS based solutions
  • Baseline and actual coding experience with one or more of the following languages, C++, Java, Python or Go
  • Adaptability to changing directions and requirements

Preferred qualifications:

  • Experience with grafana, prometheus, and/or other industry standard monitoring tools
  • Highly proficient experience with scripting languages such as Python
  • Experience as an SRE working with GCP
  • Experience with Kubernetes for orchestration and system application deployments, scalings, and management.

If you don't meet 100% of the requirements below - that's okay, nobody's perfect! We believe in hiring people, not just a list of skills. We encourage you to apply if you think this is a role that would make you excited about coming to work every day.

Randori is committed to a diverse and inclusive workplace. We will consider all qualified candidates regardless of race, color, national origin, sex, age, marital status, personal appearance, sexual orientation, gender identity, family responsibilities, disability, education, political affiliation or veteran status.

Security Responsibilities:

Every Randori employee has the responsibility to ensure all tasks performed adhere to Randori’s  Information Security requirements. This includes understanding and following security policies, participation in information security training, and helping improve the organization's overall security maturity. Each member of our team must understand their role in protecting company assets and the subsequent handling of confidential & customer data.

Thank You

Your application was submitted successfully.

  • Location
    Denver, Colorado
  • Department
  • Employment Type
  • Minimum Experience