Site Reliability Engineer (SRE)
Rokt
New York, new york
Job Details
Full-time
Full Job Description
We are Rokt, a hyper-growth ecommerce leader. We enable companies to unlock value by making each transaction relevant at the moment that matters most, when customers are buying. Together, Rokt's AI-based relevance Platform and scaled ecommerce network powers billions of transactions. In December 2022, Rokt’s valuation increased to $2.4 billion USD, allowing us to expand rapidly across 15 countries.
At Rokt, we practice transparency in career paths and compensation.
At Rokt, we believe in transparency, which is why we have a well-defined career ladder with transparent compensation and clear career paths based on competency and ability. Rokt’stars constantly strive to raise the bar, pushing the envelope of what is possible.
We are looking for a Senior Site Reliability Engineer
Compensation: $190,000 - $270,000 salary, employee equity plan grant & world class benefits.
About Rokt’stars
As a mission-driven, hyper-growth community of curious explorers, our ambition is to unlock the full potential in ecommerce and beyond. Our bias for action means we are not afraid to quickly venture into uncharted territories, take risks or challenge the status quo; in doing so we either win or learn. We work together as one aligned team never letting egos get in the way of brilliant ideas. We value diversity, transparency and smart humble people who enjoy building a disruptive business together. We pride ourselves on being a force for good as we make the world better.
The Rokt engineering team builds best-in-class ecommerce technology that provides personalized and relevant experiences for customers globally and empowers marketers with sophisticated, AI-driven tooling to better understand consumers. Our bespoke platform handles millions of transactions per day and considers billions of data points which give engineers the opportunity to build technology at scale, collaborate across teams and gain exposure to a wide range of technology. We are expanding rapidly in our major R&D centers in NYC and Sydney. We are passionate about using intelligent systems to improve the transaction moment for retailers everywhere. Come join us and build the future!
Requirements
The Role
As a Site Reliability Engineer (SRE) you will be part of a team responsible for designing and building high levels of availability, scalability and reliability into our systems. You will become intimate with the architecture of our systems and be responsible for diving deep into code, architecture, or root cause analysis, working directly with feature teams. Ability to quickly learn and become familiar with our tech stack, such as Spark, Kubernetes, ScyllaDB, Clickhouse, Airflow/Flyte, Terraform, Tableau, etc.
Responsibilities
- Evolve systems by pushing for changes that improve reliability, capacity, and reduce latency.
- Introduce best practices into the teams around observability, SLOs, and reliability.
- Work in close collaboration with partner teams to shape the future roadmap to establish a high operational bar.
- Share your knowledge by giving brown bags, tech talks, or evangelizing appropriate tech and best practices.
- Contribute to Root Cause Analysis (RCA) investigations and prioritizing incident follow-up action items.
- Design, recommend, or implement tools and processes to help development teams be as productive as possible.
- Contribute proactively to documentation and information-sharing, uplifting partner teams or the broader org.
Requirements
- 3 years hands-on experience in Site Reliability and Observability Engineering, debugging, diagnosing and correcting errors and resolving high severity incidents
- Commercial experience in one of the following languages Java, C#, Python or Go.
- Solid experience with cloud infrastructure and tooling such as AWS, GCE, Azure, Kubernetes, Docker, CI/CD pipelines, Terraform.
- Experience working on various monitoring and alerting tools
- Strong organizational and interpersonal skills, with experience instilling a culture of operational maturity.
- Aptitude for navigated incidents from tech emergency through the retrospective process.
Nice to have:
- Experience in defensive programming, circuit breakers, resilience frameworks, fault tolerance, and self-healing mechanisms of services.
- Experience building solutions in distributed systems for high volume transaction processing.
- Good problem-solving skills, coupled with effective communication, sense of ownership, and personal drive.
- Proactively provide ideas and opinions, respectfully sharing them through proposals, presentations, or facilitating group discussions.
- An eagerness to learn and share learnings.
Benefits
About Rokt’stars:
As a mission-driven, hyper-growth community of curious explorers, our ambition is to unlock the full potential in ecommerce and beyond. Our bias for action means we are not afraid to quickly venture into uncharted territories, take risks or challenge the status quo; in doing so we either win or learn. We work together as one aligned team never letting egos get in the way of brilliant ideas. We value diversity, transparency and smart humble people who enjoy building a disruptive business together. We pride ourselves on being a force for good as we make the world better.
About The Benefits:
We leverage best-in-class technology and market-leading innovation in AI and ML, with all of that being underlined by building and maintaining a fantastic and inclusive culture where people can be their authentic selves, and offering a great list of perks and benefits to go with it:
- Accelerate your career. We offer roadmaps to leadership and an annual $5000 training allowance
- Become a shareholder. Every Rokt’star gets equity in the company
- Enjoy catered lunch every day and healthy snacks in the office. Plus join the gym on us!
- Access generous retirement plans like a 4% dollar-for-dollar 401K matching plan and get fully funded premium health insurance for your entire family!
- Dog-friendly office
- Extra leave (bonus annual leave, sabbatical leave etc.)
- Work with the greatest talent in town
- See the world! We have offices in New York, Seattle, Sydney, Tokyo and London
We believe we’re better together. We love spending time together and are in the office most days (teams are in the office 4 days per week). We also get that you need to balance your life and your commitments so you have the flexibility to manage your own hours and can spend up to a week of every quarter working from anywhere.
If this sounds like a role you’d enjoy, apply here and you’ll hear from our recruiting team.