Site Reliability Engineer
Job Summary
Potpie is seeking a Senior Site Reliability Engineer to be instrumental in building robust, scalable, and resilient infrastructure for their AI agent platform. This role involves designing and maintaining core infrastructure, CI/CD pipelines, observability, and automation, with a focus on reliability, performance, and security. The ideal candidate will have expertise in cloud infrastructure, Kubernetes, SRE principles, and strong scripting skills in Python or Go.
About Suprsend
SuprSend is a central communication stack for easily creating, managing and delivering notifications to your end users on multiple channels. Our single notification API has all the features set, which enables you to send notifications in a reliable and scalable manner and take care of end user experience, thereby eliminating the need to develop any notification service in-house for transactional/engagement notifications.
Job Roles & Responsibilities
- Design, implement, and maintain the core infrastructure and CI/CD pipelines to ensure high availability, scalability, and performance of the potpie platform and its AI agents.
- Be responsible for observability (logging, monitoring, alerting) across the stack to proactively identify and resolve issues.
- Automate deployment, scaling, and operational tasks using infrastructure-as-code (IaC) principles.
- Expertise in Cloud Infrastructure (e.g., AWS, GCP, Azure), particularly managing and deploying applications at scale.
- Strong practical experience with Kubernetes (e.g., GKE, EKS) and containerization technologies (Docker).
- Solid understanding of SRE principles and practices, including SLOs, SLIs, error budgets, and post-mortem analysis.
Cultural Expectations
- Eagerness to tackle challenging technical problems related to scaling AI agents.
- Desire to do impactful work that positively changes the lives of fellow developers.
- Having out-of-the-box ideas and wanting autonomy to chase them.
- Willingness to work across the stack on a fast-paced project from day one.
- Opportunity to build company culture.
- Aspiration to build something of your own one day.
Login to Apply
Please login to apply for this job.
Other Jobs
Backend Developer
Bengaluru
Backend Engineer
Bengaluru
C++ Developer
Remote
MERN Stack Developer
Hyderabad
Mobile Developer
Remote
