Senior Site Reliability Engineer
16 days ago
New York
Who We Are Perchwell is the modern real estate listings platform built for agents to search smarter, collaborate better, and close deals faster. We are building the future of residential real estateʼs critical listings infrastructure: a platform where market research and client collaboration converge. Our modern architecture enables continuous innovation at a pace legacy systems cannot match, with AI-powered features and mobile-first capabilities designed for how agents actually work. As consumer expectations and technology evolve, Perchwell remains focused on our core vision: empowering real estate professionals with the most intelligent, data-driven, and connected platform in the industry. Backed by leading venture capital firms including Lux Capital and Founders Fund, along with strategic partnerships with some of the country's top Multiple Listing Services (MLSs), Perchwell represents the first major new platform to enter the listings technology market in decades. This unique combination of institutional investment and deep industry alignment provides both the resources and market validation needed to transform the multi-trillion dollar residential real estate industry. As a Senior SRE, you will help own and improve the technical foundations of Perchwell while exemplifying engineering rigor and excellence across our engineering culture and strategy. You will be helping execute large strategic technical initiatives within the SRE domain. You have deep experience in building and operating infrastructure for large production systems. You’ve owned or have highly opinionated views on what good observability is and help other teams see the light. You are rigorous and meticulous in your work and hold others to the same high engineering standards. You believe the best way to help others is to provide them the tools they need to get their job done in a safe and efficient manner and have a demonstrated history of success enabling product teams to quickly innovate and iterate. You will work closely with the VP of Engineering and other senior leaders to tackle and remediate our current problem set while also building net new capabilities. You will build important relationships and partner deeply with our product and QA organizations. The SRE team is responsible for building the ability to innovate faster in a safe and reliable way. Reliability, resiliency and adaptability are our north stars. This role has an on-call requirement. We believe that in an ever-changing, innovative environment, we do our best work when we are working as a team in-person. In this role, you’ll work out of our New York City HQ in Soho Manhattan at least 3 days/week. * Design and build scalable processes and solutions to fundamental engineering challenges * Design and manage scalable, secure AWS infrastructure * Partner with the Quality Team to own the CI/CD processes and enable fast, safe and frequent deployments * Contribute to and evolve our Kubernetes infrastructure and strategy * Build and manage safe self-service methods for our teams to manage their infra via Terraform and other automation tools * Be a champion of observability ( o11y ) by owning our o11y systems and establishing and enforcing best practices throughout the engineering organization around performance and service monitoring * Participate in and help improve our incident management and disaster recovery processes * Partner with FinOps and feature teams to manage infrastructure spending by identifying and optimizing costs saving strategies * BS or MS in Computer Science, related technical field, or equivalent experience * Distributed systems experience * Deep experience with AWS cloud services such as: EC2, RDS, EKS, CloudFront, ECR, S3, IAM, CodeBuild, Lambda, and Route53 * In-depth knowledge of Kubernetes, including experience with deploying, managing, scaling, and orchestrating clusters and automating/exposing this to other teams * You’ve built tools and or enabled automation for your team or others * You’ve used at least one programming language ( ex: python, golang, rust ) to solve problems and close feature gaps of your tools * Demonstrated pattern of systems thinking leading to organizational impact and strategic problem solving * The ability to go deep across service, database and infrastructure boundaries for design and performance related work * 5+ years of experience in a dedicated SRE role * AWS Well Architected Framework experience * Experience with Event Driven System designs and the required messaging technologies and patterns ( kafka, nats, aeron ) * Experience with security frameworks and tools for hardening environments and systems * Experience with different types of databases and their architectures * Experience with the Ruby on Rails ecosystem, and understanding of Rails-specific quirks * Experience with the management and scaling of Elasticsearch Salary Information To provide greater transparency to candidates, we share base salary ranges for all US-based job postings regardless of state. Our ranges are based on function and level benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors including skills, job-related knowledge and depth of work experience. The base compensation range for this position is expected to be between $170,000-$230,000/year + equity + benefits. Note: At this time, we are only considering candidates who are authorized to work in the U.S.