Associate Product Reliability Engineer
hace 6 días
Boston
Job Description About InvoiceCloud: InvoiceCloud is a fast-growing fintech leader recognized with 20 major awards in 2025, including USA TODAY and Boston Globe Top Workplaces, multiple SaaS Awards wins for Best Solution for Finance and FinTech, and national customer service honors from Stevie and the Business Intelligence Group. Judges also highlighted our mission to reduce digital exclusion and restore simplicity and dignity to how people pay for essential services, as well as our leadership in AI maturity and responsible innovation. It's an award-winning, purpose-driven environment where top talent thrives. To learn more, visit InvoiceCloud.com. Job Details The Product Reliability Engineering team at InvoiceCloud partners with Software Engineering to monitor, support, and improve the reliability of the SaaS platform powering millions of digital payments and customer interactions every month. As an Associate Product Reliability Engineer, you will build foundational technical and diagnostic skills while supporting production operations for InvoiceCloud's Payment Service Network. With guidance from senior team members, you'll assist in debugging, monitoring, and incident response, and contribute to improving reliability through automation, dashboards, and documentation. Working closely with experienced engineers and product support teams, you will gain exposure to InvoiceCloud's codebase, incident management practices, agile workflows, and delivery standards while building confidence and independence. Success Profile At InvoiceCloud, success in this role is grounded in our core competencies. These competencies represent the mindsets and behaviors that enable team members to deliver impact, innovate, and create value for our customers. Below outlines how each competency translates into meaningful actions and outcomes for this position. Results Driven • Supports issue triage and debugging across production systems, using logs, metrics, and traces to identify symptoms and narrow hypotheses., • Writes clean, functional, and well-tested code (primarily .NET/C#) to deliver small reliability improvements, automation, and fixes with defined scope and guidance., • Assists in building and maintaining monitoring dashboards and alerting to improve visibility into platform health., • Owns assigned incident tickets or operational work items through resolution, communicating progress, impact, and blockers clearly., • Documents recurring issues, troubleshooting steps, and runbooks so others can respond consistently and efficiently., • Partners with senior engineers and product support teams to reproduce issues, validate fixes, and confirm service restoration., • Uses Git and standard branching/review practices to streamline collaboration and ensure operational changes are traceable., • Creates or improves automation scripts (PowerShell and/or Python) to reduce repetitive operational work and speed up diagnostics., • Learns to prioritize reliability work using impact and urgency (e.g., incident severity, customer impact, and SLO risk) while meeting sprint goals., • Explores reliability and observability tools and techniques that improve detection, diagnosis, and recovery (e.g., better logging, actionable alerts, and dashboards)., • Leverages AI-assisted development tools (e.g., GitHub Copilot, Cursor, Windsurf) for debugging, code generation, and documentation, while learning to validate AI output critically., • Contributes ideas for improving incident response, post-mortem practices, and production readiness during feature delivery., • Bachelor's degree in Computer Science, Engineering, or related technical discipline, • 0–2 years of experience in software engineering, DevOps, production support, or technical support (internship, co-op, or professional), • Understanding of object-oriented programming, basic data structures, and algorithms, • Familiarity with .NET/.NET Framework, C#, SQL, and version control systems (Git), • Exposure to cloud environments (Azure preferred) and basic concepts like deployments, configuration, and networking fundamentals, • Exposure to scripting for automation and troubleshooting (Python and/or PowerShell), • Familiarity with monitoring/observability tools or concepts (dashboards, alerts, log queries); experience with New Relic or similar is a plus InvoiceCloud is committed to providing equal employment opportunities to all employees and applicants. We do not tolerate discrimination or harassment of any kind based on race, color, religion, age, sex, nationality, disability, genetic information, veteran or military status, sexual orientation, gender identity or expression, or any other characteristic protected under applicable laws. This commitment applies to all aspects of employment, including recruitment, hiring, placement, promotion, termination, layoff, recall, transfer, leave, compensation, and training. If you require a disability-related or religious accommodation during the application or recruitment process, and wish to discuss possible adjustments, please contact . Click here to review InvoiceCloud's Job Applicant Privacy Policy. For recruitment agencies: InvoiceCloud does not accept unsolicited resumes from agencies. Please do not forward resumes to our job aliases, employees, or any other company location. InvoiceCloud is not responsible for any fees associated with unsolicited submissions.