Site Reliability Engineer - DevSecOps Engineer
2 days ago
Austin
Job Description

We are looking for a Senior Site Reliability Engineer (DevOps/DevSecOps). This engineer will lead the design, implementation, and operation of infrastructure, automation, and security-focused workflows that power VA.gov and other large-scale, mission-critical applications serving millions of Veterans and their families. This role combines reliability engineering, modern DevOps practices, and security-aware infrastructure design to ensure systems are robust, cost-efficient, and fully compliant with federal standards, while enabling fast, safe, and reliable delivery of application changes.

Location: 100% Remote
Salary Range: $130,000 - $150,000 Annually

This role requires close collaboration with backend engineers, security specialists, and other SRE/DevOps team members to:
• Architect, maintain, and optimize AWS GovCloud infrastructure with scalability, resilience, and cost efficiency in mind.
• Build and manage container orchestration environments (Kubernetes, EKS) for high-availability applications.
• Design and operate CI/CD pipelines for automated builds, tests, and deployments, incorporating security and compliance checks for federal systems.
• Implement infrastructure-as-code (Terraform, Ansible, or similar) for consistent, auditable, and repeatable environments.
• Monitor, analyze, and improve system performance, scaling, and resource utilization across VA.gov services.
• Lead operational readiness, incident response, and on-call coverage in a 24/7 production environment.
• Automate repetitive operational tasks to increase deployment reliability and reduce manual overhead.
• Ensure security best practices across infrastructure and deployments, including network segmentation, traffic encryption, and vulnerability mitigation.
• Collaborate with application teams to align infrastructure design with business and technical goals.
• Mentor junior engineers on SRE, DevOps, and DevSecOps principles and practices.

Requirements:
• 7+ years of experience in Site Reliability Engineering, DevOps, Cloud Engineering, or related infrastructure roles.
• Deep expertise with AWS GovCloud and container orchestration (Kubernetes, EKS).
• Proven experience with CI/CD tools (GitHub Actions, Jenkins, CircleCI, or similar) in production environments.
• Proficiency in infrastructure-as-code (Terraform, Ansible, or similar).
• Strong knowledge of observability and logging systems (Datadog, Prometheus, Grafana, ELK, or similar).
• Experience in database administration and optimization (PostgreSQL or similar).
• Strong scripting/programming skills for automation (Python, Ruby, Bash, or similar).
• Advanced understanding of networking, load balancing, and traffic routing.
• Demonstrated ability to design for reliability, performance, and cost optimization in infrastructure.
• Familiarity with security frameworks and compliance requirements in regulated or sensitive environments (e.g., FedRAMP, FISMA, RMF).
• Experience supporting VA.gov or other large-scale federal systems.

Preferred:
• Background in high-availability architectures and disaster recovery planning.
• Hands-on experience implementing zero-downtime deployment strategies (blue/green, canary releases).
• Track record of integrating reliability and security into Agile or DevSecOps workflows.
• Understanding of how infrastructure components (equipment, connectivity, application interfaces, images, and processed data inputs and outputs) are moved from point A to point B.
• Ability to work with an internal customer organization and troubleshoot issues such as incoming images/data being processed for invoices, or other types of processing embedded in the image/data processing system, whose output is then passed to data entry or other validation systems.
• Knowledge of SSO, SOA, PowerShell, Red Hat Linux, application integrations, scripting, cloud technology and storage, database design and table structure, and data security related to PII and PHI data.

Level of Education:
• Bachelor's Degree with 7 years' experience.
• Solid experience in AWS and Azure platforms and architecture in an Agile DevOps environment.
• Strong developer skills and DevSecOps experience.

Benefits Overview:
Full-time employees are offered a comprehensive and competitive benefits package including paid vacation, sick leave, holidays, health insurance, life insurance, military leave, training, tuition reimbursement, a wellness program, short- and long-term disability, a 401(k) retirement plan with company matches and immediate vesting, commuter benefits, and more.

EEO Policy:
It is our policy to promote equal employment opportunities.
All personnel decisions, including, but not limited to, recruiting, hiring, training, promotion, compensation, benefits, and termination, are made without regard to race, creed, color, religion, national origin, sex, age, marital status, sexual orientation, gender identity, citizenship status, veteran status, disability, or any other characteristic protected by applicable federal, state or local law.