Tentek, Inc.
DevOps / Cloud Engineer
hace 1 día
San Diego
Job Description Sr DevOps / Cloud Engineer Position Summary: Lead cloud infrastructure strategy and DevOps practices, architecting and automating highly scalable, reliable, and secure systems that power digital experiences across a global ecosystem. Define infrastructure and deployment standards across multiple engineering teams, partner with Principal Architect and engineering leadership on platform reliability and scalability, and drive operational excellence at enterprise scale. Mentor DevOps engineers, establish best practices, and build infrastructure-as-code frameworks that enable rapid, safe deployment of platform capabilities serving hundreds of millions of users worldwide. Core Responsibilities: • Architect and define cloud infrastructure strategy for platform capabilities across AWS at enterprise scale, • Design and implement scalable, multi-region infrastructure supporting global business units, • Build and maintain sophisticated CI/CD pipelines enabling rapid, safe deployment across 6+ engineering teams, • Establish infrastructure-as-code frameworks and standards used across the organization, • Drive platform reliability, performance, and cost optimization initiatives, • Partner with Principal Architect and engineering leadership on platform scalability and architecture decisions, • Design and implement comprehensive monitoring, alerting, and observability solutions, • Lead incident response and post-mortem processes, driving continuous improvement, • Implement security best practices, compliance standards, and disaster recovery strategies, • Automate infrastructure provisioning, configuration management, and deployment processes, • Define DevOps standards, best practices, and tooling strategies organization-wide, • Mentor Senior DevOps Engineers and DevOps II on infrastructure design and cloud architecture, • Collaborate with engineering teams to optimize application performance and resource utilization, • Drive cloud cost management and optimization across platform infrastructure, • Evaluate and introduce new DevOps tools, technologies, and practices, • Build self-service infrastructure capabilities enabling engineering team autonomy, • Champion SRE practices and improve platform SLAs, uptime, and reliability, • Coordinate cross-organizational infrastructure initiatives and standardization, • 8+ years DevOps, cloud engineering, or infrastructure experience at enterprise scale, • Expert-level AWS knowledge across compute, networking, databases, security, and platform services, • Deep expertise in infrastructure-as-code using Terraform, CloudFormation, or similar, • Advanced CI/CD pipeline architecture and automation (Jenkins, GitHub Actions, GitLab CI), • Container orchestration expertise with Kubernetes (EKS), Docker, and related ecosystem, • Strong programming/scripting skills in Python, Bash, Go, or similar languages, • Deep understanding of microservices architecture and distributed systems, • Expert in monitoring, logging, and observability tools (Prometheus, Grafana, ELK, Datadog, CloudWatch), • Security best practices including IAM, secrets management, network security, compliance, • Advanced troubleshooting and incident response skills for complex distributed systems, • Cost optimization and FinOps practices for cloud infrastructure, • Site Reliability Engineering (SRE) principles and practices, • Configuration management tools (Ansible, Chef, Puppet), • Networking expertise (VPC, load balancers, DNS, CDN, service mesh), • Database operations (RDS, DynamoDB, Aurora, backup/recovery), • Disaster recovery and business continuity planning, • Mentorship and technical leadership capabilities, • Excellent communication skills for both technical and executive audiences, • Strategic thinking balanced with hands-on technical execution, • Cloud Platforms: AWS (expert level - EC2, EKS, RDS, Aurora, DynamoDB, S3, Lambda, CloudFront, VPC, IAM, CloudWatch, etc.), • Infrastructure as Code: Terraform (primary), CloudFormation, Pulumi, • CI/CD: Jenkins (primary), GitHub Actions, GitLab CI, CircleCI, AWS CodePipeline, • Containers & Orchestration: Docker, Kubernetes, EKS, Helm, Kustomize, • Configuration Management: Ansible, Chef, Puppet, • Scripting/Programming: Python, Bash, Go, PowerShell, • Monitoring & Observability: Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, New Relic, CloudWatch, Splunk, • Service Mesh: Istio, Linkerd (awareness), • Secrets Management: HashiCorp Vault, AWS Secrets Manager, Parameter Store, • Version Control: Git, GitHub, GitLab, • Artifact Management: Nexus, Artifactory, ECR, • Build Tools: Maven, Gradle, npm, Docker build, • Networking: AWS VPC, Route 53, CloudFront CDN, ALB/NLB, API Gateway, • Databases: RDS (PostgreSQL, MySQL), Aurora, DynamoDB, ElastiCache (Redis), Cassandra, • Security: AWS Security Hub, GuardDuty, WAF, security scanning tools, • Incident Management: PagerDuty, Opsgenie, • Communication: Slack, MS Teams (for alerts and ChatOps)Company DescriptionTenTek has been in business since 1989 and is recognized as a leading staffing provider of tech professionals to a growing client base.TenTek has been in business since 1989 and is recognized as a leading staffing provider of tech professionals to a growing client base.