System Engineer
5 days ago
Sacramento
Immediate opening for an Operations Command Center Engineer 2 (OCCE2). The OCCE2 will be responsible for handling escalated incidents as referred by OCCE1, performing deeper troubleshooting, incident management, and root cause analysis. This individual will provide technical expertise to ensure uptime and efficiency in the operations of IT systems, applications, and infrastructure, and will be involved in maintaining and updating monitoring tools, processes, and cloud-based solutions to enhance operational efficiency. Contributes to key areas such as network management, system administration, and automation. Position Title: Systems Engineer/Administrator Location: Sacramento, California Job Type: 6 months Contract to hire, Hybrid (3days at office) KEY RESPONSIBILITIES • Manages escalated tickets from OCCE1 for advanced troubleshooting and problem resolution across network, system, and cloud platforms., • Proactively monitors system health, performance, and uptime, ensuring continuous service availability using advanced monitoring and observability tools., • Identifies recurring incidents and initiates root cause analysis for long-term resolution., • Collaborates with cross-functional teams, including Applications, Infrastructure, Security, and Cloud teams, to resolve incidents., • Configures, troubleshoots, and maintains network devices (e.g., routers, switches, firewalls) and ensures secure remote access (VPN, remote desktop solutions)., • Manages and maintains cloud infrastructure (AWS, Azure, GCP), including virtualization (VMware, Hyper-V) and automation (Terraform, Ansible)., • Develops and refines operational runbooks, playbooks, and response procedures, focusing on improving cloud governance and security., • Participates in on-call rotations to support incident handling outside of normal business hours., • Contributes to the continuous improvement of monitoring tools, cloud services, and incident management processes., • Prepares and delivers post-incident reports, root cause analysis, and lessons learned to Senior Management., • Ensures that SLAs related to response times, escalation, and ticket handling are met consistently., • Coordinates shift handovers with detailed incident reporting and supporting documentation., • Leads efforts on system administration (Windows, Linux, Mac OS), backup and disaster recovery procedures, and server management., • Participates in project management efforts, capacity planning and risk management for ongoing operations. EDUCATION/EXPERIENCE • Education: Minimum of High School diploma or equivalent required. Bachelor's degree in Computer Science, Software Engineering, or related discipline preferred., • Comp TIA Network+, • Cisco Certified Network Associate (CCNA), • Microsoft Certified: Azure Administrator Associate, • AWS Certified Solutions Architect - Associate, • Google Professional Cloud Architect, • Red Hat Certified System Administrator (RHCSA), • Certified Ethical Hacker (CEH), • CompTIA CySA+ (Cybersecurity Analyst), • Certified Information Systems Auditor (CISA) - ISACA, • GIAC Security Essentials (GSEC), • PRINCE2 Practitioner, • Agile Certified Practitioner (PMI-ACP) - PMI, • Certified ScrumMaster (CSM), • Network configuration and troubleshooting, • VPN and remote access technologies, • Cloud networking (AWS, Azure, Google Cloud), • Virtualization technologies (VMware, Hyper-V, KVM), • Storage solutions (SAN, NAS, DAS), • Server management and configuration (Windows, Linux), • Windows, Linux, and Mac OS administration, • User and group management (AD, LDAP), • Patch management and system updates, • Backup and disaster recovery procedures, • Cloud platform management (AWS, Azure, GCP), • Cloud services (IaaS, PaaS, SaaS), • Cloud security and governance, • Monitoring and observability (Prometheus, Grafana), • Incident and change management, • Firewalls, IDS/IPS, and VPNs, • Endpoint security and antivirus solutions, • SIEM platforms (Chronicle, Splunk, QRadar), • Vulnerability management (Nessus, Qualys), • Identity and Access Management (IAM), • Compliance standards (ISO, NIST, GDPR, HIPAA), • Data loss prevention (DLP), • Database administration (SQL Server, MySQL, Oracle), • Data backup and recovery, • System and application monitoring (Nagios, Zabbix, SolarWinds), • Log management and analysis (ELK Stack, Splunk), • Cloud monitoring (CloudWatch, Azure Monitor), • Desktop support (Windows, Mac OS), • Ticketing and help desk systems (JIRA, ServiceNow), • Hardware and software troubleshooting, • Scripting languages (Bash, PowerShell, Python), • Automation tools (Ansible, Puppet, Chef), • Project management frameworks (Agile, Scrum, ITIL), • Change management processes, • Documentation standards (SOPs, runbooks), • Data visualization tools (Power BI, Tableau), • Basic data analytics and query skills, • KPI monitoring and reporting