Reflexive Concepts is seeking a skilled System Administrator to join our team!
The System Administrator must provide Infrastructure Server and Sustainment services to enhance and complement High Performance Computing (HPC) sustainment capabilities across two sites and geographic locations, including the integration and management of the Exceptionally Controlled Information (ECI) data system. Server Ops team members and the Government monitoring service (in some cases 24x7x365) both monitor systems. All Systems will be integrated and utilize Government monitoring services. The Government monitoring service will monitor and report problems to the Team via Email/phone during business hours.
Qualifications:
- B.S. in a technical discipline and 10 years’ experience as a System Administrator in programs and contracts of similar scope, type, and complexity
- 5 additional years of experience may be substituted in lieu of a degree
Required:
- Linux (RHEL, CentOS, Rocky, SLES, Ubuntu [new])
- Experience with OS install, file system configuration, TCP/IP networking, configuration, operating system and application troubleshooting, Bash scripting, software compilation and installation
- Understanding of HPC architecture, knowledge about high-speed networks such as InfiniBand, Slingshot
- Familiarity with Jira, Confluence, Grafana, Prometheus, Nagios, Slurm, Git, Salt, Ansible
- Good troubleshooting skills– each system is slightly different, and there's no "one fix" for a particular problem
- Lustre file system configuration and administration, troubleshooting knowledge
- Experience with DDN Exascaler file system appliances
- TCP/IP networking knowledge, specifically storage fabrics
- Experience with Cisco and Juniper (Arista is also desired)
- Genuine curiosity/proactive effort to learn/grow in what comes next: E1000s, ESNs, DAOS, Weka
- Experience with benchmarking tools (e.g. IOR, iperf, FIO, lnet_selftest)
- DoD 8570 IAT II level certification required