Reflexive Concepts is seeking a skilled System Administrator to join our team!
System Administrators (HPC) must provide High Performance Computing (HPC) services in the form of HPC enhanced sustainment capabilities.
These capabilities include:
- Multi-vendor HPC servers, HPC clusters, and SPD servers.
- Systems running Red Hat, CentOS, SUSE, and custom vendor-specific operating systems, with high-speed shared storage (for example, Lustre and GPFS) and dedicated high-speed, low-latency network interconnects such as InfiniBand and Slingshot.
- High-speed shared parallel storage that uses Lustre to provide performant shared storage between two or more HPCs in a data center. An Interconnect service integrates HPC systems via a dedicated high-speed network that connects several storage appliances to dedicated HPC LNETs. These appliances are available to various HPCs to enhance their capabilities.
Qualifications:
- B.S. in a technical discipline and 5 years’ experience as a System Administrator in programs and contracts of similar scope, type and complexity
- 5 years of experience may be substituted for a degree
Required:
- Proficient with the following (as specific position requires):
- Provide support for implementation, troubleshooting and maintenance of IT systems
- Provide Tier 1 (Help Desk) problem identification, diagnosis and resolution
- Manage the daily activities of configuration and operation of IT systems
- Provide assistance to users in accessing and using IT systems
- Provide Tier 1 (Help Desk) and Tier 2 (Escalation) problem identification, diagnosis and resolution
- Provide support to IT systems, including day-to-day operations, monitoring and problem resolution for all client, server, storage, network and mobile devices
- Provide support for the escalation and communication of status to agency management and internal customers
- Optimize system operations and resource utilization, and perform system capacity analysis and planning
- Apply in-depth experience in troubleshooting IT systems
- Provide detailed analysis and feedback to agency management and internal customers for escalated tickets
- Provide support for the dispatch system and hardware problems, and remain involved in the resolution process
- Configure and manage Linux, Unix, and Windows (or other applicable) operating systems; install and load operating system software; troubleshoot, configure, and maintain the integrity of network components; and implement operating system enhancements to improve reliability and performance
- DoD 8570 IAT II level certification required.