Senior HPC System Administrator
University of California, Los Angeles
Requisition Number: 31501
Salary: $6,334 - $12,516 monthly
Reporting to the High Performance Computing (HPC) Systems Group Lead of IDRE's Research Technology Group, the HPC System Administrator is a specialist and technical expert in HPC and UNIX operating systems. The HPC System Administrator maintains and expands the operational capacity of IDRE's researcher-supported Hoffman2 computing cluster and provides technical expertise for ongoing research computing efforts on it. The HPC System Administrator is responsible for researching, prototyping, implementing, and supporting multiple complex components of the Hoffman2 cluster; has technical mastery of storage systems and packet-switched networks, at both the hardware and software levels, in large-scale HPC environments; must respond to complex issues within the Hoffman2 cluster environment as they arise and be able to isolate and resolve problems in the production HPC system; and is required to develop proof-of-concept prototypes for production-candidate technologies based on software containers, virtual machines, and software security mechanisms. Knowledge of modern programming languages is required; a programming skills assessment will be administered prior to hiring. The HPC System Administrator must have the expertise to translate high-level requirements into well-architected, well-tested designs and implementations, and the skills to collaborate with, support, and mentor other team members as appropriate to help them realize their full potential.
Description of Unit:
The High Performance Computing Systems Group is composed of five team members who are responsible for maintaining, deploying, and monitoring the Hoffman2 cluster. The Hoffman2 cluster is a shared research cluster currently composed of nearly 1,500 rack-mounted, enterprise-grade datacenter compute nodes connected by Ethernet and InfiniBand network fabrics. High-performance NetApp and Panasas file systems constitute the backend storage. The storage is fully backed up. Users' jobs are dispatched to the compute nodes via a job scheduler. Nodes and storage are continuously added to (and removed from, as needed) the cluster as faculty sponsors upgrade their computing and/or storage resources. As a result, Hoffman2 is a highly heterogeneous cluster.
The HPC Systems Group's areas of responsibility include the following:
1. Installing, upgrading and maintaining the OS on the cluster nodes.
2. Administering centralized system configurations and network services.
3. Maintaining a virtual infrastructure that serves administrative hosts.
4. Ensuring the security and performance of the Hoffman2 cluster.
5. Maintaining backups and restoring files and/or directories as needed.
6. Installing physical hardware and troubleshooting problems at the user/hardware/software/OS interface.
7. Specifying, procuring, and installing GPUs, compute nodes, high performance storage, and cluster-related network equipment associated with the campus advanced research computing cluster.
8. Maintaining user accounts and ensuring consistent access to computing
9. Maintaining enterprise-class storage systems and proactively monitoring cluster performance.
10. Researching new and existing technologies for acquisition of nodes, storage, networking, and other hardware solutions.
Qualifications and Experience:
Bachelor's degree in computer science, mathematics, statistics, engineering, or a related field. Master's or PhD preferred. Minimum of three (3) years of experience with Unix/Linux system administration and one or more modern programming languages (e.g., Python, C++, Java). The salary for the Senior HPC System Administrator position is commensurate with education, experience, and internal equity.
For full application instructions and position description, please visit: https://hr.mycareer.ucla.edu/applicants/Central?quickFind=77753
The University of California is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age or protected veteran status. For the complete University of California nondiscrimination and affirmative action policy, see: UC Nondiscrimination & Affirmative Action Policy, https://policy.ucop.edu/doc/4000376/DiscHarassAffirmAction