HPC System Engineer
Job Number: 81078
Job Code: 4833
Job Grade: K
The Stanford Research Computing Center (SRCC) is seeking outstanding applicants for the position of HPC System Engineer. Embedded with world-class researchers in the School of Earth, Energy and Environmental Sciences, you will join a dynamic and growing team of technology specialists supporting the computational and data needs of Stanford's research community. This position will specifically focus on managing and supporting HPC clusters.
The hiring range for this position is $125,000 - $140,000.
The successful candidate will be someone who:
- Has built, managed, secured and supported HPC clusters before and is comfortable handling all aspects of that work, from racking servers, to configuring networking, to installing software for end-users, to providing one-on-one instruction and support
- Thrives when working in an academic environment
- Is passionate about technology and is driven by challenge and intellectual curiosity
- Is self-motivated to learn, sometimes on their own time
- Has user support experience and actually likes working with end-users on a daily basis
- Is extremely detail-oriented, documents well, and understands the importance of documentation
- Isn't afraid of hardware
- Loves problem-solving
- Understands the need to ensure the usability of systems from the end-users' perspectives
The SRCC is jointly sponsored by University IT (UIT) and the Office of the Dean of Research. The SRCC team of 18 cyberinfrastructure professionals offers research computing platforms, consultation, tool and software development, system engineering, and system administration in support of computational and data-intensive research across the Stanford campus.
This position will provide system administration, engineering and specialized technical consultation for existing and future systems and services for research computing workloads. The position will also specifically have responsibilities for managing high performance computing infrastructure in the School of Earth, Energy and Environmental Sciences and for providing technical consultation to researchers there. The work will include hands-on installation, management and support of complex compute environments, including filesystems and storage platforms, Linux server environments, containers, job schedulers, scientific tools, and application software.
CORE DUTIES:
*Other duties may be assigned.
Education and Experience
Bachelor's degree and eight years of related increasingly technical work experience or a combination of education and relevant experience. Strong, demonstrated knowledge of Linux and demonstrated experience managing multiuser compute clusters and associated storage environments are required as well.
Knowledge, Skills and Abilities
Advanced knowledge of Linux and of HPC cluster management and operation is required; experience managing, using, supporting and consulting on research computing cyberinfrastructure in an academic or research environment is strongly preferred. Proven ability to deliver outstanding system and service administration and end-user support in a thorough and timely manner is needed. This position requires that you be able to juggle multiple competing priorities, work quickly and accurately, and demonstrate initiative in conceptualizing technical projects and moving them successfully to completion. The position must be able to do independent analysis, troubleshooting and problem resolution, but must also work collaboratively with other team members and across organizational group boundaries. An essential component of the job is keeping up with and mastering current and emerging technologies that facilitate researchers' computing work and that streamline and automate system administration tasks; this requires a demonstrated passion for and curiosity about the breadth of HPC technologies and tools, and about technology trends in general.
This position requires hands-on experience building and supporting multi-tenant Linux servers/clusters and their associated networks, file systems and storage devices in production research environments. Specifically, the technical knowledge needed to be successful in this position includes:
This position requires the ability to lift and manipulate storage and compute servers up to 40 pounds, rack and unrack equipment, and occasionally climb ladders. The position will support equipment in off-campus locations, so a valid driver's license is necessary. The position is expected to respond to critical system problems off-hours and must also be available for routine on-site system maintenance and patching, typically scheduled for evenings and weekends so as to minimize the disruption of research work. The position is expected to rotate on-call duties during winter break and other closures.
Consistent with its obligations under the law, the University will provide reasonable accommodation to any employee with a disability who requires accommodation to perform the essential functions of the job.
Location: Business Affairs, California, United States
To be considered for this position, please visit our website and apply online at the following link: stanfordcareers.stanford.edu
Stanford is an equal opportunity employer and all qualified applicants will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other characteristic protected by law.
Copyright ©2017 Jobelephant.com Inc. All rights reserved.