Skip to main content

This job has expired

Open Rank Research Engineer II/Senior - HPC Technical Program Manager

Employer
Georgia Institute of Technology
Location
Atlanta, Georgia
Job Title: Open Rank Research Engineer II/Senior - HPC Technical Program Manager
Location: Atlanta, Georgia
Regular/Temporary: Regular
Full/Part Time: Full-Time
Job ID: 258937
Location

Atlanta, GA.

Job Summary

The Institute for Data Engineering and Science (IDEaS) seeks a Research Software Engineer with primary job junction involving working with researchers, representing and managing complex cyberinfrastructure projects, and engaging users and the community. The successful candidate will lead Artificial Intelligence Cyberinfrastructure development efforts for research projects and collaborate with other researchers, faculty members, and industry partners to support development of technologies and solutions. This role requires strong software engineering expertise, excellent communication skills, and the ability to drive projects from conception to implementation.

This position vacancy is an open rank announcement. Final job offer will be dependent on candidate qualifications in alignment with Research Faculty ranks as outlined in section 3.2.1 of the Georgia Tech Faculty Handbook (https://www.policylibrary.gatech.edu/faculty-handbook/3.2.1-research-faculty-hiring-and-promotion-guidelines)

Responsibilities Collaborate with scientific partners on behalf of the institute, comprehend their research objectives, and identify specific cyberinfrastructure needs to support their goals.

Create and implement technical product roadmaps, collaborate with scientific partners to plan cyberinfrastructure research, and oversee the development of software engineering projects.

Oversee the maintenance, enhancement, and security of production cyberinfrastructure to ensure reliable access for all collaborators and users.

Act as project representative at various technical forums, such as conferences, meetings, and events, to engage with the broader community.

Demonstrate the capacity to recognize and pursue opportunities for organizational growth, as well as to establish and nurture collaborative partnerships.

Required Qualifications

Final rank will be determined commensurate with education and experience.

Research Engineer II

  • A Master’s degree in Computer Science or related field and three (3) years of relevant full-time experience after completion of that degree, or
  • A Master’s degree in Computer Science or related field, and five (5) years of relevant full-time experience after completion of a Bachelor’s degree, or
  • A Doctoral degree.

Senior Research Engineer

  • A Master’s degree in Computer Science or related field, and seven (7) years of relevant full-time experience after completion of that degree, or
  • A Master’s degree in Computer Science or related field, and nine (9) years of relevant full-time experience after completion of a Bachelor’s degree, or
  • A Doctoral degree in Computer Science or related field, and four (4) years of relevant full-time experience after completion of a Bachelor’s degree.

Preferred Qualifications Substantial technical management experience including supervisory responsibilities, community collaboration building, project management, information systems, and cyberinfrastructure.

Excellent technical development planning and implementation skills, with the ability to drive change and achieve results.

Ability to enable scientific codes and tools for users across multiple computational resources ranging from tightly coupled systems to the cloud.

Expertise in working with open-source projects and maintaining and versioning the code base; Contributions and Memberships in open-source foundations.

Ability to support diverse users, including serving research and educational needs on cyberinfrastructure with varying computational requirements based on domain specific tools and codes.

Ability to understand and analyze cyberinfrastructure trends, collaborator needs and preferences, and develop an effective technical roadmap.

Excellent communication and interpersonal skills, building and maintaining relationships with internal and external stakeholders.

Proven ability to lead and manage technical research and development teams, with a strong focus on fostering a positive and productive work environment.

Strong problem-solving and decision-making skills, with the ability to think critically and creatively.

Demonstrated commitment to excellence, with a passion for achieving outstanding results.

Established competency in managing, maintaining, and executing product deployment cycle for production cyberinfrastructure systems.

Experience in preparing and reviewing reports to federal funding agencies.

Experience with Science Gateway platform operations and research project-based business analysis.

Experience providing cyberinfrastructure for artificial intelligence (AI) based applications and tools.

Managing and operating multi-tenant cyberinfrastructure for research communities.

Experience leading and maintaining quality assurance aspects of cyberinfrastructure for AI-based research projects.

Experience in managing and maintaining reliable system operations for hosted research applications.

Experience working with domain experts from sciences and engineering, understanding their needs and requirements, and developing cyberinfrastructure and AI-based solutions to support their research.

Experience developing training materials and tutorials, and mentoring, training, and coaching staff.

Experience in gathering, analyzing, and designing requirements for cyberinfrastructure for AI-based applications.

Well-experienced in development cycle management and able to oversee the planning, execution, and monitoring of large-scale projects related to developing cyberinfrastructure and AI-based simulations.

Required Documents to Attach

CV and cover letter including names of 3 references.

Knowledge, Skills, & Abilities Expertise in developing modern web frameworks; proficiency in working with servlet containers (e.g., Tomcat) and web servers.

Programming ability in multiple languages such as Python, PHP, SQL, and JAVA.

Experience working with transactional/non-transactional (Graph, Document) data storages such as MariaDB, Neo4J, and MongoDB.

Experience working with data transport protocols such as SOAP, Rest, GRPC, and Thrift.

Experience working with message broker tools such as RabbitMQ.

Well versatile in using Zabbix, Grafana, Kibana, and Prometheus tools for IT operations and infrastructure monitoring.

Ability to support cyberinfrastructure accessibility through multiple operating systems such as Windows, UNIX, Linux, Mac OS, and Android.

Strong understanding of Unix operating system concepts related to file systems and processes and fluent use of Unix commands to deploy, configure, test, and debug applications in Unix-like operating systems.

Skilled in configuring and scripting with automated test tools like JMeter and Selenium, and the ability to develop JAVA-based load test tools specific for scientific and artificial intelligence domains.

Fluency in deploying, configuring, testing, and debugging parallel supercomputing applications written using MPI/OpenMP using job schedulers such as Slurm, HTCondor, and PBS.

Fluency in automating deployments using scripting languages such as Python, Shell scripting, and Ansible.

Fluency in application/task management using science gateway platforms and task execution frameworks such as Apache Airavata and Apache Helix.

Strong understanding of application security aspects such as authentication, authorization, auditing, confidentiality, integrity, and ability to configure, test, and operate tools that provide the above security features. e.g., Keycloak, WSO2 Identity Server, etc.

Experience working with HPC community account management to authenticate users and manage cloud and HPC (High-Performance Computing) resource credentials using a credential store and frameworks like Apache Custos.

Ability working with multi-tenant cloud resources and HPC resources.

Knowledgeable in Science Gateway Platforms and Science Gateways related to multiple fields of sciences and engineering.

Software Engineering knowledge including quality assurance, configuration management, reliability, and usability.

Ability to simultaneously drive multiple projects to successful completion.

Demonstrated time management and prioritization skills.

Ability to work on time-sensitive projects and deliver in a dependable manner.

Contact Information

For additional information about this job opening, please contact the Director of Institute of Data Engineering and Science “IDEaS”, Srinivas Aluru at aluru@cc.gatech.edu.

Equal Employment Opportunity Georgia Tech provides equal opportunity to all faculty, staff, students, and all other members of the Georgia Tech community, including applicants for admission and/or employment, contractors, volunteers, and participants in institutional programs, activities, or services. Georgia Tech complies with all applicable laws and regulations governing equal opportunity in the workplace and in educational activities. Georgia Tech prohibits discrimination, including discriminatory harassment, on the basis of race, ethnicity, ancestry, color, religion, sex (including pregnancy), sexual orientation, gender identity, national origin, age, disability, genetics, or veteran status in its programs, activities, employment, and admissions. This prohibition applies to faculty, staff, students, and all other members of the Georgia Tech community, including affiliates, invitees, and guests.

Get job alerts

Create a job alert and receive personalized job recommendations straight to your inbox.

Create alert