#100615 Research Data Analyst (Computational and Data Sci

La Jolla, CA
Jul 17, 2019
Institution Type
Four-Year Institution

UCSD Layoff from Career Appointment: Apply by 07/19/2019 for consideration with preference for rehire. All layoff applicants should contact their Employment Advisor.

Special Selection Applicants: Apply by 07/30/2019. Eligible Special Selection clients should contact their Disability Counselor for assistance.



The Research Data Analyst applies skills as a seasoned, experienced IT research professional. Uses computational, computer science, data science, and CI software research and development principles, with relevant domain science knowledge where applicable, along with professional programming concepts for medium-sized projects or portions of larger projects. Develops and optimizes a variety of computational, data science, and CI research tools and components. Performs research on current and future HPC, data, and CI technologies, hardware and software projects. Works on algorithm development, optimization, programming, performance analysis and/or benchmarking assignments of moderate scope where the tasks involve knowledge of either domain/computer science research requirements and/or CI design/implementation requirements.

cPASS is hiring a Computational Data Scientist for full-time programming and database management in support of quantitative social science research. This position will be assigned to cPASS’s Machine Learning for Social Science Lab (MSSL), an institutional division of the Center for Peace and Security Studies (cPASS). MSSL is dedicated specifically to the intersection of questions from the social sciences and methods from computer science and mathematics. It serves as an institutional home to data collection efforts and methodological tools that otherwise exist as informal, scattered, and temporary collaborations between individual scholars.

The primary duty of this position is programming, but the incumbent will also instruct and supervise graduate and undergraduate research assistants on programming projects. The methodological focus of the work will be developing and maintaining state-of-the-art machine learning pipelines and open-source software packages. MSSL specializes in large-scale knowledge bases and general machine intelligence, e.g. training computers to read unstructured text like news articles, books, or Wikipedia, and produce datasets of people, places, things, and events that are used in the social sciences.

The empirical focus will span broadly across the social sciences. We catalog and measure large-scale historical trends, e.g. the rise and fall of nations, and we investigate the causes of real-time events like wars and terrorism. Our projects range from tracking cyberattacks to measuring travel time on ancient Roman road networks. We aim to provide resources that are broadly useful across multiple disciplines.

  • A cover letter is required for a complete application.
  • Demonstrated knowledge of HPC/data science/CI.

  • Advanced skills and proven experience associated with one or more of the following: HPC hardware and software power and performance analysis and research; design, modification, implementation, and deployment of HPC, data science, or CI applications and tools.

  • Thorough experience working in a complex computing/data/CI environment encompassing all or some of the following: HPC, data science infrastructure and tools/software, and diverse domain science application base.

  • Demonstrated ability to regularly interface with management.

  • Demonstrated ability to contribute research and technical content to grant proposals.

  • Strong skills and experience in independently resolving broad computing/data/CI problems using introductory and/or intermediate principles.

  • Proven ability to understand research computing/data/CI needs, mapping use cases to requirements and how systems/software/infrastructure can support those needs and meet the requirements. Ability to develop and implement such solutions.

  • Demonstrated experience in independently resolving broad computing/data/CI problems using introductory and/or intermediate principles.

  • Extensive experience in R, Python, data management and cleaning, visualization, and text manipulation. More competitive candidates will have an understanding of one or more of the following: statistical analysis, machine learning, database technologies like SQL or Spark, LaTeX, and regular expressions.

  • Capabilities will be evaluated based on previous programming projects. Candidates should be prepared to share a portfolio of relevant original programming through their GitHub page and to answer questions about their problem-solving and their understanding of the methods employed.

  • Effective communication and interpersonal skills. Ability to communicate technical information to technical and non-technical personnel at various levels in the organization and to external research and education audiences.

  • Job offer is contingent upon a satisfactory clearance based on background check results.
