We are seeking highly skilled Computational Biologists or Bioinformaticians - knowledge of machine learning is desired as that is an active project. This is a full time position requireing strong skills in biostatistics, XML, R, Python and knowledge of JAVA and or C, C++
Working with our team of scientists and informatics staff on continued development and implementation of tumor vs germline analyses based on our informatics pipeline applied to whole exome and whole genome datasets. Successful candidates will collaborate with our genetics and informatics groups to develop and implement computationally intensive machine learning and statistical classification algorithms and resampling methods.
Our mature informatics pipeline was developed and validated using whole genome sequence data on hundreds of thousands of patients. It begins with the raw sequence data and results in comprehensive variant calls, tumor-specific calls, and variant annotations along with raw BAM file data stored in our novel sequence database infrastructure. These data and results are efficiently accessible through our validated web-based sequence analysis tools that integrate public domain disease variant and annotation datasets. The raw sequence data underlying potential pathogenic variants in germline or tumor sequences are immediately visualized for confirmation using our genome browser. This user-friendly system significantly reduces hands on time for the analysis and interpretation of sequence-based tests. The successful candidate will be located at our Cambridge site and will work with both our genetics group and our informatics group. Additional duties include training of outside users and the testing and validation of new versions of the sequence analysis tools. This is a full-time position based in our Boston, MA location. Relocation provided.
•A PhD in statistics, bioinformatics or related field and postdoctoral experience in working with advanced statistical algorithms and software applications for high-throughput NGS data
•Significant high-performance computing experience
•Significant programming experience is required (R/Unix shell programming and Python/Perl/Java/C++)
•Familiarity with some of the mainstream software packages for secondary analysis, such as BWA, GATK, MuTect, VarScan2, etc.
•A good overview of the key public reference databases for clinical variation data
•Experience with mainstream biomedical ontology databases
Additional experience in tumor biology and genetics would be an advantage. Candidates with a proven track record of "omics" data analysis, independent collaborative work and leading contributions to a biological discipline preferred.
We are a major force in the life science industry and plan to be so in the Genomic Sequencing space as service providers, research scientists and collaborators. We are a global in scope with over 10,000 employees. Our Genomic research capability uses our proprietary algorithms to create a calibrated pipeline that is analyzed by our Clinical Sequence Analyzer, in our Genomic Ordered Relations Architecture. Our capabilities (today) exceed the data analysis capabilities of our competition.
Sun, 14 May 2017 16:02:05 PDT