Data and Machine Learning Scientist

Catalytic Data Science
Full-time
Remote
United States

Position Title: Data and Machine Learning Scientist


About Catalytic Data Science (CDS):


 


Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate large volumes of scientific resources, data, and analytic tools while letting users network with colleagues in one secure and scalable environment. By helping R&D teams work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy. Our customers are passionate about making the world a better place, and we are inspired by the opportunity to help them.


 


Who You Are


You are a self-motivated data scientist with machine learning (ML) experience, eager to help build a digital workspace with a direct impact on human health, pharmaceutical research, and life sciences companies. The successful candidate will be comfortable multitasking and working independently on multiple aspects of our platform. The position requires a strong technical background in machine learning, with an emphasis on methods for understanding large-scale biological data. This individual will contribute scientific, technical, and leadership expertise to a multidisciplinary team, reporting to the Chief Science Officer while working closely with other company leaders to ensure company success.


 


Responsibilities



  • Design and implement ML methods on proprietary and open access datasets;

  • Utilize large-scale datasets to generate statistically motivated research hypotheses;

  • Apply statistical methods to rigorously test and evaluate research hypotheses;

  • Develop and foster external collaborations;

  • Provide expert technical guidance and support customers in the design and analysis of experiments;

  • Work both independently and as part of a collaborative team to develop data analysis and machine learning solutions;

  • Implement, evaluate, and improve NLP algorithms on large-scale datasets.


 


Qualifications



  • Ph.D. with 3+ years (or MS with 5+ years) of industry experience in systems biology, bioinformatics, computational biology, data science, ML, or an equivalent field;

  • Extensive experience applying ML techniques to large-scale datasets;

  • Proficiency in either R or Python;

  • Experience with ML frameworks such as Torch, PyTorch, TensorFlow, Scikit-learn, etc.;

  • Familiarity with Atlassian JIRA;

  • Familiarity with Large Language Models;

  • Excellent interpersonal and communication skills, including solid presentation and writing skills;

  • Must be self-motivated, a quick learner, dedicated to achieving excellence, and eager to take on new challenges.






In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.