USNLX Diversity Jobs

USNLX Diversity Careers

Job Information

Prometheus Federal Services Informatics Data Scientist Lead in Washington, District Of Columbia

Informatics Data Scientist Lead

Prometheus Federal Services (PFS), a trusted partner to federal health and social services agencies, has an opening for an Informatics Data Scientist Lead. This position is responsible for developing and maintaining our Python codebase, focusing on Extract-Transform-Load (ETL) processes and bioinformatics pipelines. The role requires a blend of technical expertise in data science and bioinformatics, with a strong emphasis on Python programming, data processing, and high-performance computing.

Essential Duties and Responsibilities

The successful candidate may be responsible for, among other things:

  • Develop, maintain, and document Python code for ETL processes and bioinformatics pipelines

  • Ensure that code is well-documented, version-controlled, and adheres to industry standards such as PEP8

  • Implement automated testing frameworks (e.g., pytest) to ensure the reliability and performance of code

  • Create logging mechanisms to monitor processes and troubleshoot issues

  • Design and implement ETL processes to extract data from various sources, transform it as needed, and load it into relational databases

  • Enhance and maintain existing ETL processes, ensuring they are well-documented and tested

  • Align and harmonize data from multiple sources for integration into master datasets

  • Develop bioinformatics pipelines for tasks such as variant calling, gene expression analysis, and data annotation

  • Work within a Linux-based high-performance computing environment using command-line tools

  • Utilize tools like Python’s Snakemake to create and manage complex workflows

  • Perform testing and validation of bioinformatics pipelines, ensuring accuracy and efficiency

  • Collaborate with cross-functional teams, including data engineers, researchers, and project managers

  • Participate in regular meetings to discuss project progress, challenges, and goals

  • Provide support to research and data teams, helping to structure and prepare data for analysis and modeling

Minimum Qualifications

  • Bachelor’s Data Science, Computer Science, Bioinformatics, or a related field

  • Minimum of eight (8) years of experience

  • Minimum of five (5) years of federal consulting

  • Strong experience in Python programming, particularly in the context of ETL processes and bioinformatics

  • Familiarity with version control systems (e.g., Git) and workflow management tools like Snakemake

  • Experience working in Linux-based high-performance computing environments

  • Knowledge of relational databases and data integration techniques

  • Experience with automated testing and logging best practices

  • Strong analytical and problem-solving skills

  • Excellent communication and documentation skills

  • Ability to work both independently and as part of a team

  • Authorized to work in the U.S. indefinitely without sponsorship

  • Ability to obtain a public trust 

Preferred Qualifications

  • Experience in healthcare, life sciences, or related industries

  • Master’s degree in Data Science, Computer Science, Bioinformatics, or a related field

  • VHA Experience

  • Knowledge of bioinformatics tools and pipelines

  • Familiarity with AI/ML concepts and their application to data science

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or national origin.

This position may be subject to client or government vaccination and masking guidance, policy or requirements as may be changed from time to time.

Work location is flexible if approved by the company except that position may not be performed remotely from Colorado.

Powered by JazzHR

DirectEmployers