Sr. Data Delivery Specialist (Public and purchased data collection)
South San Francisco, US
Full-time
Hybrid
$128K/yr - $237K/yr
New Grad, Entry Level
Roche is a company dedicated to advancing science and ensuring access to healthcare. They are seeking a Sr. Data Delivery Specialist to support the delivery and operationalization of real-world data and clinical-genomic datasets, ensuring they are accessible and well-documented for research and analytics.
Support intake, tracking, and fulfillment of real-world data requests, including clinical-genomic and multimodal datasets. Assist in preparing datasets for delivery, ensuring completeness, quality, and documentation
Coordinate with external partners (e.g., Caris, FMI) to support data requests, query submissions, and data returns. Assist in managing communications, timelines, and deliverables
Assist in managing data access workflows, ensuring appropriate approvals, training, and compliance with data usage agreements. Track data usage and maintain documentation
Work with sequencing, imaging, and proteomics datasets, supporting standardized formatting, validation, and integration readiness. Contribute to handling emerging multimodal data types and evolving standards
Perform quality checks, metadata validation, and documentation to ensure datasets are analysis-ready. Support troubleshooting of data delivery issues and escalate when necessary
Contribute to early-stage efforts in AI-enabled data curation and harmonization, supporting improved scalability and efficiency in data delivery workflows
Partner with internal teams (e.g., AIBT, CBM, gRED TM, pRED DTAs) to support data integration and delivery needs across diverse scientific use cases
Qualification
Required
PhD and 0-2 years of experience, Master's degree and 3-5 years of experience or a Bachelor's degree and 4-7 years of experience in Data Science, Bioinformatics, Health Informatics, Biomedical Engineering, Computer Science, or a related field and experience working with real-world data, clinical data, or biomedical datasets
Strong attention to detail and commitment to data quality and reliability
Strong organizational and communication skills, with the ability to support multiple stakeholders
You are someone who has the technical skills for: Programming: Python (Pandas) or SQL; familiarity with Bash is a plus. Data Formats: Experience with structured data (CSV, JSON, Parquet); exposure to scientific formats is a plus. Data Platforms: Exposure to cloud environments (AWS S3, GCS, or Azure). Tools: Familiarity with Jupyter notebooks, data portals, or workflow tools is beneficial
Preferred
Exposure to clinical-genomic or multimodal datasets (e.g., Caris, FMI, or similar)
Familiarity with data governance and compliance in healthcare or life sciences
Exposure to AI/ML workflows or data preparation for analytics
Understanding of FAIR data principles and metadata standards
Interest in working with external data partnerships and large-scale data ecosystems
Benefits
A discretionary annual bonus may be available based on individual and Company performance.
This position also qualifies for the benefits detailed at the link provided below.
Roche is a pharmaceutical and diagnostics company that offers medicines and diagnostic tests for various medical conditions and diseases.