Senior Principal, Data Engineering (Remote)
il y a 5 jours
Cambridge
If you are a current Jazz employee please apply via the Internal Career site. \n Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to \ntransform the lives of patients and their families. We are dedicated to developing \nlife-changing medicines for people with serious diseases — often with limited or no \ntherapeutic options. We have a diverse portfolio of marketed medicines, including leading \ntherapies for sleep disorders and epilepsy, and a growing portfolio of cancer treatments. \nOur patient-focused and science-driven approach powers pioneering research and development \nadvancements across our robust pipeline of innovative therapeutics in oncology and \nneuroscience. Jazz is headquartered in Dublin, Ireland with research and development \nlaboratories, manufacturing facilities and employees in multiple countries committed to \nserving patients worldwide. Please visit\n more information. \n The Senior Principal will be responsible for leading projects related to data engineering requirements and initiatives across Jazz Research and Development. The Senior Principal will lead data projects from across the business including Clinical, Pre-Clinical, Non-Clinical, Chemistry, RWD and Omics. Essential Functions • Lead the design, development and maintenance of data pipelines for processing Research and Development data from diverse sources (Clinical Trials, Medical Devices, Pre-Clinical, Omics, Real World Data) utilizing the AWS technology platform., • Create and optimize ETL/ELT processes for structured and unstructured data using Python, R, SQL, AWS services and other tools., • Build and maintain data repositories using AWS S3 and FSx technologies. Establish data warehousing solutions using Amazon Redshift., • Build and maintain standard data models., • Own data quality frameworks, validation processes and KPIs to ensure accuracy and consistency of data pipelines., • Implement data versioning and lineage tracking to support data traceability, regulatory compliance and audit requirements., • Create and maintain documentation for data processes, architectures, and workflows., • Implement modern software development best practices (e.g. Code Versioning, DevOps, CD/CI)., • Support collaboration with RnD Researchers, Data scientists and Stakeholders to understand data requirements and deliver appropriate solutions in a global working model., • Maintain compliance with data privacy regulations such as HIPAA, GDPR, • Expert knowledge of data engineering tools such as Python, R and SQL for data processing., • Expert proficiency with AWS services particularly S3, Redshift, FSx, Glue, Lambda., • Expert proficiency with relational databases., • Strong background in data modeling and database design., • Strong knowledge with unstructured database technologies (e.g. NoSQL) and other database types (e.g. Graph)., • Experience with Containerization such as Docker and EKS/Kubernetes., • Experience with one or more RnD research process and associated regulatory requirements., • Exposure to healthcare data standards (CDISC, HL7, FHIR, SNOMED CT, OMOP, DICOM)., • Experience to big data technologies and handling., • Knowledge of machine learning operations (MLOps) and model deployment., • Strong problem-solving and analytical abilities., • Excellent communication skills for collaborating with stakeholders., • Bachelor’s Degree in Computer Science, Statistics, Mathematics, Life Sciences, or other relevant scientific fields; Master’s Degree preferred, • Occasional mobility within office environment, • Routinely sitting for extended periods of time