Data Lake Lead.
Job Description

AstraZeneca is a global biopharmaceutical business that focuses on the discovery, development and commercialisation of prescription medicines for some of the world's most serious diseases. At AstraZeneca, we're proud to have a workplace culture that inspires innovation and collaboration. Here, you would express diverse perspectives, contribute to an energised environment and provide creative ideas - and be rewarded for this.

We are recruiting for a Data Lake Lead to be based in one of our hub sites (Cambridge UK, Gaithersburg MD, Gothenburg Sweden).

As part of AstraZeneca's Science Data Foundation (SDF) program we are continuing the development of a Data Lake for our Research & Development teams. SDF exists to give our scientists access to data and tools at pace, accelerating their work on life saving medicines.

Through SDF we are making our data Findable, Accessible, Interoperable and Re-usable (FAIR). This is being achieved through the creation of a distributed data architecture, of which the Data Lake is a critical component. It is our desire to create a scalable solution to support the development of data communities and valuable data products. To make this solution scalable we need to provide the services and tools to our partners in R&D so that they can ingest data into the Data Lake and access the data, all in a secure and compliant way.

As the Data Lake Lead you will lead the effort to bring about a step-change in how we develop and run the R&D Data Lake as a capability. This will involve building out the people, processes and technology to meet product owner requirements and architectural strategy. You will build the Data Lake operating model and work with solution architecture to build a technology roadmap. You will be accountable for the efficient, secure and compliant running of the Data Lake. Our technology is focused around the AWS stack, but we do have an on-prem object store (SwiftStack). We wish to see the...