Senior Data Engineer
hace 9 días
Seville
pbSenior Data Engineer /b /ppbr/ppbr/ppbIn a few words /b /ppbr/pullibPosition: /b Senior Data Engineer – AI Labs at Insud Pharma /lilibLocation: /b Madrid (hybrid) /lilibExperience: /b 5+ years in Data Engineering or Analytics Engineering roles /li /ulpbr/ppbr/ppbAbout AI Labs at Insud Pharma /b /ppbr/ppAI Labs is Insud Pharma’s transversal team for bArtificial Intelligence, Data Science, and Machine Learning /b, working across the group to deliver production‑ready data and AI solutions with real impact. /ppThe team operates across a wide range of areas, including bRD and clinical data, global health and epidemiology, manufacturing and quality, supply chain and operations, and business analytics /b, partnering closely with business units, and external organizations. /ppAI Labs combines strong engineering standards with pragmatic execution, focusing on building scalable AI‑enabled solutions that move from experimentation to real‑world adoption. /ppbr/ppbr/ppbRole Context /b /ppbr/ppThis role will be bprimarily focused on projects linked to Fundación Mundo Sano /b, an international organization dedicated to improving health and quality of life for vulnerable communities through research, innovation, and international cooperation (e.g. neglected diseases such as Chagas). /ppThe goal of this position is to ensure that data coming from multiple sources becomes bavailable, consistent, reliable, and reusable /b, enabling dashboards, reporting, and AI/ML use cases. /ppDue to the international nature of the projects, the role may involve boccasional travel to Latin America, Africa, or other regions /b to work closely with local teams and better understand data generation on the ground. /ppbr/ppbr/ppbRole Objective /b /ppbr/ppBuild and operate a bcoherent, well‑structured data foundation /b for Fundación Mundo Sano projects by owning the bdata engineering layer end‑to‑end /b: ingestion, modeling, data quality, availability, monitoring, and data delivery for dashboards and AI enablement. /ppbr/ppbr/ppbKey Responsibilities /b /ppbr/pulliBuild and operate bdata ingestion pipelines (ETL/ELT) /b from multiple sources (field programs, research datasets, epidemiological surveillance systems, partners, files, APIs). /liliDesign and maintain bdata models and curated datasets /b that standardize entities, metrics, and definitions across projects. /liliEnsure bdata quality, reliability, and consistency /b through automated checks, monitoring, and basic observability. /liliDecide how data is bstructured, stored, and versioned /b to enable long‑term reuse and scalability. /liliMake data bavailable and easy to consume /b for dashboards, reporting, and AI/ML use cases. /lilibProactively guide business and project teams on data best practices /b, setting standards, shaping requirements, and influencing how data should be collected, structured, and used. /liliCollaborate closely with stakeholders to translate needs into bscalable, maintainable data foundations /b. /li /ulpbr/ppbr/ppbTechnologies (examples – adapt to actual stack) /b /ppbr/pullibLanguages: /b SQL, Python /lilibPipelines / orchestration: /b Airflow, Prefect, Dagster or similar /lilibTransformations: /b dbt or equivalent /lilibStorage: /b Data warehouse / lakehouse (e.g. Snowflake, BigQuery, Databricks, Synapse) /lilibData quality / monitoring: /b Great Expectations, Soda, or similar /lilibBI / Dashboards: /b Power BI, Tableau, Looker or similar /lilibEngineering basics: /b Git, CI/CD, basic cloud concepts (AWS / Azure / GCP) /li /ulpbr/ppbr/ppbWhat we are looking for /b /ppbr/pulliA senior, hands‑on bData Engineer /b with a strong ownership mindset, comfortable building and operating bcore data structures and pipelines /b. /lili5+ years of experience in bData Engineering or Analytics Engineering /b roles. /liliStrong bSQL /b and solid bPython /b, with hands‑on experience building and running bETL/ELT pipelines /b in production. /liliProven experience integrating bheterogeneous and diverse data sources /b (multiple systems, files, APIs, changing schemas, inconsistent identifiers). /liliGood understanding of bdata modeling /b and analytical data structures, with the ability to standardize entities, metrics, and definitions across projects. /liliExperience ensuring bdata quality, reliability, and monitoring /b, including automated checks and basic observability. /liliComfortable making data bavailable for dashboards, reporting, and AI/ML use cases /b through curated, analytics‑ready datasets. /liliAble to work proactively with business and project teams, bshaping requirements and setting data standards /b rather than waiting for fully specified inputs. /lilibSpanish /b as the daily working language; bEnglish /b required for specific projects and international collaboration. /liliPragmatic, ownership‑driven mindset, strong communication skills, and motivation to work on bsocial and public‑health impact projects /b. /li /ulpbr/ppbr/ppbOur benefits! /b /ppbr/pp⏰ Flexible start time from Monday to Friday /pp Permanent contract. /p