Clinical Data Scientist
4 days ago
Philadelphia
Job DescriptionAbout the Client Our client is a rapidly growing technology company at the intersection of healthcare and artificial intelligence. Founded by a team of industry veterans and academic leaders, this organization is on a mission to make high-quality clinical data more accessible for innovation in AI-driven healthcare. They are building a next-generation platform to support the development, training, and validation of responsible AI models with a strong emphasis on data quality and patient safety. About the Role The company is looking for a skilled Data Scientist to help shape, validate, and refine large-scale healthcare datasets for use in clinical research and AI product development. This role plays a crucial part in harmonizing complex, multimodal data from diverse healthcare environments into usable, well-documented formats for clinical and AI teams. Responsibilities * Design and maintain robust data transformation pipelines using tools such as dbt and Snowflake, prioritizing data integrity and transparency. * Normalize and integrate various types of clinical data—including structured records, unstructured notes, imaging, and more—into a unified ontological model. * Collaborate with engineering teams to optimize de-identification and ETL workflows from multiple cloud-hosted healthcare data sources. * Partner with NLP experts to develop methods for extracting structured clinical information from text-based sources. * Apply CI/CD and version control best practices within analytics codebases. * Translate complex research and modeling requirements into scalable data engineering solutions in collaboration with technical stakeholders and external partners. Requirements Required: * At least 3 years of experience in analytics or data engineering. * Strong proficiency with SQL and dbt. * Bachelor's degree in a technical or quantitative field. * Hands-on experience with cloud platforms (especially Snowflake and/or AWS). * Competency in Python for data wrangling and feature generation. * Familiarity with AI/ML workflows and deployment pipelines. * Commitment to clean, modular, and well-documented code using software engineering best practices. * Comfort working in a dynamic, fast-paced startup setting. * Clear communication skills and the ability to advocate for robust data practices. * Passion for advancing healthcare through trustworthy and scalable data infrastructure. Preferred: * Experience with healthcare data standards such as HL7, FHIR, or DICOM. * Background in academic medical research. * Visualization experience using tools like Tableau, Power BI, Hex, or Python libraries. * Exposure to integrating LLM tools and frameworks (e.g., RAG, agent workflows) within analytics pipelines. Benefits & Why Join * Competitive compensation package: base salary in the $145K–$160K range plus equity. * Opportunity to work at the forefront of AI and healthcare innovation. * Collaborative and mission-driven team environment. * Flexibility to work remotely or from the company’s office in New York. * A chance to make a real-world impact by shaping the future of clinical AI and healthcare research.