Senior+ Data Engineer
hace 15 días
Denver
Job Description Frontera Health is revolutionizing pediatric healthcare by developing a cutting-edge, tech-enabled platform that delivers essential therapies to rural families. Our platform leverages AI/ML to create a robust video-based data model for early intervention and developmental disorders. By collaborating closely with parents, caregivers, and clinical partners, we're bridging the gap in access to care, improving health equity, and providing personalized treatment plans. Backed by leading investors like Lightspeed and Lux, Frontera Health is poised for rapid growth. Our ABA direct services are designed to meet the unique needs of children in underserved communities, providing them with the support and resources they require to reach their full potential. We are passionate about ensuring that every child, regardless of their location or socioeconomic status, has access to high-quality healthcare. By leveraging our technology platform and partnering with local providers, we are able to deliver effective ABA therapy to families who may otherwise have limited access to these essential services. We are building secure, scalable infrastructure that powers clinical insights from a wide variety of data sources—including audio, video, text, and structured forms. As a Data Engineer, you'll play a critical role in designing the backbone that enables our product, ML, and analytics teams to deliver impact at scale. Responsibilities: • Design and evolve the foundational data model, ensuring consistency and alignment across ML, engineering, analytics, and product teams., • Lead greenfield data initiatives from architecture through implementation, shaping the future of our data stack and practices., • Ingest and process diverse, unstructured data (e.g., audio, video, clinical notes, PDFs), surfacing clinically meaningful information for downstream ML and analytics workflows., • Build and maintain data ingestion pipelines connecting third-party tools, internal systems, cameras, microphones, and file-based sources., • Identify and implement scalable data storage solutions optimized for various use cases—text, media, structured data, logs, etc., • Build and support partner-facing APIs that expose data securely, ensuring alignment with product use cases and regulatory standards., • Collaborate deeply with ML, engineering, product, analytics, and clinical teams to scale complex pipelines, support model training & evaluation, and enable data-driven product development., • Establish and monitor system observability—including health checks, dashboards, logging, and alerts for pipelines and APIs., • Ensure HIPAA compliance and separation of sensitive data, including access controls, encryption, auditability, and secure pipeline design.Qualifications:, • 7+ years of experience as a Data Engineer or Backend Engineer, with strong contributions to pipeline design, data modeling, and backend infrastructure., • Proven success in leading greenfield data initiatives and building from scratch in fast-paced, startup environments., • Expertise in ingesting and transforming unstructured data for use in ML models or analytics workflows., • Strong programming skills in Python, SQL, and experience with frameworks like Airflow, Spark, Kafka, or cloud-native ETL tools., • Deep understanding of storage solutions across structured, semi-structured, and unstructured data types (e.g., PostgreSQL, NoSQL, S3, object stores, search indices)., • Familiarity with API development and experience designing secure, performant data access layers., • Practical knowledge of HIPAA compliance, secure data handling, RBAC, encryption, and audit requirements., • Experience with monitoring/observability tools (e.g., Datadog, Prometheus, Grafana, CloudWatch) for data infrastructure.Bonus if you have, • Experience in healthcare, clinical research, or other regulated domains (HIPAA, GDPR, etc.)., • Exposure to ML workflows, including data prep for training/inference pipelines., • Hands-on work with media processing pipelines (e.g., audio transcription, video segmentation, text extraction)., • Background in metadata modeling, data governance, or cross-system schema design.Why Frontera Health?, • Impactful Mission: Work on challenging and meaningful projects that leverage cutting-edge technologies (AI/ML) to improve pediatric healthcare in underserved communities., • Growth & Innovation: Be at the forefront of innovation, collaborating with a talented and passionate team in a fast-paced, dynamic environment., • Professional Development: Join a culture that values mentorship, learning, and continuous improvement., • Global Collaboration: Engage with team members around the world, broadening your perspective and fostering diverse ideas., • Competitive Compensation: We offer a competitive salary and benefits package. We are committed to: • Providing equal employment opportunities to all qualified individuals, without regard to race, color, religion, sex, national origin, disability status, sexual orientation, gender identity or expression, age, genetic information, veteran status, or any other characteristic protected by law., • Fostering a culture of inclusion and belonging where everyone feels valued and respected., • Providing reasonable accommodations to employees with disabilities., • Recruiting and hiring a diverse workforce that reflects the communities we serve., • Creating and maintaining an inclusive work environment that is free from discrimination and harassment.