About the Client Our client is a rapidly growing technology company at the intersection of healthcare and artificial intelligence. Founded by a team of industry veterans and academic leaders, this organization is on a mission to make high-quality clinical data more accessible for innovation in AI-driven healthcare. They are building a next-generation platform to support the development, training, and validation of responsible AI models with a strong emphasis on data quality and patient safety. About the Role The company is looking for a skilled Analytics Engineer to help shape, validate, and refine large-scale healthcare datasets for use in clinical research and AI product development. This role plays a crucial part in harmonizing complex, multimodal data from diverse healthcare environments into usable, well-documented formats for data scientists and AI teams. Responsibilities
- Design and maintain robust data transformation pipelines using tools such as dbt and Snowflake, prioritizing data integrity and transparency.
- Normalize and integrate various types of clinical data-including structured records, unstructured notes, imaging, and more-into a unified ontological model.
- Collaborate with engineering teams to optimize de-identification and ETL workflows from multiple cloud-hosted healthcare data sources.
- Partner with NLP experts to develop methods for extracting structured clinical information from text-based sources.
- Apply CI/CD and version control best practices within analytics codebases.
- Translate complex research and modeling requirements into scalable data engineering solutions in collaboration with technical stakeholders and external partners.
RequirementsRequired:
- At least 3 years of experience in analytics or data engineering.
- Strong proficiency with SQL and dbt.
- Bachelor's degree in a technical or quantitative field.
- Hands-on experience with cloud platforms (especially Snowflake and/or AWS).
- Competency in Python for data wrangling and feature generation.
- Familiarity with AI/ML workflows and deployment pipelines.
- Commitment to clean, modular, and well-documented code using software engineering best practices.
- Comfort working in a dynamic, fast-paced startup setting.
- Clear communication skills and the ability to advocate for robust data practices.
- Passion for advancing healthcare through trustworthy and scalable data infrastructure.
Preferred:
- Experience with healthcare data standards such as HL7, FHIR, or DICOM.
- Background in academic medical research.
- Visualization experience using tools like Tableau, Power BI, Hex, or Python libraries.
- Exposure to integrating LLM tools and frameworks (e.g., RAG, agent workflows) within analytics pipelines.
Benefits & Why Join
- Competitive compensation package: base salary in the $145K-$160K range plus equity.
- Opportunity to work at the forefront of AI and healthcare innovation.
- Collaborative and mission-driven team environment.
- Flexibility to work remotely or from the company's office in New York.
- A chance to make a real-world impact by shaping the future of clinical AI and healthcare research.