- Build and optimize resilient data ingestion pipelines that process high volume streams of clinical data and integrate them into our core oncology platform architecture.
- Own the indexing, querying, and performance tuning of our OpenSearch cluster to ensure fast text and data discovery across our oncology data layers.
- Implement and refine complex OCR processing workflows to transform raw, unstructured patient timelines into clean, structured data.
- Expand our core oncology data model by maintaining and enhancing microservices that run automated, medically defined quality checks.
- Use AI tools to automate routine setup and boilerplate while building robust backend systems.
- Design and implement impactful ETL pipelines and high-performance RESTful APIs/microservices.
- Participate in thorough code and design reviews, guide technical decisions, and mentor peer engineers.
- Lead the investigation and resolution of data and quality issues.