PurpleLab Expands Healthcare Data Access on Databricks and Snowflake

Healthcare analytics firm PurpleLab has expanded access to its real-world data by integrating its platform with both Databricks and Snowflake. The move allows researchers, payers, and providers to query de-identified claims, EHR, and pharmacy data using cloud-native analytics engines. The platform's architecture emphasizes secure, governed access to meet compliance requirements in healthcare.

- By making its data available in the Databricks and Snowflake marketplaces, PurpleLab aims to reduce the friction of traditional data procurement, allowing users to instantly sample and access datasets within their existing cloud environments. This strategy targets analytics and AI workflows that demand speed and transparency. - PurpleLab's core platform, HealthNexus®, is a no-code analytics environment built on a dataset of over 16 billion annual medical and pharmacy claims. The company's CLEAR Claims data product standardizes and enriches raw claims into a unified data model, which includes social determinants of health (SDOH) and one of the largest mortality datasets available. - The integration with Databricks and Snowflake reflects a broader industry trend where healthcare data is growing at a projected 36% annually. Organizations often adopt a hybrid "lakehouse" approach, using Databricks for processing large, unstructured datasets and AI/ML workloads, while leveraging Snowflake for high-performance SQL analytics on structured data like claims. - To ensure data quality and reliability for analytics, engineering teams in healthcare are increasingly adopting tools like dbt (data build tool) to apply software development best practices to data transformation. This includes version control, automated testing, and clear documentation, which are critical for managing complex data pipelines in regulated environments. - For governance in regulated healthcare environments, data observability has become critical. It complements data governance by providing real-time monitoring of data health, quality, and lineage across complex pipelines, helping to ensure HIPAA compliance and reduce the risk of non-compliance penalties. - The founder and CEO of PurpleLab, Mark Brosso, has a history of building and selling healthcare data companies. His previous company, which was the first to use the Universal Patient Key (now Datavant) for de-identification, was sold to IMS Health (now IQVIA). - The company has raised a total of $43 million over three funding rounds, with a significant $40 million Series B in 2022 led by Primus Capital. This funding was aimed at accelerating investment in new data assets and product development. - AI copilots are emerging to accelerate data workflows in healthcare, from automating the generation of clinical notes to assisting with SQL query writing and data exploration. These tools aim to reduce the administrative burden on both clinicians and data teams, allowing them to focus more on analysis and patient care.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.