Databricks & MinIO Target On-Prem AI Data
Databricks and MinIO have partnered to launch a solution to help enterprises use their on-premises data for AI and analytics. This directly addresses a major hurdle in corporate and PE due diligence: accessing siloed, secure data to build predictive models and accurately assess a company's digital asset value.
The partnership directly targets "data gravity," the challenge of moving massive, petabyte-scale datasets to the cloud for analysis. Many enterprises in regulated industries like finance and healthcare must keep data on-premises for security, compliance, or data sovereignty reasons, creating a major roadblock for cloud-based AI platforms. This joint solution allows Databricks' platform to analyze data where it resides, avoiding costly and complex data migration. At the core of the collaboration is MinIO's AIStor Table Sharing, which integrates Databricks' open-source Delta Sharing protocol directly into its on-premises object storage. This enables secure, real-time data sharing without creating duplicate datasets or complex data pipelines. The solution supports both Delta Lake and Apache Iceberg table formats, preventing vendor lock-in. The on-premises AI data center market was valued at over $34.5 billion in 2025 and is projected to grow at a CAGR of 22.3% through 2033. This growth is driven by the need to process sensitive data locally and the high costs associated with moving large datasets to the cloud. However, on-prem AI presents challenges like high upfront hardware costs, the need for specialized IT skills, and infrastructure that can quickly become obsolete. MinIO, a unicorn valued at $1 billion after its $103 million Series B in 2022, specializes in high-performance, S3-compatible object storage. Its software-defined storage is designed for large-scale AI/ML workloads and can run on any cloud or on-premises infrastructure. The company has seen rapid adoption, with ARR growing over 201% in 2021 and its customer base expanding by 208% the same year. Databricks, a major player in the data and AI space, has a valuation of $134 billion following a $4 billion funding round in late 2025. The company's Lakehouse platform unifies data warehousing and data lakes to provide a single source for analytics and AI. This partnership extends Databricks' reach to valuable on-premises datasets, a key strategic move as it competes with other major cloud and data analytics providers.