• Databricks is tightly integrated with Google Cloud’s analytics tools, making it easier to extend “AI driven insights” across data lakes and data warehouses, as well as multiple business intelligence tools.
  • Pre-built connectors integrate Databricks with BigQuery, Google Cloud Storage, Looker, and Pub/Sub.
  • Google Cloud’s AI Platform allows for fast and scalable model training using data workflows created in Databricks. Models built in Databricks can also be deployed through AI Platform Prediction.

Both Databricks and Google have long pursued strategies that strongly support open source. With this announcement, they put the spotlight on their “commitment to open innovation” and open source software. They stated that the new partnership will allow the companies to continue supporting open source communities, encouraging open innovation, and collaborating, making it easier for joint customers to use open-source technologies. Databricks donated its open-source MLflow machine learning platform to the Linux Foundation last year.

A partner ecosystem has also formed around the Databricks and Google Cloud partnership, including Accenture, Cognizant and Collibra.

Databricks was founded by the team behind the Spark research project at UC Berkeley, which later became Apache Spark. The Spark distributed big data processing engine powers the company’s unified analytics platform. This platform allows data science teams to collaborate with data engineers and other lines of business to create data products.