Mar 29, 2021 | Apache Spark, Databricks
By: Kash Sabba Introduction Databricks is an advanced analytics platform that supports data engineering, data science, and machine learning use cases from data ingestion to model deployment in production. The prominent platform provides compute...
Jul 22, 2020 | Apache Spark, Big Data, Business Intelligence
By: Ken Adams Introduction Delta Lake is an open source storage layer that sits on top of cloud storage technology such as Azure Data Lake Storage or Amazon S3. The technology was introduced by Databricks in 2019, and all of the code is available here....
Jun 26, 2020 | Apache Spark, Azure, Big Data, Business Intelligence, Cloud Technology, Microsoft Power BI
By: Phillip Sharpless Introduction to Azure Databricks Finding the right tools to manage your big data ecosystem can be a daunting task, as there seem to be a myriad of options, all advertising impressive-sounding features. One analytics platform that is...