Ideal Cloud-based Data Lake Framework

Ryan Arjun
6 min readFeb 17, 2021

We know that with the right technology, we can do much better than just keep up and if we could also ensure flexible development and make it easier to protect our data, process to access, process and analyze data whenever it’s required. With the right tools and best practices, an organization can use all its data, making it accessible to more users and fueling better business decisions.

New technologies innovations can improve improve the modern cloud-based data lakes, data warehousing and analytics with regard to availability, simplicity, cost, and performance which should be meet current and future needs by ableing to scale both compute and storage independently. It shouldn’t interfere with any ongoing workloads, degrade performance, or result in service unavailability due to backup processes running in the background. And it should be cheap, with clever ways to preserve our data without having to copy and move it somewhere else.

The modern data lake is foundational for the modern enterprise. If set up properly, a data lake will draw people to naturally gravitate there with ideas and come away with useful insights when it comes to ensuring the durability, resiliency, and availability of the system. 

Technology is the most basic need for Any Modern Data Lakes — Now a days, many technologies such as Databricks, Microsoft Azure…

--

--

Ryan Arjun

BI Specialist || Azure || AWS || GCP — SQL|Python|PySpark — Talend, Alteryx, SSIS — PowerBI, Tableau, SSRS