filmov
tv
Databricks Real Time Project- Preparing for Production: Automating Databricks Notebook-Part 1
![preview_player](https://i.ytimg.com/vi/VyRi6d34lJQ/maxresdefault.jpg)
Показать описание
Hello Everyone, I am Naval Yemul. Welcome back to Data Master!
Follow me on LinkedIn:
In this video, we dive into the essential skill of productionizing and automating Databricks notebooks, perfect for anyone looking to upskill in Databricks or add impactful projects to their resume. We'll walk through a real-world example from one of our e-commerce clients on Amazon, who provides raw CSV files in Azure Data Lake Storage (ADLS).
Here's what you'll learn:
Ingesting raw data into the bronze layer.
Automating the ingestion process using Databricks workflow jobs.
If you enjoy this video, please share it with your friends and colleagues, like the video, and subscribe to the channel. Have specific topics or questions? Leave a comment below!
Check out the medallion architecture and other related videos linked in the "i" button.
Links:
Medallion Architecture Explained:
Managed and External Tables:
Databricks Playlist:
Databricks Certification Playlist:
0:00 Introduction to Real Time Project/ Understanding clients requirement.
3:22 Databricks Workspace
4:16 Understanding Sample Amazon Dataset
7:10 Uploading raw data in ADLS container
8:37 Exploring External Location/ Storage Credentials
10:18 Preparing for Production Notebook
15:11 Renaming column name using toDF
17:17 Writing to External Delta table
23:31 Explaining about Widget
24:31 Creating widgets
29:03 Removing raw file
29:28 Uploading new file
33:24 Schedule the notebook
Follow me on LinkedIn:
In this video, we dive into the essential skill of productionizing and automating Databricks notebooks, perfect for anyone looking to upskill in Databricks or add impactful projects to their resume. We'll walk through a real-world example from one of our e-commerce clients on Amazon, who provides raw CSV files in Azure Data Lake Storage (ADLS).
Here's what you'll learn:
Ingesting raw data into the bronze layer.
Automating the ingestion process using Databricks workflow jobs.
If you enjoy this video, please share it with your friends and colleagues, like the video, and subscribe to the channel. Have specific topics or questions? Leave a comment below!
Check out the medallion architecture and other related videos linked in the "i" button.
Links:
Medallion Architecture Explained:
Managed and External Tables:
Databricks Playlist:
Databricks Certification Playlist:
0:00 Introduction to Real Time Project/ Understanding clients requirement.
3:22 Databricks Workspace
4:16 Understanding Sample Amazon Dataset
7:10 Uploading raw data in ADLS container
8:37 Exploring External Location/ Storage Credentials
10:18 Preparing for Production Notebook
15:11 Renaming column name using toDF
17:17 Writing to External Delta table
23:31 Explaining about Widget
24:31 Creating widgets
29:03 Removing raw file
29:28 Uploading new file
33:24 Schedule the notebook
Комментарии