SQL for CSV datasets using Databricks

SQL is usually reserved for interacting with databases, but in this video I show how you can use Databricks to run SQL queries against a CSV dataset. A few defaults can make working with a CSV dataset problematic: schema inference is disabled and the first row is not treated as a header. Changing these is crucial when running SQL against the CSV, since the defaults treat every single value as a string.
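To see why all-string columns matter for SQL, here is a minimal sketch that uses Python's built-in `sqlite3` as a stand-in for Databricks, so it runs anywhere. The sample data, table names, and column names are hypothetical and not taken from the video. In Spark's CSV reader the relevant options are `header` and `inferSchema`; the description does not say exactly how the video sets them.

```python
import csv
import io
import sqlite3

# Hypothetical sample data (not from the video).
CSV_TEXT = "name,price\napple,9\nbanana,10\ncherry,100\n"

# Split the header row from the data rows ourselves, which is
# what a "first row is a header" option would do for us.
rows = list(csv.reader(io.StringIO(CSV_TEXT)))
header, data = rows[0], rows[1:]

conn = sqlite3.connect(":memory:")

# Every column stored as text: mimics reading a CSV without schema inference.
conn.execute("CREATE TABLE as_text (name TEXT, price TEXT)")
conn.executemany("INSERT INTO as_text VALUES (?, ?)", data)

# Price stored as an integer: mimics a reader that inferred the column type.
conn.execute("CREATE TABLE typed (name TEXT, price INTEGER)")
conn.executemany(
    "INSERT INTO typed VALUES (?, ?)",
    [(name, int(price)) for name, price in data],
)

# The same query gives different answers depending on the column type.
text_max = conn.execute("SELECT MAX(price) FROM as_text").fetchone()[0]
typed_max = conn.execute("SELECT MAX(price) FROM typed").fetchone()[0]

print("MAX(price) with string column: ", repr(text_max))
print("MAX(price) with integer column:", repr(typed_max))
```

With the string column, `MAX(price)` compares values character by character, so `'9'` beats `'100'`. With the integer column the query returns the numeric maximum. The same kind of surprise applies to sorting, range filters, and aggregates.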

00:00 Introduction
00:30 Uploading a CSV dataset
02:15 Create a Notebook to work with
02:42 Change default options for SQL
05:18 Execute SQL against the CSV dataset