filmov
tv
SQL for CSV datasets using Databricks

Показать описание
SQL is usually reserved for interacting with databases but in this video I show how you can use Databricks to run SQL queries against a CSV dataset. There are a few defaults that can make working with a CSV dataset problematic, like disabled schema infering and no headers. These are crucial when running SQL against the CSV since the defaults will treat every single value as a string.
Resources:
00:00 Introduction
00:30 Uploading a CSV dataset
02:15 Create a Notebook to work with
02:42 Change default options for SQL
05:18 Execute SQL against the CSV dataset
Resources:
00:00 Introduction
00:30 Uploading a CSV dataset
02:15 Create a Notebook to work with
02:42 Change default options for SQL
05:18 Execute SQL against the CSV dataset