006 Common Data Models: to use or not?

Показать описание

Episode 006 discusses the broader question of whether to use a common data model like Open Cybersecurity Schema Framework (OCSF) or not. It outlines the options including schema-on-write and schema-on-read approaches, and also elucidates the trade-offs especially in the context of Databricks Lakehouse for cybersecurity. The video also demonstrates how you could even query Zeek logs (extracted from PCAP using Zeek) in cloud storage without ingesting that data into delta lake format if you really need to trade performance for cost savings.

All opinions expressed in the video are my own and do not necessarily reflect the views and opinions of my employers past or present.

lipyeow-sec

Рекомендации по теме

Комментарии

Mostly semantics, however I would still consider direct queries against the raw files to be part of the medallion model bronze layer, but I would register those as views against the raw files, vs materializing them as bronze delta tables. Those bronze views could still be used as a source for further ELT into silver tables at a later point if needed, or would be read and available for ad-hoc analysis whenever needed.

Also, with different teams, use-cases, and evolution of industry data model preferences over time, there may be situations where the raw data needs to be simultaneously mapped to multiple data models, eg OCSF (relatively new) + CIM (established). In these cases, support for each individual data model would have a discreet schema on read/write mapping decision based on how each model was intended to be utilized and for how long it is expected to be maintained.

rlhf

006 Common Data Models: to use or not?

006 Common Data Models: to use or not?

NSO Dev Days: Common Data Models

Common Data Model - Mini Tip 1

BioIT 2020 Talk - A Common Data Model

Conceptual vs Logical Data Models - What are the key differences?

Common Data Model

What do we mean when we say “data model”?

Common Data Model & Extract, Transform and Load Tutorial

Common Data Model CDM and Common Data Service CDS Oh My! Stoneridge Confab

How to Accelerate CIM Data Models — Splunk for Security Tutorials: Normalisation (Episode 2)

10 Steps to Optimize Your Data Model in Power BI

DBCC2020 - Common Data Model

Data Concepts & Types | Intro to BI | Part 6

(3) Kollaboration in BIM-Projekten mittels Common Data Environment (CDE) - Christoph Großmann

06: Data Models, Schemas, & Google Dremel (BigQuery, Protobuf)

Standardize Your Research Data with the NIH Common Data Elements Repository

How to read Conceptual Data Models - Ellie.ai

How to Read and Understand Common Data Charts

Introduction to Dataverse Tables | Common Data Model | How to Create a Table | Shaheer Ahmad

Step 2 - Common Data: Save time and money in you Civil Design workflow

Data profiling for data quality assessment - Clinical Data Models and Data Quality Assessments

What is Common Data Environment? | Studio4

Master Data Modeling in Power BI - Beginner to Pro Full Course

Why R? Webinar 006 - N.Zumel + J.Mount - Advanced Data Preparation for Supervised Machine Learning