Optimizing Hive on Azure HDInsight (Managed Hadoop on Azure)

HDInsight lets you run Big Data technologies, including Hadoop, on Microsoft Azure. If you have a Hadoop cluster, you more than likely use Hive in some capacity. Hive is the SQL engine on Hadoop; it is mature, scalable, and heavily used in production. Hive can run different types of workloads, including ETL, reporting, and data mining, and each of these workloads needs to be tuned to get the best performance. In this session you will learn how to optimize your system for them. We will discuss performance optimization at both the architecture layer and the execution engine layer. Come prepared for a hands-on view of HDInsight, including demos.