Optimizing Databricks Workloads: Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads

Accelerate computations and make the most of your data effectively and efficiently on Databricks
Key Features: Understand Spark optimizations for big data workloads and maximizing performanceBuild efficient big data engineering pipelines with Databricks and Delta LakeEfficiently manage Spark clusters for big data processing Book Description: Databricks is an industry-leading, cloud-based platform for data analytics, data science, and data engineering supporting thousands of organizations across the world in their data journey. It is a fast, easy, and collaborative Apache Spark-based big data analytics platform for data science and data engineering in the cloud.In Optimizing Databricks Workloads, you will get started with a brief introduction to Azure Databricks and quickly begin to understand the important optimization techniques. The book covers how to select the optimal Spark cluster configuration for running big data processing and workloads in Databricks, some very useful optimization techniques for Spark DataFrames, best practices for optimizing Delta Lake, and techniques to optimize Spark jobs through Spark core. It contains an opportunity to learn about some of the real-world scenarios where optimizing workloads in Databricks has helped organizations increase performance and save costs across various domains.By the end of this book, you will be prepared with the necessary toolkit to speed up your Spark jobs and process your data more efficiently.
What You Will Learn: Get to grips with Spark fundamentals and the Databricks platformProcess big data using the Spark DataFrame API with Delta LakeAnalyze data using graph processing in DatabricksUse MLflow to manage machine learning life cycles in DatabricksFind out how to choose the right cluster configuration for your workloadsExplore file compaction and clustering methods to tune Delta tablesDiscover advanced optimization techniques to speed up Spark jobs
Who this book is for: This book is for data engineers, data scientists, and cloud architects who have working knowledge of Spark/Databricks and some basic understanding of data engineering principles. Readers will need to have a working knowledge of Python, and some experience of SQL in PySpark and Spark SQL is beneficial.

Citeste mai mult

-10%

transport gratuit

PRP: 363.65 Lei

Acesta este Pretul Recomandat de Producator. Pretul de vanzare al produsului este afisat mai jos.

327.28Lei

363.65 Lei

Primesti 327 puncte

Primesti puncte de fidelitate dupa fiecare comanda! 100 puncte de fidelitate reprezinta 1 leu. Foloseste-le la viitoarele achizitii!

Livrare in 2-4 saptamani

Adauga in cos

Descrierea produsului

Citeste mai mult

Detaliile produsului

Editie: Paperback

Nr. pagini: 230

Cod: BRT9781801819077

Afiseaza mai mult

De pe acelasi raft

-10%

transport gratuit

Optimizing Databricks Workloads: Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads - Anirudh Kala

PRP: 363.65 Lei

327.28 Lei

327.28 Lei363.65 Lei

Adauga in cos
-10%

transport gratuit

Azure Databricks Cookbook: Accelerate and scale real-time analytics solutions using the Apache Spark-based analytics service - Phani Raj

PRP: 454.58 Lei

409.12 Lei

409.12 Lei454.58 Lei

Adauga in cos
-10%

transport gratuit

The Azure Data Lakehouse Toolkit: Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snow - Ron L'esteve

PRP: 407.92 Lei

367.13 Lei

367.13 Lei407.92 Lei

Adauga in cos
-10%

transport gratuit

Learning Spark: Lightning-Fast Data Analytics - Jules S. Damji

PRP: 435.13 Lei

391.62 Lei

391.62 Lei435.13 Lei

Adauga in cos
-10%

transport gratuit

The Definitive Guide to Azure Data Engineering: Modern Elt, Devops, and Analytics on the Azure Cloud Platform - Ron C. L'esteve

PRP: 367.12 Lei

330.41 Lei

330.41 Lei367.12 Lei

Adauga in cos
-10%

transport gratuit

Genomics in the Azure Cloud: Scaling Your Bioinformatics Workloads Using Enterprise-Grade Solutions - Colby Ford

PRP: 495.94 Lei

446.35 Lei

446.35 Lei495.94 Lei

Adauga in cos
-10%

transport gratuit

Azure Data Engineering Cookbook - Second Edition: Get well versed in various data engineering techniques in Azure using this recipe-based guide - Nagaraj Venkatesan

PRP: 430.51 Lei

387.46 Lei

387.46 Lei430.51 Lei

Adauga in cos
-10%

transport gratuit

Data Engineering with Apache Spark, Delta Lake, and Lakehouse: Create scalable pipelines that ingest, curate, and aggregate complex data in a timely a - Manoj Kukreja

PRP: 404.98 Lei

364.48 Lei

364.48 Lei404.98 Lei

Adauga in cos
-10%

transport gratuit

Machine Learning Engineering in Action - Ben Wilson

PRP: 495.92 Lei

446.33 Lei

446.33 Lei495.92 Lei

Adauga in cos
-10%

transport gratuit

Beginning Azure Synapse Analytics: Transition from Data Warehouse to Data Lakehouse - Bhadresh Shiyal

PRP: 448.72 Lei

403.85 Lei

403.85 Lei448.72 Lei

Adauga in cos
-10%

transport gratuit

Beginning Apache Spark 2

PRP: 269.20 Lei

242.28 Lei

242.28 Lei269.20 Lei

Adauga in cos
-10%

transport gratuit

Data Engineering on Azure - Vlad Riscutia

PRP: 339.92 Lei

305.93 Lei

305.93 Lei339.92 Lei

Adauga in cos
-10%

transport gratuit

Azure Data Engineer Associate Certification Guide: A hands-on reference guide to developing your data engineering skills and preparing for the DP-203 - Newton Alex

PRP: 454.58 Lei

409.12 Lei

409.12 Lei454.58 Lei

Adauga in cos
-10%

transport gratuit

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud - Robert Ilijason

PRP: 326.32 Lei

293.69 Lei

293.69 Lei326.32 Lei

Adauga in cos
-10%

transport gratuit

Trino: The Definitive Guide: SQL at Any Scale, on Any Storage, in Any Environment - Matt Fuller

PRP: 495.94 Lei

446.35 Lei

446.35 Lei495.94 Lei

Adauga in cos
-10%

transport gratuit

Azure Machine Learning Engineering: Deploy, fine-tune, and optimize ML models using Microsoft Azure - Sina Fakhraee

PRP: 347.12 Lei

312.41 Lei

312.41 Lei347.12 Lei

Adauga in cos
-10%

transport gratuit

Building an Event-Driven Data Mesh: Patterns for Designing & Building Event-Driven Architectures - Adam Bellemare

PRP: 409.14 Lei

368.23 Lei

368.23 Lei409.14 Lei

Adauga in cos
-10%

transport gratuit

High-Performance Big Data Computing - Dhabaleswar K. Panda

PRP: 400.75 Lei

360.68 Lei

360.68 Lei400.75 Lei

Adauga in cos
-10%

transport gratuit

Exam Ref MS-100 Microsoft 365 Identity and Services - Orin Thomas

PRP: 220.85 Lei

198.76 Lei

198.76 Lei220.85 Lei

Adauga in cos
-10%

transport gratuit

Cisco Cloud Infrastructure - Avinash Shukla

PRP: 464.92 Lei

418.43 Lei

418.43 Lei464.92 Lei

Adauga in cos

Parerea ta e inspiratie pentru comunitatea Libris!

Optimizing Databricks Workloads: Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads

De (autor): Anirudh Kala

Optimizing Databricks Workloads: Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads

De (autor): Anirudh Kala

Descrierea produsului

De pe acelasi raft

Optimizing Databricks Workloads: Harness the power of Apache Spark in Azure and maximize the performance of modern big data workloads - Anirudh Kala

Azure Databricks Cookbook: Accelerate and scale real-time analytics solutions using the Apache Spark-based analytics service - Phani Raj

The Azure Data Lakehouse Toolkit: Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snow - Ron L'esteve

Learning Spark: Lightning-Fast Data Analytics - Jules S. Damji

The Definitive Guide to Azure Data Engineering: Modern Elt, Devops, and Analytics on the Azure Cloud Platform - Ron C. L'esteve

Genomics in the Azure Cloud: Scaling Your Bioinformatics Workloads Using Enterprise-Grade Solutions - Colby Ford

Azure Data Engineering Cookbook - Second Edition: Get well versed in various data engineering techniques in Azure using this recipe-based guide - Nagaraj Venkatesan

Data Engineering with Apache Spark, Delta Lake, and Lakehouse: Create scalable pipelines that ingest, curate, and aggregate complex data in a timely a - Manoj Kukreja

Machine Learning Engineering in Action - Ben Wilson

Beginning Azure Synapse Analytics: Transition from Data Warehouse to Data Lakehouse - Bhadresh Shiyal

Beginning Apache Spark 2

Data Engineering on Azure - Vlad Riscutia

Azure Data Engineer Associate Certification Guide: A hands-on reference guide to developing your data engineering skills and preparing for the DP-203 - Newton Alex

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud - Robert Ilijason

Trino: The Definitive Guide: SQL at Any Scale, on Any Storage, in Any Environment - Matt Fuller

Azure Machine Learning Engineering: Deploy, fine-tune, and optimize ML models using Microsoft Azure - Sina Fakhraee

Building an Event-Driven Data Mesh: Patterns for Designing & Building Event-Driven Architectures - Adam Bellemare

High-Performance Big Data Computing - Dhabaleswar K. Panda

Exam Ref MS-100 Microsoft 365 Identity and Services - Orin Thomas

Cisco Cloud Infrastructure - Avinash Shukla

Acum se comanda