Kniha Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python Brian Lipp

Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python

Autor: Brian Lipp
Jazyk: Angličtina
Väzba: Brožovaná
Vydavateľ: PACKT PUB
Dostupnosť: Skladom u dodávateľa
Odosielame za 9-15 dní
46.75
Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and KafkaKey Features...

Informácie o knihe

Autor
Jazyk
Angličtina
Väzba
Kniha - Brožovaná
Vydalo
2023
Stránok
318
EAN
9781801070492
ISBN
1801070490
Enbook ID
44209037
Vydavateľ
Hmotnosť
549
Rozmery
191 x 235 x 17

Kompletný popis

Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and Kafka


Key Features:


  • Develop modern data skills used in emerging technologies
  • Learn pragmatic design methodologies such as Data Mesh and data lakehouses
  • Gain a deeper understanding of data governance
  • Purchase of the print or Kindle book includes a free PDF eBook


Book Description:


Modern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You'll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake.


Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You'll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you'll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you'll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you'll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you'll get hands-on experience with Apache Spark, one of the key data technologies in today's market.


By the end of this book, you'll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.


What You Will Learn:


  • Understand data patterns including delta architecture
  • Discover how to increase performance with Spark internals
  • Find out how to design critical data diagrams
  • Explore MLOps with tools such as AutoML and MLflow
  • Get to grips with building data products in a data mesh
  • Discover data governance and build confidence in your data
  • Introduce data visualizations and dashboards into your data practice


Who this book is for:


This book is for developers, analytics engineers, and managers looking to further develop a data ecosystem within their organization. While they're not prerequisites, basic knowledge of Python and prior experience with data will help you to read and follow along with the examples.

Mohlo by vás zaujímať

73.97

Black Manhattan

James Weldon Johnson
15.32

Analysis of Chinese Characters

George Durand 1869- Wilder
23.08

Cinema Speculation

Quentin Tarantino
22.59

Modern Gothic

Lerah Mae Barcenilla
13.55
18.46
9.72
47.84
9.72

Lana Del Rey

Selena Fragassi
14.33

Computer Systems

David R. O'Hallaron
236.66

Zákazníci, ktorí si kúpili túto knihu, kúpili tiež

Amateri

Radaković Borivoj
5.00

Úlet na dva týdny

Samantha Towle
13.81
9.32

Splitterfasernackt

Lilly Lindner
14.92

The Great Nowitzki

Thomas Pletzinger
29.56