Adi Polak Book

Language: en
Pages: 294

Scaling Machine Learning with Spark

Author(s): Adi Polak

Categories: Computers

Type: Book
-
Published: 2023-03-07
-
Publisher: "O'Reilly Media, Inc."

Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals--allowing data and ML practitioners to collaborate and understand each other better. Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book show...

Language: en
Pages: 323

Scaling Machine Learning with Spark

Author(s): Adi Polak

Categories: Computers

Type: Book
-
Published: 2023-03-07
-
Publisher: "O'Reilly Media, Inc."

Language: en
Pages: 186

Introducing MLOps

Author(s): Mark Treveil, Nicolas Omont, Clément Stenac, Kenji Lefevre, Du Phan, Joachim Zentici, Adrien Lavoillotte, Makoto Miyazaki, Lynn Heidmann

Categories: Computers

Type: Book
-
Published: 2020-11-30
-
Publisher: O'Reilly Media

More than half of the analytics and machine learning (ML) models created by organizations today never make it into production. Some of the challenges and barriers to operationalization are technical, but others are organizational. Either way, the bottom line is that models not in production can't provide business impact. This book introduces the key concepts of MLOps to help data scientists and application engineers not only operationalize ML models to drive real business change but also maintain and improve those models over time. Through lessons based on numerous MLOps applications around the world, nine experts in machine learning provide insights into the five steps of the model life cyc...

Language: en
Pages: 263

97 Things Every Data Engineer Should Know

Author(s): Tobias Macey

Categories: Computers

Type: Book
-
Published: 2021-06-11
-
Publisher: "O'Reilly Media, Inc."

Take advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often nagging challenges. Edited by Tobias Macey, host of the popular Data Engineering Podcast, this book presents 97 concise and useful tips for cleaning, prepping, wrangling, storing, processing, and ingesting data. Data engineers, data architects, data team managers, data scientists, machine learning e...

Language: en
Pages: 454

Fundamentals of Data Engineering

Author(s): Joe Reis, Matt Housley

Categories: Computers

Type: Book
-
Published: 2022-06-22
-
Publisher: "O'Reilly Media, Inc."

Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available through the framework of the data engineering lifecycle. Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, ...

Language: en
Pages: 391

Delta Lake: The Definitive Guide

Author(s): Denny Lee, Tristen Wentling, Scott Haines, Prashanth Babu

Categories: Computers

Type: Book
-
Published: 2024-10-30
-
Publisher: "O'Reilly Media, Inc."

Ready to simplify the process of building data lakehouses and data pipelines at scale? In this practical guide, learn how Delta Lake is helping data engineers, data scientists, and data analysts overcome key data reliability challenges with modern data engineering and management techniques. Authors Denny Lee, Tristen Wentling, Scott Haines, and Prashanth Babu (with contributions from Delta Lake maintainer R. Tyler Croy) share expert insights on all things Delta Lake--including how to run batch and streaming jobs concurrently and accelerate the usability of your data. You'll also uncover how ACID transactions bring reliability to data lakehouses at scale. This book helps you: Understand key data reliability challenges and how Delta Lake solves them Explain the critical role of Delta transaction logs as a single source of truth Learn the Delta Lake ecosystem with technologies like Apache Flink, Kafka, and Trino Architect data lakehouses with the medallion architecture Optimize Delta Lake performance with features like deletion vectors and liquid clustering

Language: en
Pages: 267

Fundamentals of Data Observability

Author(s): Andy Petrella

Categories: Computers

Type: Book
-
Published: 2023-08-14
-
Publisher: "O'Reilly Media, Inc."

Quickly detect, troubleshoot, and prevent a wide range of data issues through data observability, a set of best practices that enables data teams to gain greater visibility of data and its usage. If you're a data engineer, data architect, or machine learning engineer who depends on the quality of your data, this book shows you how to focus on the practical aspects of introducing data observability in your everyday work. Author Andy Petrella helps you build the right habits to identify and solve data issues, such as data drifts and poor quality, so you can stop their propagation in data applications, pipelines, and analytics. You'll learn ways to introduce data observability, including setting up a framework for generating and collecting all the information you need. Learn the core principles and benefits of data observability Use data observability to detect, troubleshoot, and prevent data issues Follow the book's recipes to implement observability in your data projects Use data observability to create a trustworthy communication framework with data consumers Learn how to educate your peers about the benefits of data observability

Language: en

Scaling Machine Learning with Spark

Author(s): Adi Polak

Type: Book
-
Published: 2023-04-04
-
Publisher: O'Reilly Media

Get up to speed on Apache Spark, the popular engine for large-scale data processing, including machine learning and analytics. If you're looking to expand your skill set or advance your career in scalable machine learning with MLlib, distributed PyTorch, and distributed TensorFlow, this practical guide is for you. Using Spark as your main data processing platform, you'll discover several open source technologies designed and built for enriching Spark's ML capabilities. Scaling Machine Learning with Spark examines various technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLFlow, TensorFlow, PyTorch, and Petastorm. This book sho...

Language: en
Pages: 463

Machine Learning Engineering with Python

Author(s): Andrew P. McMahon

Categories: Computers

Type: Book
-
Published: 2023-08-31
-
Publisher: Packt Publishing Ltd

Transform your machine learning projects into successful deployments with this practical guide on how to build and scale solutions that solve real-world problems Includes a new chapter on generative AI and large language models (LLMs) and building a pipeline that leverages LLMs using LangChain Key Features This second edition delves deeper into key machine learning topics, CI/CD, and system design Explore core MLOps practices, such as model management and performance monitoring Build end-to-end examples of deployable ML microservices and pipelines using AWS and open-source tools Book DescriptionThe Second Edition of Machine Learning Engineering with Python is the practical guide that MLOps a...

Language: en
Pages: 603

The Machine Learning Solutions Architect Handbook

Author(s): David Ping

Categories: Computers

Type: Book
-
Published: 2024-04-15
-
Publisher: Packt Publishing Ltd

Design, build, and secure scalable machine learning (ML) systems to solve real-world business problems with Python and AWS Purchase of the print or Kindle book includes a free PDF eBook Key Features Go in-depth into the ML lifecycle, from ideation and data management to deployment and scaling Apply risk management techniques in the ML lifecycle and design architectural patterns for various ML platforms and solutions Understand the generative AI lifecycle, its core technologies, and implementation risks Book DescriptionDavid Ping, Head of GenAI and ML Solution Architecture for global industries at AWS, provides expert insights and practical examples to help you become a proficient ML solution...

Welcome to our book review site go-pdf.online!

Scaling Machine Learning with Spark

Scaling Machine Learning with Spark

Introducing MLOps

97 Things Every Data Engineer Should Know

Fundamentals of Data Engineering

Delta Lake: The Definitive Guide

Fundamentals of Data Observability

Scaling Machine Learning with Spark

Machine Learning Engineering with Python

The Machine Learning Solutions Architect Handbook

Recently Searched