作者简介

Joe Reis is a business-minded data nerd who’s worked in the data industry for 20 years, with responsibilities ranging from statistical modeling, forecasting, machine learning, data engineering, data architecture, and almost everything else in between. Joe is the CEO and Co-Founder of Ternary Data, a data engineering and architecture consulting firm based in Salt Lake City, Utah. In addition, he volunteers with several technology groups and teaches at the University of Utah. In his spare time, Joe likes to rock climb, produce electronic music, and take his kids on crazy adventures.
Matt Housley is a data engineering consultant and cloud specialist. After some early programming experience with Logo, Basic and 6502 assembly, he completed a PhD in mathematics at the University of Utah. Matt then began working in data science, eventually specializing in cloud based data engineering. He co-founded Ternary Data with Joe Reis, where he leverages his teaching experience to train future data engineers and advise teams on robust data architecture. Matt and Joe also pontificate on all things data on The Monday Morning Data Chat.

内容简介

Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available in the framework of the data engineering lifecycle.

Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, storage, governance, and deployment that are critical in any data environment regardless of the underlying technology.

This book will help you:

Assess data engineering problems using an end-to-end data framework of best practices

Cut through marketing hype when choosing data technologies, architecture, and processes

Use the data engineering lifecycle to design and build a robust architecture

Incorporate data governance and security across the data engineering lifecycle


Joe Reis is a business-minded data nerd who’s worked in the data industry for 20 years, with responsibilities ranging from statistical modeling, forecasting, machine learning, data engineering, data architecture, and almost everything else in between. Joe is the CEO and Co-Founder of Ternary Data, a data engineering and architecture consulting firm based in Salt Lake City, Utah...

下载地址

豆瓣评论

  • 三七李
    读了大半年,获得了一个总体概念,蛮好的,知道了很多不知道的东西,大数据的软件咋这么多…08-08
  • 流光
    前后部分对调一下可能会读得更顺畅。还是提供了一些能面试的时候bb的东西03-18
  • 2024
    early release 版本写得挺像通讯约稿的/"The data engineer we discuss in this book can be described more precisely as a data lifecycle engineer."/工业界以数据为对象的生产实践中,数据科学和数据工程的分野。/Data Mesh, Serving Data for Analytics, Machine Learning, and Reverse ETL/中间有几章可以当作checklist07-02
  • 3点一直线
    这本书很详细。 很高屋建瓴, 主要讲了一些大方向和技术选型 技术栈的选择, 而且很新, 和业界很贴切。 唯一的问题就是有点抽象。 可能适合没在这一行工作的人, 了解日常工作内容? 或者作为tech lead 做技术选型。 对于一线工作的, 这个有点太overview了, 没实例内容, 都要靠自己感悟。02-27
  • elfish
    全面也不过时,但纯理论概念总则介绍没有case study实例设计。也许更适合有相关工作经验的人阅读。先看Part II再看Part I更好些。推荐油管上CMU的15-721 Ad DBS视频:https://15721.courses.cs.cmu.edu/spring2023/。07-27

猜你喜欢

大家都喜欢