Highlights from the Data Engineering Summit Now Available On Demand
We’ve just wrapped up our first-ever Data Engineering Summit. Thank you to all the speakers, partners, and attendees who joined us! If you weren’t able to make it, don’t worry: you can watch the sessions on demand and stay up to date on essential data engineering tools and skills. Learn more about some of the popular sessions below.
Streaming Featurization with Ibis, Substrait and Apache Arrow
Wes McKinney | CTO and Cofounder | Voltron Data and David Palaitis | Managing Director | Two Sigma
This session covers how Two Sigma and Voltron Data have leveraged their areas of expertise to collaborate and improve the performance of featurization workflows using the Ibis, Substrait, and Arrow software stack.
Thrive in the Data Tooling Tornado: Lead, Hire, and Execute Better by Escaping Older Industrial Antipatterns
Adam Breindel | Independent Consultant
This talk covers the challenges faced by data tooling products and how they result from outdated antipatterns held over from 20th-century industry. It also looks at solutions that make it easier for data teams to hire, manage, retain, and execute effectively using modern data tooling, all while gaining that sought-after efficiency.
Reliable Pipelines and High Quality Data without the Toil
Kyle Kirwan | Co-Founder | Bigeye
Check out this talk to learn about the history of data quality testing and data observability inside Uber, the differences between data observability and other methods like data pipeline tests, and how techniques developed at Uber can be applied by data engineers anywhere. It also gives an overview of the commercial and open source tools available today.
Building a Data Mesh: Strategies and Best Practices for Navigating the Implementation of a Data Mesh
Hajar Khizou | Lead Data Engineer | SustainCERT
This session explores the principles and practices of data mesh, a new approach to thinking about data based on a distributed architecture that promotes decentralized ownership and control of data assets. It also addresses the strategies and best practices for implementing a data mesh.
Applying Engineering Best Practices in Data Lakes Architectures
Einat Orr | CEO and Co-Founder | Treeverse
This talk examines why agile methodology, continuous integration, continuous deployment, and production monitoring are essential for data lakes. You’ll also learn how these best practices create a safe environment that produces higher-quality data in less time.
Getting into Data Engineering
Joe Reis | CEO | Ternary Data
Perfect for those interested in getting into data engineering, this talk covers the key questions you should answer when embarking on a data engineering career. In particular, Joe Reis addresses the economic climate of 2023.
Beyond Monitoring: The Rise of Data Observability
Shane Murray | Field CTO | Monte Carlo
This session addresses the problem of “data downtime” (periods when data is partial, erroneous, missing, or otherwise inaccurate) and how to eliminate it in your data ecosystem with end-to-end data observability. It covers why data observability matters and the tactics you can use to put it into practice today.
Watch for free here
You can access all of the sessions from the Data Engineering Summit on-demand here. Looking for more expert-led instruction in data engineering? Check out our upcoming conference, ODSC East 2023, for 3 days of training sessions, workshops, and talks. And be sure to register soon to lock in our Early Bird Discount of 60% off any in-person or virtual pass.
Originally posted on OpenDataScience.com