Peanut Butter & Chocolate: Integrating Hadoop with SQL Server

WARNING!

This presentation is now obsolete. The information in this talk has not been updated in a while and could be outdated. Use the information in this presentation at your own risk and be sure to double-check things to ensure that any information you use is accurate.

ABSTRACT

So you jumped on the bandwagon and set up a Hadoop cluster--but now what? Your database developers and app developers know how to integrate with and develop against SQL Server, but the Hadoop world can be a completely different experience. This talk will help bridge the gap between SQL Server and Hadoop, using tools such as SQL Server Integration Services and Sqoop to migrate data between a Hadoop cluster and a SQL Server instance, as well as PolyBase to integrate the two like never before.

ADDITIONAL MEDIA

On August 16, 2017, I gave a version of this talk at NDC Sydney. You can get the recording on the NDC Youtube channel.

DEMO CODE

Click here to access demo code for this presentation. This includes a various T-SQL and Hive scripts, an SSIS project, and an F# project

The source code is licensed under the terms offered by the GPL.