MOD 20775: Perform Data Engineering on Microsoft HDInsight


This five-day course will give students the ability plan and implement big data workflows on HDInsight.

Those who may be interested in this course are:

  • Data engineers
  • Data architects
  • Data scientists
  • Data developers who intend to implement big data engineering workflows on HDInsight.

After completing this course, students will be able to:

  • Deploy HDInsight Clusters
  • Authorizing Users to Access Resources
  • Loading Data into HDInsight
  • Troubleshooting HDInsight
  • Implement Batch Solutions
  • Design Batch ETL Solutions for Big Data with Spark
  • Analyze Data with Spark SQL
  • Analyze Data with Hive and Phoenix
  • Describe Stream Analytics
  • Implement Spark Streaming Using the DStream API
  • Develop Big Data Real-Time Processing Solutions with Apache Storm
  • Build Solutions that use Kafka and HBase


It is recommended that as well as professional experience, students should have:

  • Programming experience using R, and familiarity with common R packages
  • Knowledge of common statistical methods and data analysis best practices
  • Basic knowledge of the Microsoft Windows operating system and its core functionality
  • Working knowledge of relational databases
Contact Us



FTL: 954.351.7040

MIA: 305.648.2000

Request More Information


Current Promotions!






Email Newsletter icon, E-mail Newsletter icon, Email List icon, E-mail List icon Sign up for our Email Newsletter!



Students - Orbund Log-In






  • Follow us on
  • Facebook Academy Page
  • Twitter Academy Page