Get an overview of HDInsight components, common terminology, and scenarios, and see resources for HDInsight, Apache Hadoop, and Microsoft Business Intelligence.
Provision a cluster, run a sample MapReduce program from the sample gallery, examine the output of this program, and connect to BI tools.
Learn how to use the Microsoft HDInsight Emulator for Windows Azure, which provides a local development environment.
Learn how to develop and test a Hadoop streaming MapReduce program on HDInsight Emulator, and then run it on HDInsight using a PowerShell script.
Use HiveQL to query data in an Apache log4j log file, and report basic statistics.
Write Pig Latin statements to analyze an Apache log4j log file, and run various queries on the data to generate output.
The four samples included are intended to get you started quickly and to give you an extensible testing bed to work through concepts. Create data sets, and observe the effects of data size on jobs.
Learn what Hadoop components and versions are included in HDInsight.
Learn how HDInsight works with data that is stored in Windows Azure Blob storage, when to store data in HDFS, and when to store it in Blob storage.
Learn how to upload and access data in HDInsight using Azure Storage Explorer, Windows Azure PowerShell, the Hadoop command line, or Sqoop.
A key feature of Microsoft’s Big Data solution is solid integration of Apache Hadoop with Microsoft Business Intelligence (BI) components. Learn how to use Power Query to import HDInsight data into Excel.
Import data from Windows Azure HDInsight into Excel using the Microsoft Hive ODBC Driver.
Learn how to submit MapReduce and Hive jobs using PowerShell and HDInsight .NET SDK.
Learn how to use Windows Azure Management Portal to create an HDInsight cluster, and how to open the administrative tools.
Learn how to manage HDInsight clusters using a local Windows Azure PowerShell console.
Configure and run jobs on HDInsight clusters using Windows Azure HDInsight PowerShell. Develop applications that manage HDInsight jobs with the Windows Azure HDInsight .NET SDK.
Learn how to use the Cross-Platform Command-Line Interface to manage HDInsight clusters.
Learn how to provision HDInsight clusters using Windows Azure Management Portal, PowerShell, the command-line interface, and the HDInsight .NET SDK.
Learn about errors that can occur when using PowerShell to manage HDInsight and the steps for recovering from them.