Course Outline

Introduction

  • Why and how project teams adopt Hadoop
  • How it all started
  • The Project Manager's role in Hadoop projects

Understanding Hadoop's Architecture and Key Concepts

  • HDFS
  • MapReduce
  • Other pieces of the Hadoop ecosystem

What Constitutes Big Data?

Different Approaches to Storing Big Data

HDFS (Hadoop Distributed File System) as the Foundation

How Big Data is Processed

  • The power of distributed processing

Processing Data with MapReduce

  • How data is picked apart step by step

The Role of Clustering in Large-Scale Distributed Processing

  • Architectural overview
  • Clustering approaches

Clustering Your Data and Processes with YARN

The Role of Non-Relational Databases in Big Data Storage

Working with Hadoop's Non-Relational Database: HBase

Data Warehousing Architectural Overview

Managing Your Data Warehouse with Hive

Running Hadoop from Shell Scripts

Working with Hadoop Streaming

Other Hadoop Tools and Utilities

Getting Started on a Hadoop Project

  • Demystifying complexity

Migrating an Existing Project to Hadoop

  • Infrastructure considerations
  • Scaling beyond your allocated resources

Hadoop Project Stakeholders and Their Toolkits

  • Developers, data scientists, business analysts and project managers

Hadoop as a Foundation for New Technologies and Approaches

Closing Remarks

Requirements

  • A general understanding of programming
  • An understanding of databases
  • Basic knowledge of Linux

Duration: 14 hours

Related Courses

Hortonworks Data Platform (HDP) for Administrators

21 hours

Apache Ambari: Efficiently Manage Hadoop Clusters

21 hours

Impala for Business Intelligence

21 hours

Data Analysis with Hive/HiveQL

7 hours

Administrator Training for Apache Hadoop

35 hours

Big Data Analytics in Health

21 hours

Datameer for Data Analysts

14 hours

Hadoop Administration

21 hours

Hadoop For Administrators

21 hours

Hadoop for Developers (4 days)

28 hours

Advanced Hadoop for Developers

21 hours

Hadoop for Developers and Administrators

21 hours

Hadoop Administration on MapR

28 hours

Hadoop with Python

28 hours

Hadoop and Spark for Administrators

35 hours
