HDP Operations Hadoop Administration 1

Course Overview

All Dates Are Guaranteed to Run!

This course is designed for administrators who will be managing the Hortonworks Data Platform (HDP) with Ambari. It covers installation, configuration, and other typical cluster maintenance tasks.

Target Audience

IT administrators and operators responsible for installing, configuring and supporting an HDP deployment in a Linux environment using Ambari.

Prerequisites

Attendees should be familiar with with Hadoop and Linux environments.

Course Objectives

  • Summarize and enterprise environment including Big Data, Hadoop and the Hortonworks Data Platform (HDP)
  • Install HDP
  • Manage Ambari Users and Groups
  • Manage Hadoop Services
  • Use HDFS Storage
  • Manage HDFS Storage
  • Configure HDFS Storage
  • Configure HDFS Transparent Data Encryption
  • Configure the YARN Resource Manager
  • Submit YARN Jobs
  • Configure the YARN Capacity Scheduler
  • Add and Remove Cluster Nodes
  • Configure HDFS and YARN Rack Awareness
  • Configure HDFS and YARN High Availability
  • Monitor a Cluster
  • Protect a Cluster with Backups

Course Outline

Class Format: 
60% Lecture/Discussion
40% Hands-on Labs

Labs: 

  • Introduction to the Lab Environment
  • Performing an Interactive Ambari HDP Cluster Installation
  • Configuring Ambari Users and Groups
  • Managing Hadoop Services
  • Using HDFS Files and Directories
  • Using WebHDFS
  • Configuring HDFS ACLs
  • Managing HDFS
  • Managing HDFS Quotas
  • Configuring HDFS Transparent Data Encryption
  • Configuring and Managing YARN
  • Non-Ambari YARN Management
  • Configuring YARN Failure Sensitivity, Work Preserving Restarts, and Log Aggregation Settings
  • Submitting YARN Jobs
  • Configuring Different Workload Types
  • Configuring User and Groups for YARN Labs
  • Configuring YARN Resource Behavior and Queues
  • User, Group and Fine-Tuned Resource Management
  • Adding Worker Nodes
  • Configuring Rack Awareness
  • Configuring HDFS High Availability
  • Configuring YARN High Availability
  • Configuring and Managing Ambari Alerts
  • Configuring and Managing HDFS Snapshots
  • Using Distributed Copy (DistCP)