Course Outline

Introduction to Hortonworks Data Platform (HDP)

Overview of Big Data and Apache Hadoop

Installing and Configuring HDP

Setting Up, Deploying, and Managing a Hadoop Cluster

Understanding and Configuring YARN and MapReduce

Overview of Job Scheduling

Ensuring Data Integrity

Understanding Enterprise Data Movement

Using HDFS Commands & Services

Transferring Data Using Flume

Working with Hive

Scheduling Workflow Using Oozie

Exploring Hadoop 2.x

Understanding HBase Architecture

Monitoring HDP2 Services Using Ambari

New Features in HDP

Troubleshooting

Summary and Next Steps

Requirements

  • An understanding of Hadoop and big data
  • An understanding of Spark
  • Familiarity with the command line
  • System administration experience

Audience

  • Hadoop administrators

Duration: 21 Hours
