call-icon (469) 491 4477
0 student

Big Data and Hadoop Administrator Training

The course provides an in-depth understanding of Hadoop framework, HDFS, and Hadoop cluster including Sqoop, Flume, Pig, Hive, and Impala. You will learn about cluster management solutions, core Hadoop distribution, and Cloudera manager. It includes 4 industry-based projects and is aligned to Cloudera’s CCAH ‘CCA-500’ certification. This course is best suited for IT professionals, data engineers, system administrators, and cloud administrators.

Overview

Big Data Hadoop Administrator Training provides an in-depth understanding of Hadoop framework, HDFS, and Hadoop cluster including Sqoop, Flume, Pig, Hive, and Impala. You will learn about cluster management solutions, core Hadoop distribution, and Cloudera Manager. It includes 4 industry-based projects and is aligned to Cloudera’s CCAH ‘CCA-500’ certification. Big Data Hadoop Administrator Training is best suited for IT professionals, data engineers, system administrators, and cloud administrators.
We also provide training in many other courses under Big Data and Analysis.
Please follow us on Facebook, Twitter, LinkedIn, Google+, Youtube etc. and share your experience with our PMP Training.

Key Features

Classroom Training:

  • 32 hours of instructor-led training
  • 100% Money Back Guarantee*
  • 20 hours of self-paced video
  • Includes 4 real industry-based projects
  • Prepares for Cloudera CCAH ‘CCA-500’ certification exam
  • Includes 3 simulation exams aligned to ‘CCA-500’ certification exam

Online Instructor led Training:

  • 32 hours of instructor-led training
  • 20 hours of self-paced video
  • Includes 4 real industry-based projects
  • Prepares for Cloudera CCAH ‘CCA-500’ certification exam
  • Includes 3 simulation exams aligned to ‘CCA-500’ certification exam

FAQ

What are the System Requirements?To run Hadoop, your system needs to fulfill the following requirements:

  • 64-bit Operating System
  • 4GB RAM

We will help you to set up a Virtual Machine with local access.

Who are the trainers?The training is delivered by highly qualified and certified instructors with relevant industry experience.

We offer this training in the following modes:

  1. Classroom: Physical classroom training for those who prefer to attend in-person open house training or onsite training.
  2. Live Virtual Classroom or Online Classroom: With online classroom training, you have the option to attend the course remotely from your desktop via video conferencing. This format saves productivity challenges and decreases your time spent away from work or home.
  3. Online Self-Learning: In this mode, you will receive the lecture videos and you can go through the course as per your convenience.

Can I cancel my enrolment? Do I get a refund?Yes, you can cancel your enrolment if necessary. We will refund the course price after deducting an administration fee. To learn more, you can view our Refund Policy.

Are there any group discounts for classroom training programs?

What are the System Requirements?Yes, we have group discount options for our training programs. Contact us using the form on the right of any page on the Knowledge torch website, or select the Live Chat link. Our customer service representatives will be able to give you more details.

FAQ

Big Data Hadoop Administrator Training course will prepare you for Cloudera’s CCAH ‘CCA-500’ certification and equip you with all the skills for your next Big Data admin assignment. Big Data Hadoop Administrator Training covers the Core Hadoop distributions—Apache Hadoop and Vendor specific distribution—CDH (Cloudera Distribution of Hadoop).
You will learn the need for cluster management solutions, about Cloudera manager and its capabilities. It teaches you how to set up Hadoop cluster and its components such as Sqoop, Flume, Pig, Hive, and Impala with basic or advanced configurations? The Hadoop administrator course also answers What is Hadoop’s Distributed File System, and its processing/computation frameworks? And How to plan, secure, safeguard, and monitor a cluster?
Big Data Hadoop Administrator Training will help you understand all basic and advanced concepts of Big Data and all technologies related to Hadoop stack and components within Hadoop Ecosystem.

What learning outcomes can be expected?
After completing this Big Data Hadoop Administrator Training, you will be able to:

  • Understand the fundamentals of Big Data and its characteristics, various scalability options to help organizations manage Big Data.
  • Master the concepts of the Hadoop framework; its architecture, working of Hadoop distributed file system and deployment of Hadoop cluster using core or vendor-specific distributions.
  • Learn about cluster management solutions such as Cloudera manager and its capabilities for setup, deploying, maintenance & monitoring of Hadoop Clusters.
  • Learn Hadoop Administration activities
  • Learn about computational frameworks for processing Big Data
  • Learn about Hadoop clients, nodes for clients and web interfaces like HUE to work with Hadoop Cluster
  • Learn about Cluster planning and tools for data ingestion into Hadoop clusters
  • Learn about Hadoop components within Hadoop ecosystem like Hive, HBase, Spark, and Kafka
  • Understand security implementation to secure data and clusters.
  • Learn about Hadoop cluster monitoring activities

Who should do this course?
Big Data career opportunities are on the rise, and Hadoop is quickly becoming a must-know technology for the following professionals:

  • Systems administrators and IT managers
  • IT administrators and operators
  • IT Systems Engineer
  • Data Engineer and database administrators
  • Data Analytics Administrator
  • Cloud Systems Administrator
  • Web Engineer

Exam And Certification

How to get certified?To become a Certified Big Data Hadoop developer, you must fulfill the following criteria:

  • Complete any one out of the two projects provided in the course. Submit the deliverables of the project  to support@knowledgtorch.com which will be evaluated by our lead trainer
  • Score minimum of 80% in any one of the four simulation tests
  • Complete 85% of the course

What do I need to do to unlock my certificate?
LVC:

  1. Complete at least 1 project and 1 simulation test with a minimum score of 60%.
  2. Complete at least 1 simulation test with a minimum score of 60%.
  3. Attend one class or complete 85% of the course.
  4. Complete at least 1 project.

OSL:

  1. Complete at least 1 project and 1 simulation test with a minimum score of 60%.
  2. Complete at least 1 simulation test with a minimum score of 60%.
  3. Complete 85% of the course.
  4. Complete at least 1 project.

Course Agenda

Lesson 1: Big Data & Hadoop Introduction

Big Data Hadoop Administrator Training course you will learn about Big Data characteristics need for a framework such as Hadoop & its ecosystem. You will also be introduced to important daemons that support functioning of a Hadoop cluster. Topics covered are:

  • Data & Existing Solutions
  • Welcome to the world of Big Data—What, Why & Where
  • Case studies
  • Hadoop & its Ecosystem
  • Hadoop Core components
  • Hadoop & its capabilities

Lesson 2: HDFS – Hadoop Distributed File System & Hadoop’s Distributions

In this lesson, you will learn about Hadoop Distributed file System, its architecture, working & internals, Hadoop different distributions and about their similarities & differences. Topics covered are:

  • Gain knowledge on HDFS its internals, working & features
  • Learn about possibilities without HDFS
  • Differentiate or find similarities in different distributions of Hadoop.
  • Identify the requirements to set up a Hadoop cluster

Lesson 3: Hadoop Cluster Setup & Working with Hadoop Cluster

In this lesson, you will learn about steps to setup Apache Hadoop (core distribution) & Cloudera Distribution of Hadoop (vendor specific), cluster management solutions and their benefits and nut & bolts of Cloudera Distribution of Hadoop. You will also learn how to verify your cluster. Topics covered are:

  • The need for Cluster Management Solution
  • Choice of Installation methods—Automated/ Manual
  • Linux machines setup—Virtualization & Cloud
  • Hadoop Cluster Setup—Apache Hadoop V2 & Cloudera Distribution of Hadoop (CDH)
  • Cloudera manager features and capabilities
  • Working with Hadoop cluster, HDFS & data
  • Working with management console/ UI ( user interfaces) & Linux terminals
  • Understand administration scenarios

Lesson 4: Hadoop Configurations & Daemon Logs

In this lesson, you will learn about configuration files, ports & properties that relate to the functioning of Hadoop cluster. You will also learn about Hadoop daemons logs and how they help in problem scenarios for diagnosing & gathering information. Topics covered are:

  • List and describe the files that control Hadoop configuration
  • Explain how to manage Hadoop configuration with Cloudera Manager
  • Locate configuration files and make changes
  • Explain how to deal with stale configurations
  • Explain the properties of addresses and ports of RPC and HTTP servers run by Hadoop Daemons
  • Locate log files generated on hosts
  • Filter information in log files
  • Explain how to get diagnostic information from log files

Lesson 5: Hadoop Cluster Maintenance & Administration

In this lesson, you will learn Hadoop cluster maintenance and administration activities. You will also learn the shortcomings of Hadoop v1 and how they are fulfilled by Hadoop v2 features. Topics covered are:

  • Explain how to add and remove nodes in an ad-hoc way
  • Explain how to add and remove nodes in a systematic way, otherwise known as commissioning and decommissioning of nodes
  • Explain how to balance a cluster
  • List the steps for managing services including adding, deleting, starting, stopping and checking the status of services
  • Explain the procedure to enable rack awareness
  • List the steps to add, remove and move role instances and hosts
  • Cite the challenges faced with the first version of Hadoop
  • Explain the features in the second version that help overcome the challenges faced in the first version

Lesson 6: Hadoop Computational Frameworks

In this lesson, you will learn about different types of computational frameworks, MapReduce & YARN concepts & configurations and how YARN manages applications. Topics covered are:

  • Describe the role of computational frameworks
  • Explain MapReduce concepts
  • Describe MRv2 on YARN
  • Explain configuring and understanding of YARN
  • Describe YARN applications
  • Describe YARN memory and CPU settings

Lesson 7: Scheduling—Managing resources via Schedulers

In this lesson, you will learn cluster scheduling concepts, managing resources in your YARN cluster by usage of schedulers & queue management to manage jobs/applications. Topics covered are:

  • Describe the scheduling concepts
  • Indentify the Schedulers
  • Explain the ways to manage resources using Schedulers
  • Describe FIFO, Fair Scheduler, and Capacity Scheduler
  • Explain how to configure Schedulers
  • Explain queue management

Lesson 8: Hadoop Cluster Planning

In this lesson you will learn about how to plan your Hadoop cluster, considerations for cluster sizing & workload patterns in Hadoop cluster, making choices pertaining to variables such as hardware, software & different cluster deployment options. Topics covered are:

  • Planning Hadoop Cluster
  • General Planning considerations
  • Workload and cluster sizing
  • Making Choices—Hardware, Software & Network
  • Making Choices—Master/Slave considerations
  • News from the world—Existing Setups

Lesson 9: Hadoop Clients & HUE interface

In this lesson you will learn about Hadoop clients, nodes that support Hadoop clients and web interface such as HUE which can be used to work with Hadoop cluster and its components. Topics covered are:

  • Explain the concepts of Hadoop client, edge nodes, and gateway nodes
  • Install and configure Hadoop clients
  • Explain how Hue works
  • Install and configure Hue
  • Describe how authentication and authorization is managed in Hue

Lesson 10: Data Ingestion in Hadoop Cluster

In this lesson you will learn about data ingestion types & tools. You will learn more about tools such as Flume, Sqoop that can be used for data import/export. Topics covered are:

  • Understand Data Ingestion & its types
  • Knowing about various data ingestion tools & their capabilities
  • Understanding how Flume works
  • Understanding how sqoop works

In this lesson you will learn about open-source components (also known as services in CDH) that work within Hadoop ecosystem such as Hive, Hbase, kafka & Spark. Topics covered are:

  • List some of the services and open-source components that work within the Hadoop ecosystem
  • List the advantages and key features of Hive
  • Describe briefly about the components of Hive
  • Explain how to configure Hive in different modes
  • Explain the architecture of HBase and cite the advantages of using HBase
  • Explain the working of Apache Kafka
  • Describe the architecture of Apache Spark

Lesson 11: Hadoop Security—Securing Hadoop Cluster

In this lesson you will learn about security aspects and security implementation in a Hadoop cluster to secure data & cluster. Topics covered are:

  • Describe the different ways to avoid risks and secure data
  • Identify the different threat categories
  • Describe the security aspects for different nodes
  • Describe operating system security
  • Describe Kerberos and how it works
  • Describe Service Level Authorization

In this lesson you will learn about basics of cluster monitoring, choosing right monitoring solutions, Hadoop metrics categories & types and Cloudera manager’s features and capabilities that can be used for monitoring your Hadoop cluster. Topics covered are:

  • Describe cluster monitoring
  • Describe the ways to choose the right monitoring solutions
  • List the features and considerations of Cloudera manager for monitoring
  • Describe the different categories of Hadoop Metrics
  • List the different types of Hadoop Metrics
  • List the steps to monitor a cluster by using Cloudera Manager
Curriculum is empty

Instructor

$1,899.00 $1,519.00

0 Comments

Leave a Reply

Your email address will not be published.