Big Data & Hadoop Class Syllabus

  • Big Data Introduction and Hadoop Fundamentals
    o Data Storage and Analysis
    o Comparison with RDBMS
    • Hadoop – A Brief History
    • MapReduce – Part 1
    o Map and Reduce
    o Sample Program
    o Combiner
    o Partitioners and Custom Partitioner
    • Hadoop Streaming & Pipes
    • HDFS
    o Blocks
    o NN & DN
    o HDFS Federation & High Availability
    • HDFS Clients
    o HDFS Command Line
    o HDFS CLI – File System Operations Lab
    o HDFS Web UI
    o HDFS Java Client
    o HDFS Java Client – File System Operations Lab
    o CRUD Operations using Java Client
    o Anatomy of File Read and File Write
    o DistCp
    o Cluster balancing
    • YARN – Cluster Management (Hadoop 2.x)
    o How YARN Applications Run
    o YARN vs MapReduce
    o YARN Scheduling
    ▪ Capacity Scheduler
    ▪ Fair Scheduler
    ▪ FIFO Scheduler
    • MapReduce – Part 2
    o Env Setup
    o Tool and ToolRunner
    o Mapper
    o Reducer
    o Driver program
    o How to package the job
    o MapReduce Web UI
    o How a MapReduce Job Runs
    o Shuffle & Sort
    o Speculative Execution
    • InputFormats
    o Input Splits and Record Reader
    o Default Input Formats
    o Implement Custom Input Format
    • OutputFormats
    o Default Output formats
    o Record Writer
  • ZooKeeper
    o Zookeeper in HBase
    o How Zookeeper is used in Production
  • MapReduce Joins
    ▪ Map-side joins
    ▪ Reduce-side joins
    ▪ Distributed Cache
    • Hive
    o Comparison with RDBMS
    o HQL
    o Data types
    o Tables
    o Importing and Exporting
    o Partitioning and Bucketing – Advanced
    o Joins and Join Optimization
    o Functions – Built-in & User Defined
    o Advanced Optimization of HQL
    o Storage File Formats – Advanced
    o Loading and Storing Data
    o SerDes – Advanced
    • Sqoop
    o Important basics
    o Import – Deep dive
    o Export – Deep dive
    o Sqoop Optimization – Incremental Load
    o Many more
    • Pig
    o Important basics
    o Pig Latin
  • Ambari
    o Real time Cluster deployment Using Ambari
    o Monitoring the Cluster
    o Cluster balancing
  • REST API
    o Introduction
    o Real-time use cases of how REST is used with Hadoop
    • Labs:
    o Real-time use cases and datasets covered (10+ real-time datasets)
    o Word count, weather sensor dataset, and social media datasets such as YouTube and Twitter data analysis
    o Java and Unix Basics Lab
    o Hadoop, Hive, Sqoop, Oozie, HBase, Flume Installations – Pseudo-distributed and Cluster modes
    • Master Project:
    o Real-time Data Warehouse migration
    o Real-time concepts covered:
    ▪ Hive - Advanced topics
    ▪ Sqoop import/export
    ▪ Oozie Scheduling
    ▪ How Hadoop MapReduce is used in data warehousing
    ▪ RDBMS concepts
    ▪ ETL tool concepts
    ▪ Integration with Reporting tools
  • Compression
    o Map Output
    o Final Output
    o Splittable vs Non-Splittable
    o Compression Codecs
    • Serialization
    o Default data types
    o Writable vs WritableComparable
    o Custom Data types – Custom Writable/Comparable
    • File Based Data structures
    o Sequence file
    o Reading and Writing into Sequence file
    o Map File
    • Tuning MapReduce Jobs
    • Advanced MapReduce
    o Counters
    ▪ Built-In Counters Classification
    ▪ User Defined Counters
    o Sorting
    ▪ Partial Sort
    ▪ Total Sort
    ▪ Secondary Sort
    o Joins
  • Pig (continued)
    o Data types
    o Functions – Built-in, User Defined
    o Loading and Storing Data
    • Flume
    o Configure Flume and Import data
    o Architecture and LAB
    • Oozie
    o Different workflow jobs
    o Oozie scheduler
    o LAB – covers advanced topics
    • HBase
    o NoSQL databases Introduction
    o CAP theorem
    o HBase Architecture
    o HBase Clients – Java Client
    o Loading Data
    o UDFs, UDAFs, UDTFs

No Boring Lectures, No Theoretical Learning

PURE PRACTICAL CLASSES

    • Guaranteed Job Assistance: We keep sending you to interviews until you are hired.
    • Faculty will help you prepare your resume to industry standards.
    • We provide questions and answers that are commonly asked in interviews
    • Mock Tests
    • Mock Interviews
    • Suitable for: software developers, web analytics candidates, and freshers
    • Work With Real Time Case Studies
    • TalentHub aims at your career success

    Course Duration

    • 2 - 3 Months Practical Classes
    • Advanced Java: 3 Months Practical Classes
    • In Class, You Get In-Depth Programming Knowledge on each Topic
    • Weekdays Classes
    • Weekends Saturday and Sunday Classes
    • Location: Courses are run in our Pune training centres (BTM Layout, Marathahalli, Jayanagar and Rajaji Nagar)
    • Can be on-site at client locations (Corporate Training)
    • Online Big Data & Hadoop Courses
    • Pay only after FREE DEMO CLASS

    Student Free Benefits

    • Placement Assistance
    • Real life case studies to practice
    • Free Technical Support after Course Completion
    • Back up Classes Available
    • Certification in Big Data Hadoop
    • Free Wifi and Lab Facility
    • Latest Study Material
    • Attend 1st Class Free
    • Fast-track course available at competitive fees

    Main Big Data & Hadoop Topics Covered

    • Hadoop Developer, Administrator & Data Analytics
    • Big Data Introduction and Hadoop Fundamentals
    • MapReduce, HDFS, Hive
    • Word count, weather sensor dataset, and social media datasets such as YouTube and Twitter data analysis
    • Hadoop, Hive, Sqoop, Flume Installations – Pseudo-distributed Mode
    • Sqoop, HBase, Pig
    • Apache Spark & Oozie
    • Spark & Scala
    • Java and Unix Basics Lab
Best Hadoop Course in Pune