Talend Big Data Basics | Agilitics






Talend Big Data Basics


No. of Days: 1

Target audience: Anyone who wants to use Talend Studio to interact with Big Data systems
Prerequisites: Completion of Talend Data Integration Basics or Talend Data Integration Advanced
Course objectives
After completing this course, you will be able to:
  • Create cluster metadata manually, from configuration files, or automatically
  • Create HDFS and Hive metadata
  • Connect to your cluster to use HDFS, HBase, Hive, Pig, Sqoop, and MapReduce
  • Read data from and write data to HDFS (HDFS, HBase)
  • Read tables from and write tables to HDFS (Hive, Sqoop)
  • Process tables stored on HDFS with Hive
  • Process data stored on HDFS with Pig
  • Process data stored on HDFS with Big Data batch Jobs
Course agenda

Basic concepts

  • Opening a project
  • Monitoring the Hadoop cluster
  • Creating cluster metadata

Reading and writing data in HDFS

  • Storing a file on HDFS
  • Storing multiple files on HDFS
  • Reading data from HDFS
  • Using HBase to store sparse data on HDFS

Working with tables

  • Importing tables with Sqoop
  • Creating tables in HDFS with Hive

Processing data and tables in HDFS

  • Processing Hive tables with Jobs
  • Profiling Hive tables (optional)
  • Processing data with Pig
  • Processing data with batch Jobs

Troubleshooting guide

  • Troubleshooting your cluster