Legacy Stage Libraries

Legacy stage libraries are older stage libraries that have been removed from Data Collector. Though we strongly advise using the stage libraries provided with Data Collector, and upgrading related systems, you can continue to use these legacy libraries when necessary.

For steps for upgrading pipelines that use legacy libraries to current stage libraries, see Update Pipelines using Legacy Stage Libraries.

To use a legacy library, you must install the legacy library. The installation method depends on how you installed Data Collector:
Tarball or cloud service provider installations
Install legacy stage libraries with Package Manager. Follow the instructions in Installing for Tarball Using Package Manager. You can click Legacy Stage Libraries to filter the list of stage libraries, showing only legacy libraries.
RPM package or Cloudera Manager installations
Install legacy stage libraries manually:
  1. Download the legacy libraries:
    1. Go to the StreamSets archives page and navigate to the release that you are using.
    2. Click the "Legacy" link and download the legacy libraries that you want to use.
  2. Install and manage the legacy libraries as you would custom stage libraries. For more information, see Custom Stage Libraries.
The following table lists the legacy stage libraries:
Legacy Stage Library Included Stages
streamsets-datacollector-apache-kafka_0_8_1-lib For Kafka version 0.8.1.
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-apache-kafka_0_8_2-lib For Kafka version 0.8.2.
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-apache-kafka_0_9-lib For Kafka version 0.9.x.
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-apache-kafka_0_10-lib For Kafka version 0.10.x.
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-apache-kafka_0_11-lib For Kafka version 0.11.x.
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-apache-kudu_1_0-lib For Kudu version 1.0.x.

Includes the Kudu Lookup processor and Kudu destination.

streamsets-datacollector-apache-kudu_1_1-lib For Kudu version 1.1.x.

Includes the Kudu Lookup processor and Kudu destination.

streamsets-datacollector-apache-kudu_1_2-lib For Kudu version 1.2.x.

Includes the Kudu Lookup processor and Kudu destination.

streamsets-datacollector-cdh_5_2-lib

For the Cloudera CDH version 5.2 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Solr destination
  • HDFS File Metadata executor
  • MapReduce executor
streamsets-datacollector-cdh_5_3-lib

For the Cloudera CDH version 5.3 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Solr destination
  • HDFS File Metadata executor
  • MapReduce executor
streamsets-datacollector-cdh_5_4-lib

For the Cloudera CDH version 5.4 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
streamsets-datacollector-cdh_5_5-lib

For the Cloudera CDH version 5.5 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
streamsets-datacollector-cdh_5_7-lib

For the Cloudera CDH version 5.7 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_5_8-lib

For the Cloudera CDH version 5.8 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_5_9-lib

For the Cloudera CDH version 5.9 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_5_10-lib

For the Cloudera CDH version 5.10 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_5_11-lib

For the Cloudera CDH version 5.11 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_5_12-lib

For the Cloudera CDH version 5.12 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Solr destination
  • HDFS Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_5_13-lib

For the Cloudera CDH version 5.13 distribution of Apache Hadoop.

Includes:

  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS Standalone origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Kudu Lookup processor
  • Spark Evaluator processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Kudu destination
  • Solr destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
  • Spark executor
streamsets-datacollector-cdh_kafka_1_2-lib For the Cloudera distribution of Apache Kafka 1.2 (0.8.2.0).
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-cdh_kafka_1_3-lib For the Cloudera distribution of Apache Kafka 1.3 (0.8.2.0).
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-cdh_kafka_2_0-lib For the Cloudera distribution of Apache Kafka 2.0.x (0.9.0).
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-cdh_kafka_2_1-lib For the Cloudera distribution of Apache Kafka 2.1.x (0.9.0).
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-cdh_kafka_3_0-lib For the Cloudera distribution of Apache Kafka 3.0.0 (0.11.0).
Includes:
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • Kafka Producer destination
streamsets-datacollector-cdh_spark_2_1-lib For the Cloudera CDH cluster Kafka with CDS powered by Spark 2.1.

Includes the Kafka Consumer origin for cluster mode pipelines.

streamsets-datacollector-hdp_2_2-lib For the Hortonworks version 2.2 distribution of Apache Hadoop.
Includes:
  • Hadoop FS origin for cluster mode pipelines

  • Hadoop FS Standalone origin
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Kafka Producer destination
  • HDFS File Metadata executor
  • Hive Query executor
  • MapReduce executor
streamsets-datacollector-hdp_2_3-lib For the Hortonworks version 2.3 distribution of Apache Hadoop.
Includes:
  • Flume destination
  • Hadoop FS origin for cluster mode pipelines
  • Hadoop FS destination
  • Hadoop FS Standalone origin
  • HBase destination
  • HBase Lookup processor
  • HDFS File Metadata executor
  • HTTP to Kafka origin
  • Kafka Consumer origin
  • Kafka Producer destination
  • MapReduce executor
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
streamsets-datacollector-hdp_2_3-hive1-lib The Hortonworks version 2.3.x distribution of Apache Hive 1.x.
Includes:
  • Hive Metadata processor
  • Hive Metastore destination
  • Hive Streaming destination
  • Hive Query executor
streamsets-datacollector-hdp_2_4-lib For the Hortonworks version 2.4 distribution of Apache Hadoop.
Includes:
  • Hadoop FS origin for cluster mode pipelines

  • Hadoop FS Standalone origin
  • HTTP to Kafka origin
  • Kafka Consumer origin for standalone pipelines
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • HBase Lookup processor
  • Flume destination
  • Hadoop FS destination
  • HBase destination
  • Kafka Producer destination
  • HDFS Metadata executor
  • MapReduce executor
streamsets-datacollector-hdp_2_4-hive1-lib For the Hortonworks version 2.4.x distribution of Apache Hive version 1.x.
Includes:
  • Hive Metadata processor
  • Hive Metastore destination
  • Hive Streaming destination
  • Hive Query executor
streamsets-datacollector-hdp_2_5-lib For the Hortonworks version 2.5.x distribution of Apache Hadoop.
Includes:
  • Hadoop FS origin for cluster mode pipelines

  • Hadoop FS Standalone origin
  • HTTP to Kafka origin
  • Kafka Consumer origin for standalone and cluster mode pipelines
  • Kafka Multitopic Consumer origin
  • SDC RPC to Kafka origin
  • UDP to Kafka origin
  • HBase Lookup processor
  • Hive Metadata processor
  • Hadoop FS destination
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination
  • Kafka Producer destination
  • HDFS Metadata executor
  • Hive Query executor
  • MapReduce executor
streamsets-datacollector-hdp_2_5-flume-lib For the Hortonworks version 2.5.x distribution of Apache Flume.

Includes the Flume destination.

streamsets-datacollector-mapr_5_0-lib For MapR version 5.0.
Includes:
  • MapR FS Standalone origin
  • MapR FS destination
  • MapR FS File Metadata executor
streamsets-datacollector-mapr_5_1-lib For MapR version 5.1.

Includes:

  • MapR DB JSON origin
  • MapR FS origin for cluster mode pipelines
  • MapR FS Standalone origin
  • MapR Streams Consumer origin for standalone and cluster mode pipelines
  • HBase Lookup processor
  • Hive Metadata processor
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination using the MapR library
  • MapR Streams Producer destination
  • MapR DB destination
  • MapR DB JSON destination
  • MapR FS destination
  • MapR FS File Metadata executor
streamsets-datacollector-mapr_5_2-lib For MapR version 5.2.

Includes:

  • MapR DB JSON origin
  • MapR FS origin for cluster mode pipelines
  • MapR FS Standalone origin
  • MapR Multitopic Streams Consumer origin
  • MapR Streams Consumer origin for standalone and cluster mode pipelines
  • HBase Lookup processor
  • Hive Metadata processor
  • Spark Evaluator processor
  • HBase destination
  • Hive Metastore destination
  • Hive Streaming destination using the MapR library
  • MapR Streams Producer destination
  • MapR DB destination
  • MapR DB JSON destination
  • MapR FS destination
  • MapR FS File Metadata executor