GPSS Producer (deprecated)
Supported pipeline types:
|
When you configure the GPSS Producer destination, you specify the connection information for a Greenplum Database master and a Greenplum Stream Server, define the table to use, and optionally define field mappings. By default, the destination writes field data to columns with matching names.
The GPSS Producer destination can use CRUD operations defined in the
sdc.operation.type
record header attribute to write
data. You can define a default operation for records without the header
attribute or value. You can also configure how to handle records with
unsupported operations.
For information about Data Collector change data
processing and a list of CDC-enabled origins, see Processing Changed Data.
Before you use the GPSS Producer destination, you must install the GPSS stage library and complete the other prerequisite tasks. The GPSS stage library is an Enterprise stage library. Releases of Enterprise stage libraries occur separately from Data Collector releases. For more information, see Enterprise Stage Libraries in the Data Collector documentation.
Prerequisites
Install the GPSS Stage Library
You must install the GPSS stage library before using the GPSS Producer destination.
You can install Enterprise stage libraries using Package Manager for a tarball Data Collector installation or as custom stage libraries for a tarball, RPM, or Cloudera Manager Data Collector installation.
Supported Versions
Data Collector Version | Supported Stage Library Version |
---|---|
Data Collector 3.8.2 and later | GPSS Enterprise Library 1.0.x |
Installing with Package Manager
You can use Package Manager to install the GPSS stage library on a tarball Data Collector installation.
Installing as a Custom Stage Library
You can install the GPSS Enterprise stage library as a custom stage library on a tarball, RPM, or Cloudera Manager Data Collector installation.
Install, Configure, and Start GPSS in Greenplum Database
The Greenplum Stream Server (GPSS) manages communication and data transfer between the GPSS Producer destination and Greenplum Database. Before using the destination, you must install, configure, and start GPSS in the Greenplum Database cluster. For more information, see the Pivotol Greenplum documentation.
Define the CRUD Operation
The GPSS Producer destination can insert, update, or merge data. The destination writes the records based on the CRUD operation defined in a CRUD operation header attribute or in operation-related stage properties.
You define the CRUD operation in the following ways:
- CRUD record header attribute
- You can define the CRUD
operation in a CRUD operation record header attribute. The
destination looks for the CRUD operation to use in the
sdc.operation.type
record header attribute. - Operation stage properties
- You define a default operation in the destination
properties. The destination uses the default operation when the
sdc.operation.type
record header attribute is not set.
Configuring a GPSS Producer Destination
Before you use the GPSS Producer destination in a pipeline, complete the prerequisite tasks.