Kinesis Firehose
Supported pipeline types:
|
To write data to Amazon Kinesis Streams, use the Kinesis Producer destination. To write data directly to Amazon S3, use the Amazon S3 destination.
When you use the Kinesis Firehose destination to deliver data to Amazon S3, Firehose can buffer incoming records into larger file sizes before delivering the data to Amazon S3. You configure the buffer size and buffer interval when you create the delivery stream.
When you configure the Kinesis Firehose destination, you specify an existing delivery stream to write to, AWS credentials and region, and the data format to use.
AWS Credentials
When Data Collector writes data to a Kinesis Firehose destination, it must pass credentials to Amazon Web Services.
Use one of the following methods to pass AWS credentials:
- IAM role
- When Data Collector runs on an Amazon EC2 instance, you can use the AWS Management Console to configure an IAM role for the EC2 instance. Data Collector uses the IAM instance profile credentials to automatically connect to AWS.
- AWS access key pairs
- When Data Collector does not run on an Amazon EC2 instance or when the EC2 instance doesn’t have an IAM role, you must specify the Access Key ID and Secret Access Key properties in the destination.
Delivery Stream
The Kinesis Firehose destination writes data to an existing delivery stream in Amazon Kinesis Firehose. Before using the Kinesis Firehose destination, use the AWS Management Console to create a delivery stream to an Amazon S3 bucket or Amazon Redshift table.
For more information about creating a Firehose delivery stream, see the Amazon Kinesis Firehose documentation.
Data Formats
The Kinesis Firehose destination writes data to a Kinesis Firehose delivery stream based on the data format that you select.
In Data Collector Edge pipelines,
the destination supports only the JSON data format.
The Kinesis Firehose destination processes data formats as follows:
- Delimited
- The destination writes records as delimited data. When you use this data format, the root field must be list or list-map.
- JSON
- The destination writes records as JSON data. Use the multiple objects format,
where each file includes multiple JSON objects. Each object is a JSON
representation of a record.Note: The JSON array of objects format is not supported for the Kinesis Firehose destination.
Configuring a Kinesis Firehose Destination
Configure a Kinesis Firehose destination to write data to an Amazon Kinesis Firehose delivery stream.