amazon s3 - Can I customize partitioning in Kinesis Firehose before delivering to S3?

Question

Welcome To Ask or Share your Answers For Others

amazon s3 - Can I customize partitioning in Kinesis Firehose before delivering to S3?

asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

amazon s3 - Can I customize partitioning in Kinesis Firehose before delivering to S3?

I have a Firehose stream that is intended to ingest millions of events from different sources and of different event-types. The stream should deliver all data to one S3 bucket as a store of rawunaltered data.

I was thinking of partitioning this data in S3 based on metadata embedded within the event message like event-souce, event-type and event-date.

However, Firehose follows its default partitioning based on record arrival time. Is it possible to customize this partitioning behavior to fit my needs?

Update: Accepted answer updated as a new answer suggests the feature is available as of Sep 2021

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Answer

深蓝 · Answer 1 · 2021-10-23T17:53:44+0000

No. You cannot 'partition' based upon event content.

Some options are:

Send to separate Firehose streams
Send to a Kinesis Data Stream (instead of Firehose) and write your own custom Lambda function to process and save the data (See: AWS Developer Forums: Athena and Kinesis Firehose)
Use Kinesis Analytics to process the message and 'direct' it to different Firehose streams

If you are going to use the output with Amazon Athena or Amazon EMR, you could also consider converting it into Parquet format, which has much better performance. This would require post-processing of the data in S3 as a batch rather than converting the data as it arrives in a stream.

Categories

amazon s3 - Can I customize partitioning in Kinesis Firehose before delivering to S3?

amazon s3 - Can I customize partitioning in Kinesis Firehose before delivering to S3?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags