jeudi 30 avril 2015

Use AWS Kinesis as a data source for an EMR MapReduce job

I set up a AWS Kinesis stream that receive data from multiple sources. I'd like to process that data in multiple incremental batches using MapReduce in EMR.

How do I specify the input source in my job? Are there any specific libraries to handle a Kinesis record? Sample code would be much appreciated!




Aucun commentaire:

Enregistrer un commentaire