Monday, 6 July 2015

ClusterID vs JobFlowID on AWS EMR

I am a bit confused about the available APIs and the two identifiers. I am using boto, but I don't think that is the problem here: my question applies to any API client (but not the CLI).

I start a job flow with RunJobFlow, which returns a JobFlowId. Let's assume I don't want to store that ID, but would rather find out later which job flows are running so that I can add steps to them.

I think I should be able to use DescribeJobFlows to find all the jobflow_ids and proceed from there. But in the documentation (http://ift.tt/1dH9ACl) this API call is marked as deprecated, and it directs us to use ListClusters instead, which returns cluster_ids.

What ties the two together? Is it the same identifier? If not, how can I get the job flow IDs from the cluster ID?

I think the confusion also comes from the fact that in the CLI the command is "create-cluster", which returns a cluster_id, and add-steps also takes a cluster_id.
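
To make the workflow I have in mind concrete, here is a rough sketch of it. It is only an illustration under assumptions: it uses the boto3-style method names (list_clusters, add_job_flow_steps) rather than the boto 2 calls I am actually using, the helper names are made up, and it assumes the Id returned by ListClusters can be passed directly as JobFlowId, which is how I read the AddJobFlowSteps parameter description but is exactly the point I would like confirmed.

```python
# Sketch only: `emr` is assumed to be a boto3-style EMR client,
# e.g. boto3.client("emr"). Helper names are hypothetical.

# Cluster states treated here as "still able to accept steps".
ACTIVE_STATES = ["STARTING", "BOOTSTRAPPING", "RUNNING", "WAITING"]


def list_active_cluster_ids(emr):
    """Return the Ids of all active clusters, following pagination markers."""
    ids = []
    kwargs = {"ClusterStates": ACTIVE_STATES}
    while True:
        page = emr.list_clusters(**kwargs)
        ids.extend(cluster["Id"] for cluster in page.get("Clusters", []))
        marker = page.get("Marker")
        if not marker:
            return ids
        # More results are available: request the next page.
        kwargs["Marker"] = marker


def add_step_to_all(emr, step):
    """Submit one step to every active cluster.

    ASSUMPTION: the cluster Id from ListClusters is accepted as JobFlowId.
    Returns a dict mapping cluster Id to the list of step Ids created.
    """
    result = {}
    for cluster_id in list_active_cluster_ids(emr):
        response = emr.add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])
        result[cluster_id] = response["StepIds"]
    return result
```

With a real client this would be called as add_step_to_all(boto3.client("emr"), step), where step is a dict with Name, ActionOnFailure and HadoopJarStep keys. Passing the client in as a parameter keeps the sketch independent of credentials and of which boto version is installed.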



