This is a continuation of my questions on running out of space in root device on EMR. See other questions like this here, here and here.
The hive intelligence seems to suggest that an EBS volume attached to the instances is the right way to get around this. But I can't figure out how to specify the "InstanceGroups"
dict of the boto3 run_job_flow
method.
There is some advice here for EC2 and boto2 here, but I am not sure how that translates into boto3 advice.
Aucun commentaire:
Enregistrer un commentaire