I would like to filter all the strings that contains internal, but the data is not filtered. In my pig scripts I have:
preload = load '$INPUT' as (textline:chararray);
filterdata = FILTER preload BY SIZE(textline) > 100;
filterInternal = FILTER filterdata by NOT(textline MATCHES '.*internal.*');
Using Pig 0.12.0 on AWS
Aucun commentaire:
Enregistrer un commentaire