Monday, 29 December 2014

Pig FILTER with MATCHES not working on EMR

I would like to filter out all strings that contain "internal", but the data is not being filtered. My Pig script contains:



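-- Load the input; the first field of each record is read as textline.
preload = LOAD '$INPUT' AS (textline:chararray);
-- Keep only records whose text is longer than 100 characters.
filterdata = FILTER preload BY SIZE(textline) > 100;
-- Intended: drop every record whose text contains "internal".
filterInternal = FILTER filterdata BY NOT (textline MATCHES '.*internal.*');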


I am using Pig 0.12.0 on AWS EMR.




