Recursively searching log files of an EMR cluster

Submitted by Jochus on Mon, 13/03/2017 - 07:28 | Posted in: Java

Investigating log files on an EMR cluster can sometimes be very hard. To grep recursively through all .gz files (controller, stderr, stdout, ...), you can use:

$ find . -name \*.gz -print0 | xargs -0 zgrep "#WORD_TO_SEARCH#"
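If you run this search often, the one-liner can be wrapped in a small shell function. The sketch below is only one possible variant: the function name search_gz_logs and its two arguments (directory and pattern) are made up for illustration. It uses find's -exec ... {} + instead of xargs, so zgrep is not started at all when no .gz file is found, and it passes -H and -n so that every hit shows the file name and line number.

```shell
# Hypothetical helper (name and arguments are assumptions, not part of EMR):
#   search_gz_logs DIR PATTERN
# Recursively searches all .gz files below DIR for PATTERN.
search_gz_logs() {
  dir="$1"
  pattern="$2"
  # -type f    : regular files only
  # -exec .. + : hand the found files to zgrep in batches (no xargs needed)
  # -H / -n    : prefix each hit with the file name and the line number
  find "$dir" -type f -name '*.gz' -exec zgrep -H -n "$pattern" {} +
}
```

For example, search_gz_logs . "#WORD_TO_SEARCH#" searches from the current directory, just like the command above.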

