Recursively searching log files of an EMR cluster

Submitted by Jochus on Mon, 13/03/2017 - 07:28 | Posted in: Java

Investigation log files on an EMR cluster can be sometimes very hard. If you want to grep recursively in all .gz files (controller, stderr, stdout, ...), you can use:

$ find -name \*.gz -print0 | xargs -0 zgrep "#WORD_TO_SEARCH#"

Add new comment

The content of this field is kept private and will not be shown publicly.


  • Lines and paragraphs break automatically.
  • You can caption images (data-caption="Text"), but also videos, blockquotes, and so on.
  • Web page addresses and email addresses turn into links automatically.
  • You can enable syntax highlighting of source code with the following tags: <code>, <blockcode>, <bash>, <cpp>, <java>, <php>, <sql>, <xml>. The supported tag styles are: <foo>, [foo].