How can I handle spaces in file names when using xargs on find results?

One of my common practices is to perform greps on all files of a certain type, e.g., find all the HTML files that have the word "rumpus" in them. To do it, I use

find /path/to -name "*.html" | xargs grep -l "rumpus"

Occasionally, find will return a file with a space in its name such as my new file.html. When xargs passed this to grep, however, I get these errors:

grep: /path/to/bad/file/my: No such file or directory
grep: new: No such file or directory
grep: file.html: No such file or directory

I can see what's going on here: either the pipe or the xargs is treating the spaces as delimiters between files. For the life of me, though, I can't figure out how to prevent this behavior. Can it be done with find + xargs? Or do I have to use an entirely different command?


Use

find ... -print0 | xargs -0 ...

e.g.

find /path/to -name "*.html"  -print0 | xargs -0  grep -l "rumpus"

from the find man page

-print0
          True; print the full file name on the standard  output,  followed
          by  a  null  character  (instead  of  the  newline character that
          ‘-print’ uses).  This allows file names that contain newlines  or
          other  types  of  white space to be correctly interpreted by pro-
          grams that process the find output.  This option  corresponds  to
          the ‘-0’ option of xargs.

You do not need to use xargs, because find can execute commands itself. When doing this, you do do not have to worry about the shell interpreting characters in the name.

find /path/to -name "*.html" -exec grep -l "rumpus" '{}' +

from the find man page

-exec command {} +
This variant of the -exec action runs the specified command on the selected files, but the command line is built by appending each selected file name at the end; the total number of invocations of the command will be much less than the number of matched files. The command line is built in much the same way that xargs builds its command lines. Only one instance of `{}' is allowed within the command. The command is executed in the starting directory.


If find and xarg versions on your system doesn't support -print0 and -0 switches (for example AIX find and xargs) you can use this:

find /your/path -name "*.html" | sed 's/ /\\ /g' | xargs grep -l "rumpus"

Here sed will take care of escaping the spaces for xargs.