How to join text files?

I have saved many documents as txt. I want to print them together so first I want them together in a single file. The order doesn't matter in this case.

I want a solution that does not involve typing the names of the files to be merged, but one that would just merge all txt files within the folder.

Can I do it with a command or some GUI?


I looked here. Don't know how to use join.


Solution 1:

Use cat with output redirection. Syntax: cat file [file] [[file] ...] > joined-file.

Example with just two files (you can have many more):

$ echo "some text in a file" > file1
$ echo "another file with some text" > file2
$ cat file1 file2 > mergedfiles
$ cat mergedfiles
some text in a file
another file with some text

In case you have "many documents", make use of shell globbing (patterns):

cat input-files-dir/* > joined-file

This will join all files in that directory to the current directory (preventing it to match the output file itself). It is totally independent to the use of cat and output redirection - it's just Bash providing all the files as arguments to cat.


File types

It will just glue (join) files together as you would do with paper and tape. It does not care about the actual file format being capable of handling this. It will work for text files, but not for PDFs, ODTs, etc. Well, it will glue them together, but it's not a valid PDF/ODT anymore.


Order of joining

As phoibos pointed out the shell globbing will result in alphabetical order of file names. This is how Bash and shell globbing works.


Addendum about input file is output file error

When the pattern of the input files matches the very same file as being output, this will cause an error. It's a safety feature. Example: cat *.txt > out.txt run the second time will cause this.

What you can do about it:

  • Choose a more specific pattern to match the actual input files, not matching the output name. Example: input files pattern *.txt with output file output.out will not collide.
  • Work in different directories. In the example above I've used a separate input-files-dir directory to place all files in, and output to the current working directory. This makes it impossible to get this error.

Solution 2:

A simple way to do that is by using cat:

cat file1 file2 > joined_file

If you just issue cat file1 file2 you'll see both files on the standard output. By using >, you're just redirecting the standard output to a file. That will work also with another commands.

Solution 3:

Do it with a simple loop:

for i in *.txt; do cat "$i" >> complete.txt; done

>> appends to the file.

Note: If for some reason you have to run the command again, you have to remove complete.txt, otherwise you'd write the file to itself, which doesn't work.

Solution 4:

If the files you want to combine all end in .txt, keep it simple:

cat *.txt > combined.txt

If the directory only contains text files, it's also simple:

cat * > combined.txt

(Note that once you create combined.txt, doing it again would include it in the expansion of *, leading to odd behavior).

If you want to select some files in the directory and not others, it's best if the filenames allow you to distinguish which ones you want. If not, you can get fancy with find. But I doubt you need to go that far.