Convert batch of Word files to PDFs in Mac OS X

Say I have several Word files in a folder. Is there a way to generate a batch of PDFs from these files?


Solution 1:

Provided you have MS Word (or any other app that can open MS Word files) installed, you can use Automator. Here is a step by step guide on how to set it up for your needs: http://aseriesoftubes.com/articles/how-to-batch-convert-doc-files-to-pdf-format-using-mac-osx-automator/

Brief overview of the whole process:

  1. Open Automator
  2. Create a new workflow
  3. From the library panel on the left, select Files & Folders then double-click Get Specified Finder Items
  4. Add the all the files to convert
  5. From the library panel, now select Documents, then double click Convert Format of Word Documents
  6. From the dropdown menu, select Portable Document Format (PDF)
  7. Finally, click the Run button, and it will convert all the files and save them in the same folder where the original Word files are.

Solution 2:

You can use the docx2pdf command line utility to batch convert docx to pdf on macOS (or windows). It uses Microsoft Word's APIs to directly convert to PDF creating a perfect copy. It uses JXA (Javscript for Automation, basically AppleScript in JS) in macOS and win32com in Windows.

pip install docx2pdf
docx2pdf myfolder/

Disclaimer: I wrote this tool after struggling to find a cross-platform solution for batch converting docx to pdf with zero formatting issues since it directly uses Microsoft Word. https://github.com/AlJohri/docx2pdf

Solution 3:

One option using the command line would be to use pandoc (which requires LaTeX for PDF generation).

  1. Install pandoc and LaTeX. Because LaTeX is so big, I installed BasicTeX as recommended in the pandoc docs. With Homebrew and Homebrew Cask: brew install pandoc && brew install --cask basictex
  2. Make sure the Word files are in .docx format. If they are in .doc format, you can convert them with OS X's built-in textutil: textutil -convert docx *.doc
  3. On El Capitan you have to add the texbin utilities to your PATH: export PATH=/Library/TeX/texbin:"$PATH"
  4. Convert: pandoc -o myfile.pdf myfile.docx

Because your question was regarding batch converting multiple files, you could easily put this into a loop:

#! /bin/bash
for file in *.doc; do
  textutil -convert docx "$file"

  # Account for the new `x` in `docx`
  pandoc -o "${file%doc}pdf" "${file}x"
done

Solution 4:

Unfortunately Office 2016 and later (including Office 365) no longer supports Automator. You can work around this using AppleScript.

This solution uses:

  • AppleScript to open microsoft word and save as a PDF
  • An Automator Application to wrap the AppleScript
  • Find from the Terminal to find word documents to call the Automator Applicator

The AppleScript script

The purpose of the script is, for the filename passed in, to open the file in word, and then save the file with a .pdf extension instead of .doc or .docx.

on run {input, parameters}
    repeat with aFile in input
        repeat 1 times -- # fake loop

            -- Create the output filename
            tell application "System Events"
                set inputFile to disk item (aFile as text)

                tell (info for input) to set Nm to name
                if (text -5 thru -1 of Nm) is ".docx" then
                    set outputFileName to ((text 1 thru -6 of Nm) & ".pdf")
                else if (text -4 thru -1 of Nm) is ".doc" then
                    -- .doc
                    set outputFileName to ((text 1 thru -5 of Nm) & ".pdf")
                else
                    -- If this isn't a word document, continue to the next file.
                    exit repeat
                end if
            end tell

            -- Open word, save as PDF in default path
            tell application id "com.microsoft.Word"
                activate
                open aFile
                tell active document
                    save as it file name outputFileName file format format PDF
                    close saving no
                end tell
                set defaultPath to get default file path file path type documents path
            end tell

            -- Move to output location
            tell application "System Events"
                set outputPath to (container of inputFile)
                set outputFile to disk item outputFileName of folder defaultPath
                move outputFile to outputPath
            end tell
        end repeat
    end repeat
    return input
end run

The Automator Application

  1. Open automator
  2. File / New then select Application
  3. From the actions Library in the left hand nav, select "Run AppleScript" and drag it into the window on the righ
  4. Copy and paste the code above into the Run AppleScript window.
  5. Save the application as "WordToPdf.app" on the Desktop

Calling it from the command line

  1. Open Terminal
  2. Change Directory to the one you wish to run the script in. Note that it is recursive and will process all files in all subdirectories
  3. Run the following code:
     find . -type f \( -iname \*.doc -o -iname \*.docx \) -print0 | while read -d $'\0' file
     do
         automator -i "$file" ~/Desktop/WordToPdf.app
     done
    
  4. Double check that you are happy, and then optionally remove all of the word documents:
    find . -type f \( -iname \*.doc -o -iname \*.docx \) -exec rm {} \;
    

Solution 5:

Seems like the Automator solution doesn't work anymore.

So there is a simple one.

Install the CUPS-PDF module for OS X. This is a virtual PDF printer that looks like a "real" printer to the system, but creates a PDF file when you send a document to it.

https://bitbucket.org/codepoet/cups-pdf-for-mac-os-x/wiki/Home

If you make it as your default printer, then you just have to mass select all your files and do "Command" + "P" (shortcut). Mac OSX will open all files, send them to print in pdf, then close them = Simple and it works (on my Mac OS X 10.8.2 Montain Lion)

other tip :

"After you have installed the Virtual Printer using CUPS-PDF, there is a simpler and more powerful way to batch convert any set of documents to PDF files.

Use Printer Setup Utility to create a Desktop Printer icon for the Virtual Printer. After it is created, I put it in my Dock for easy access.

In the Finder, drag your file icons to the Virtual Printer icon. For Microsoft Word documents, Microsoft Word will be opened and instructed to print each document to that printer.

Unlike the Microsoft Word Macro method, you can do this for any other document created by any other program. As necessary, the Finder will open the appropriate application and tell it to print to the Virtual Printer.

There are two mild limitations to this general method.

(1) The files you drag to the printer icon need to be ones that the Mac knows what to do with (i.e. you can double-click to open it in some program on your Mac).

(2) The document's default "Open With" application needs to support the AppleScript command for Print. All well-behaved MacOS X programs do this. NeoOffice for example doesn't, and thus batch converting native NeoOffice documents does not work for this printer icon method."

Source (skip the macro part) : http://hints.macworld.com/article.php?story=20070315023808642