Technical Translation from English into Russian in Computer and Telecommunication Industries
The author Articles Recourses Useful links По-русски

Exporting PDF documents

Saving (export) in other format is necessary to have the editable version of PDF file content, which will be translated in other application and then will be saved in PDF format. For example, we can save a PDF file in DOC format (more precisely, in RTF format), translate it in Word editor as usual, and save the translated document back in PDF.

Adobe Acrobat enables you to convert your PDF to many different formats with the Save As dialog. These filters work best when the PDF is tagged. If your PDF is not tagged, Acrobat uses an inference engine to assemble the letters into words and the words into paragraphs. It tries to detect and create tables. It works best on documents with very simple formatting. Tables and formatted pages generally don't survive. Also Adobe Reader enables you to convert your PDF to text by selecting File > Save As Text.

Because a PDF document is a form of PostScript, you can change it into a wide variety of file formats for use in various circumstances and by various applications. The concept of file formats is easily understood if you divide any document into two parts: its contents and the container that holds those contents. Contents are document elements such as text and graphics. File formats are the container into which you place your document contents.

You employ different file formats (containers) for different uses. A PDF is one file format, and a PostScript file format is another. Both may have essentially the same content stored in different containers bound for different uses. A PDF document can be saved out as an integrated text and graphic document, such as another PDF, EPS, HTML, or XML; as a text-based file, such as a TXT; or as a graphics file, such as TIFF or one of the JPEGs, depending upon how you might want to translate the content of your PDF.

One of the easiest ways to change a PDF�s file format is to use the Save As function in Adobe Acrobat:
  1. Open the PDF whose file format you would like to change.
  2. Select File > Save As. The Save As window will appear
  3. Click the Format menu and select a file format. After you have selected the file format, be sure to click the Settings button so that you can configure your new format to match your needs for this new file. Each file format will have its own unique Save As Settings dialog. Acrobat will automatically add the appropriate three-character extension onto your new file.
  4. When you are through configuring your file format settings, click the OK button in the Save As Settings dialog; then click the Save button in the Save As dialog to create your new file.

Please notice, that after saving file in other format we lost practically all design of the source document, therefore received from PDF file a set of separate elements in DOC format is assembled manually to precisely reproduce the design of the source document as possible. It is sometimes more useful to keep these elements well-ordered, i.e. before everything else "pull out" all text, then save all figures, and further form a new design in parallel with text translation.

Text can be easily exported from a PDF document in various ways. You can export all the text in your PDF, export all the images at once, or copy and paste selected content.

Exporting All Text or All Images

To export all the text in your PDF file, choose File > Save As and select one of the many text file formats available there.

Instead of accessing one image at a time, you may need to export all images from a PDF document with the same format and with the same settings. To do so, open the document from which you would like to export all of the images and choose Advanced > Export All Images. Configure the dialog as follows:
  • Select a file Format for the images.
  • Click the Settings button, assign the settings for the format you chose, and then click OK to return to the Export All Images dialog.
  • Establish a base name for your images using the Save As field. Acrobat will create a sequence of images using this base name.
  • Leave Hide Extension unchecked so that the three-character file extension (for example, .jpg) will be visible at the end of all the exported graphics filenames.

Click the

Save

button to complete the process.

So, we can separately save the text and the graphics, but remember, what text fragments not always will be exported (some fragments presented by graphics, will remain the images), as well not all graphic images will be exported to (some figures are simply lost for the different reasons). For adding the missing context elements there are two ways: recognition of the text and copy-past operation.

Next page