Forgot password? | Forgot username? | Register

Acrobat DC PDF to Text

Acrobat DC PDF to Text

Hi Everyone,

Has anyone had trouble with using Acrobat DC in uploading OCRed documents to EMu where the text is not pulled through the PDF to Text registry entry? All PDFs I've ocred with Acrobat 9 have worked fine. Acrobat DC has major new changes and the text is not being pulled and saved as a separate .txt file in EMu. I'm assuming it has something to do with ImageMagick and/or Ghostscript.

Thanks,

Foy
Oriental Institute

Administrator has disabled public posting. Please login or register in order to proceed.

Re: Acrobat DC PDF to Text

Hi Foy,

Could you confirm what document handle is being associated with the .pdf file extensions on your desktop? The document handle used by your workstation can be found by

1) selecting "Start > Run" on windows,
2) entering "regedit" to open the "Registry Editor", and
3) navigating on the left panel to "Computer > HKEY_CLASSES_ROOT > .pdf"
The information you want is under the "Data" column on the right panel.

The registry entry used to identify the document handle used on your workstation to convert the PDF to text is called the "Convert" registry entry. For example, you would enter the registry entry below and replace Key 6 with the document handle found above. This will allow the system to know that document handle is associated with a PDF file extension and should convert the data to text.

Key 1: System
Key 2: Setting
Key 3: Multimedia
Key 4: Convert
Key 5: [Enter Document Handle]
Key 6: txt
Key 7: Server Command
Value: /usr/bin/pdftotext -eol dos "%1" "%2"

Forther information on the above instructions can be found in the KE EMu Help files under the article "EMu administration > The EMu Registry > Registry settings > Multimedia > Documents > Generating and managing document formats > Convert Registry entry > How to identify a document handle".

Hope this helps.

Regards,
Chresty

Chresty Torres (Axiell Toronto)
useravatar
Offline
2 Posts
Website 
Administrator has disabled public posting. Please login or register in order to proceed.

Re: Acrobat DC PDF to Text

Hi Chresty,

Thanks. That worked. The problem was a lack of a registry entry with "Acrobat.Document.DC" in Key 5 to reflect the data in the regedit data column.

Thanks,

Foy

Administrator has disabled public posting. Please login or register in order to proceed.
There are 0 guests and 0 other users also viewing this topic

Board Info

Board Stats
 
Total Topics:
599
Total Polls:
0
Total Posts:
1362
User Info
 
Total Users:
814
Newest User:
Paul
Members Online:
3
Guests Online:
201