[TriLUG] convert pdf to doc
Jeffery Painter via TriLUG
trilug at trilug.org
Thu Sep 7 16:10:44 EDT 2017
I agree - after conversion, I did a select all and copied the text out
to a separate file. No formatting is preserved, but I didn't care too
much about that.
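
The copy-out step can also be scripted. Below is a minimal sketch, assuming the pdftotext tool from poppler-utils is installed and on the PATH (an assumption; only manual select-all and copy was used above). The helper names and file names are hypothetical.

```python
import subprocess


def build_pdftotext_command(pdf_path, txt_path):
    # pdftotext takes the input PDF followed by the output text file.
    # Without the -layout option the page layout is not preserved.
    return ["pdftotext", pdf_path, txt_path]


def extract_text(pdf_path, txt_path):
    # Raises CalledProcessError if pdftotext exits non-zero, and
    # FileNotFoundError if the tool is not installed.
    subprocess.run(build_pdftotext_command(pdf_path, txt_path), check=True)


# Example call with hypothetical file names:
# extract_text("bylaws-ocr.pdf", "bylaws.txt")
```

This only works on a PDF that already has a text layer, such as the output of the OCR pass.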
--
Jeff
On 09/07/2017 04:08 PM, Brian via TriLUG wrote:
> OCR made the document *bigger*?? That's senseless. Perhaps it
> retained decompressed versions of the images?
>
> On 09/07/2017 04:01 PM, Jeffery Painter via TriLUG wrote:
>>
>> Just a heads up.
>>
>> If your PDF is a collection of images rather than real text, I just
>> installed pdfocr and it worked wonderfully! Our HOA bylaws were scans
>> of docs and not searchable - in a few minutes, it had converted the
>> 60-page document into a search-friendly PDF (however, it exploded the
>> size from 2.5 MB to about 60 MB). YMMV
>>
>> https://github.com/gkovacs/pdfocr
>>
>> --
>> Jeff
>>
>>
>> On 09/06/2017 02:18 PM, David Burton via TriLUG wrote:
>>> The latest Kingsoft WPS Office
>>> <http://www.kingsoftstore.com/software/kingsoft-office-freeware> Writer
>>> (self-identifies as v. 10.2.0.5934) supports PDF-to-DOC file
>>> conversion. I just tried it on a .pdf file, and the .doc conversion
>>> looks just about perfect.
>>>
>>> Caveat #1: I only tried it on one file.
>>>
>>> Caveat #2: WPS Office has the very obnoxious habit of seizing control
>>> of .pdf files (and perhaps MS Office files), repeatedly making itself
>>> the default Windows application for opening such files. See the
>>> forwarded email & annotated screenshot, below, for how to disable that.
>>>
>>> Dave
>>>
>>>
>>>
>>> ---------- Begin forwarded message ----------
>>> From: David Burton
>>> Date: Sun, Jul 23, 2017 at 5:53 AM
>>> Subject: How to stop Kingsoft Writer from changing Windows file
>>> association for PDF files
>>> To: Kingsoft <officesupport at kingsoft.com>
>>>
>>>
>>> Dear Kingsoft folks,
>>>
>>>
>>> Kingsoft WPS Office is a very fine tool, and I am quite impressed with
>>> your many improvements. In particular, I very much prefer the ads which
>>> the free version now has, to the horrible printing "watermarks" which
>>> it used to have!
>>>
>>> But I have encountered a problem in the latest version, which is very
>>> annoying. The problem is that, even after I manually change the default
>>> file association for .pdf files to Adobe Reader, WPS Office changes the
>>> default file association for .pdf files from "Adobe Reader" back to
>>> "WPS Writer."
>>>
>>> That is very undesirable and unfriendly behavior! That should not
>>> happen by default!
>>>
>>> I google-searched
>>> <https://www.google.com/search?q=How+to+stop+Kingsoft+Writer+from+changing+Windows+file+association+for+PDF+files&btnG=Search>
>>> and found an article
>>> <http://www.binarynow.com/office-suite/change-kingsoft-office-default-file-association-file-formats-to-microsoft-office-doc-rtf-xls-and-ppt/>
>>> about using "Kingsoft Office Configuration Tool" (now called WPS Office
>>> Configuration Tool) to change the file associations. So I think that
>>> will stop this bad behavior by WPS Writer.
>>>
>>> But the behavior should never happen, anyhow. Kingsoft Office should
>>> never make itself the default application for handling another
>>> program's native files without first obtaining the permission of the
>>> user. For .pdf files, the native programs are Adobe's Acrobat and
>>> Acrobat Reader. For .doc and .docx files, the native programs are
>>> Microsoft Word and Word Viewer. Etc.
>>>
>>> Will you please fix WPS Office to never seize another program's file
>>> associations without first obtaining the user's permission to do so?
>>> *...[snip]*
>>> ---------- End forwarded message ----------
>>>
>>>
>>> <http://www.geeksalive.com/kingsoft_wps_settings_fix1.png>
>>>
>>> *(click for bigger version)*
>>
>