[TriLUG] convert pdf to doc

Jeffery Painter via TriLUG trilug at trilug.org
Thu Sep 7 16:10:44 EDT 2017


I agree - after conversion, I did a select all and copied the text out 
to a separate file.  No formatting is preserved, but I didn't care too 
much about that.
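
For anyone who would rather script that copy step than select-all by hand, 
here is a minimal sketch. It assumes the pdftotext command from 
poppler-utils is installed (it is not part of pdfocr), and the function 
names and file names are made up for illustration. It only reads the text 
layer that the OCR pass already added; it does no OCR itself.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftotext_cmd(pdf_path, txt_path, keep_layout=False):
    """Return the argv list for a pdftotext call (nothing is executed here)."""
    cmd = ["pdftotext"]
    if keep_layout:
        # -layout asks pdftotext to approximate the original page layout
        cmd.append("-layout")
    cmd += [str(pdf_path), str(txt_path)]
    return cmd


def pdf_to_text(pdf_path, txt_path=None, keep_layout=False):
    """Dump the text layer of an already-OCRed PDF to a plain-text file."""
    pdf_path = Path(pdf_path)
    # Default output name: same path with a .txt suffix (an assumption)
    txt_path = Path(txt_path) if txt_path else pdf_path.with_suffix(".txt")
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found; install poppler-utils")
    subprocess.run(build_pdftotext_cmd(pdf_path, txt_path, keep_layout), check=True)
    return txt_path
```

Calling pdf_to_text("bylaws-ocr.pdf") would write bylaws-ocr.txt next to 
the PDF. As with the manual copy, formatting is mostly lost unless 
keep_layout=True is passed.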

--
Jeff

On 09/07/2017 04:08 PM, Brian via TriLUG wrote:
> OCR made the document *bigger*??  That's senseless.  Perhaps it 
> retained decompressed versions of the images?
>
> On 09/07/2017 04:01 PM, Jeffery Painter via TriLUG wrote:
>>
>> Just a heads up.
>>
>> If your PDF is a collection of images rather than real text, I just
>> installed pdfocr and it worked wonderfully!  Our HOA bylaws were scans
>> of docs and not searchable - in a few minutes, it had converted the
>> 60-page document into a search-friendly PDF (however, it exploded the
>> size from 2.5 MB to about 60 MB). YMMV
>>
>> https://github.com/gkovacs/pdfocr
>>
>> -- 
>> Jeff
>>
>>
>> On 09/06/2017 02:18 PM, David Burton via TriLUG wrote:
>>> The latest Kingsoft WPS Office
>>> <http://www.kingsoftstore.com/software/kingsoft-office-freeware> Writer
>>> (self-identifies as v. 10.2.0.5934) supports PDF-to-DOC file conversion.
>>> I just tried it on a .pdf file, and the .doc conversion looks just about
>>> perfect.
>>>
>>> Caveat #1:  I only tried it on one file.
>>>
>>> Caveat #2:  WPS Office has the very obnoxious habit of seizing control of
>>> .pdf files (and perhaps MS Office files), repeatedly making itself the
>>> default Windows application for opening such files. See the forwarded email
>>> & annotated screenshot, below, for how to disable that.
>>>
>>> Dave
>>>
>>>
>>>
>>> ---------- Begin forwarded message ----------
>>> From: David Burton
>>> Date: Sun, Jul 23, 2017 at 5:53 AM
>>> Subject: How to stop Kingsoft Writer from changing Windows file association
>>> for PDF files
>>> To: Kingsoft <officesupport at kingsoft.com>
>>>
>>>
>>> Dear Kingsoft folks,
>>>
>>>
>>> Kingsoft WPS Office is a very fine tool, and I am quite impressed with your
>>> many improvements. In particular, I very much prefer the ads which the free
>>> version now has, to the horrible printing "watermarks" which it used to
>>> have!
>>>
>>> But I have encountered a problem in the latest version, which is very
>>> annoying. The problem is that, even after I manually change the default
>>> file association for .pdf files to Adobe Reader, WPS Office changes the
>>> default file association for .pdf files from "Adobe Reader" to "WPS
>>> Writer."
>>>
>>> That is very undesirable and unfriendly behavior! That should not happen by
>>> default!
>>>
>>> I google-searched
>>> <https://www.google.com/search?q=How+to+stop+Kingsoft+Writer+from+changing+Windows+file+association+for+PDF+files&btnG=Search>
>>> and found an article
>>> <http://www.binarynow.com/office-suite/change-kingsoft-office-default-file-association-file-formats-to-microsoft-office-doc-rtf-xls-and-ppt/>
>>> about using "Kingsoft Office Configuration Tool" (now called WPS Office
>>> Configuration Tool) to change the file associations. So I think that will
>>> stop this bad behavior by WPS Writer.
>>>
>>> But the behavior should never happen, anyhow. Kingsoft Office should never
>>> make itself the default application for handling another program's native
>>> files without first obtaining the permission of the user. For .pdf files,
>>> the native programs are Adobe's Acrobat and Acrobat Reader. For .doc and
>>> .docx files, the native programs are Microsoft Word and Word Viewer. Etc.
>>>
>>> Will you please fix WPS Office to never seize another program's file
>>> associations without first obtaining the user's permission to do so?
>>> *...[snip]*
>>> ---------- End forwarded message ----------
>>>
>>>
>>> http://www.geeksalive.com/kingsoft_wps_settings_fix1.png
>>>
>>> *(click for bigger version)*
>>
>