Copy protection


#1

Hi,

I noticed that in some documents the text is copied as an image.
It is not convenient, could you please fix it?

For example, in the pdf viewer Evince does not have such problems. I really like your program, and I would like SumatraPDF not to have such problems.


#2

In some PDFs what looks like text is actually an image. Sumatra doesn’t do any text -> image conversion on extraction.

To do image -> text we would have to do OCR but Sumatra can’t do that (because it’s hard).


#3

It is likely 11Mihaylov is referring to “protected” text (from other user comments re no-copy DRM “Evince supports copy protection, but it can be turned off, some distros may turn it off by default”.)

In such cases standard SumatraPDF indicate it will only copy the text as an image (respecting the authors “wishes”)


#4

See also Why I have switched away from SumatraPDF : software


#5

All the instances where I find copy protected PDF files is oddly enough electronic data sheets. The entire purpose is to provide data so you can design in their parts but they don’t want you to copy any of that data directly… into your design documents for example. Crazy. So that is why I want to be able to bypass such copy protection.


#6

Why not add a program that strips PDF copy protection to your workflow? Either that, or modify Sumatra’s source and compile it so it ignores the PDF restriction flags.