Senior Member
Join Date: Aug 2012
Posts: 432
|
I bow to Rob's superior understanding of PDFs (which should come as no great surprise to anyone, really).
Chief Calendar Champion Chemlak Join the unofficial Realm Works IRC channel! Join #realm-works |
#11 |
Senior Member
Join Date: Dec 2013
Posts: 798
|
Join the (unofficial) Realm-Works IRC Chat: #realm-works on the Rizon Network (https://wiki.rizon.net/index.php?title=Servers) -> Browser Client: https://kiwiirc.com/client/irc.rizon.net |
#12 |
Senior Member
Join Date: May 2013
Posts: 1,458
|
Quote:
To paste them into RealmWorks, but the spacing's off.
It takes so long
To edit what I've pasted, man this task is rough.
Text on a page.
A PDF's just text on a page.
Copy Paste
Proofing takes forever, and it hurts my eyes
All this text
I think I'll hit the forums, get some good advice
Ooooh, ooooh, ooooooooooooh
*violins* |
|
#13 |
Senior Member
Join Date: Dec 2010
Location: Virginia, USA
Posts: 335
|
Here's my favorite one for this issue. Even after printing it and running it through a decent OCR program, it has issues.
If non~ ofth~ qu~tions ~aMWf'rw with li~., and at IU5l On~ qu~.tionwa. an.w,,~wilh a tm~ altS\<",r, th~ spirits II1iI)' implant a "P"ll in your mind asArd<-sdlr$$oIkrCOllc.oct~d.. As Ions a.you ~ast at J.e.iil 011" spdI of 5£b In",1 or '-",r wilhin l-l hours bdore casting Ard~.5af~conlaet,)'ou immwiat~1y pu-~ a .prll in~of th~aprndw spell. This n"" spdI is of th" """""' !nod a. th~ aprndw .prll. and is~by tM spirit in q~n. no! by you: 'fOU nttd. DDt """" know 'M spell, and in rareG1l56 ~ spdl IIW)'DDt nom be on ~clawiliR. Thccbo5rn spdI is 5Iond in your mind as though p~in thr: nonn.tl. fashion. 1h~ spdl is 5riU prq»M ........, ifyou are a "P"n~ spdlG1lSt~r, m~anins thallhr .........."'~ spell 5101 can only be apr~On 1Mrna..,., sprll (though 01M!" spell MoIsan 1lIUl'Ifft~). If th~ implantw spell ""'I.uiU'S mat~rial compoIl"nts, you still mUSI provid~ thtm in ordu to cut ~ spdl. Thr implantw "P"U u-mains prq»~ until th" =xt tim,,)'Ou ....t and rKO\",r opells, and if it ham't!>ttn cast by the ~nd of that time, it iswast~. Minutus cantorum, minutus balorum, minutus carborata descendum pantorum. |
#14 |
Senior Member
Join Date: Jan 2016
Location: Adelaide, Australia
Posts: 2,294
|
Oh wow that's a bad case. I see the odd issue but nothing on this level!
|
#15 |
Senior Member
Join Date: Dec 2010
Location: Virginia, USA
Posts: 335
|
Yeah, it's fantastic, isn't it? And it happens with a good chunk of this 3rd party publisher's products.
|
#16 |
Senior Member
Lone Wolf Staff
Join Date: May 2005
Posts: 8,232
|
Yikes! More than likely, the 3PP is using a freeware font that doesn't properly support all the things that a professional font would normally include, which then results in the corresponding text just being gibberish.
I've seen this with a few fonts used sparsely in a few PDF products, but nothing remotely as messed up as the example above. Ugh. |
#17 |
Senior Member
Join Date: Jan 2013
Location: Rochester, MN
Posts: 1,518
|
Along with the possible font issue, I wonder what they're using to generate those PDFs and what application they came from.
If they left it in, you can see that info in Adobe Reader in the Properties window (File/Properties... or Ctrl-D) on the Description tab.
Last edited by Parody; April 14th, 2016 at 08:31 PM. |
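For anyone who would rather check this without opening Adobe Reader, here is a rough Python sketch. It only scans the raw bytes of the file for `/Creator` and `/Producer` entries written as plain literal strings, and it is an assumption that a given PDF stores them that way: it will miss metadata that is compressed, encrypted, hex-encoded, UTF-16, or that contains unescaped nested parentheses. The function name `sniff_pdf_origin` and the sample values are made up for this example.

```python
import re

# Matches "/Producer (...)" or "/Creator (...)" where the value is a PDF
# literal string; backslash-escaped characters are allowed inside it.
_INFO_RE = re.compile(rb"/(Producer|Creator)\s*\(((?:\\.|[^\\()])*)\)", re.DOTALL)


def sniff_pdf_origin(raw: bytes) -> dict:
    """Best-effort lookup of the Creator/Producer strings in raw PDF bytes."""
    found = {}
    for key, value in _INFO_RE.findall(raw):
        # Undo the backslash escaping of parentheses and backslashes.
        text = re.sub(rb"\\([()\\])", rb"\1", value)
        # Keep the first occurrence of each key.
        found.setdefault(key.decode("ascii"), text.decode("latin-1"))
    return found


# Hypothetical sample bytes, not taken from any real product.
sample = b"<< /Creator (ExampleLayout 2.0 \\(Windows\\)) /Producer (ExampleOCR Plug-in) >>"
print(sniff_pdf_origin(sample))
```

On a real file you would pass in the bytes from something like `open("book.pdf", "rb").read()` (the filename is a placeholder). An empty result just means this crude scan found nothing, not that the info was stripped.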
#18 |
Senior Member
Join Date: Dec 2010
Location: Virginia, USA
Posts: 335
|
Let's see:
Helvetica
Helvetica Bold
Helvetica Oblique
Hidden HorzOCR (embedded)
Magical Medieval (Embedded subset)
Olsen TF Regular
Times Bold
Times Italic
Times Roman
|
#19 |
Senior Member
Join Date: Jan 2013
Location: Rochester, MN
Posts: 1,518
|
OK then. It looks like some or all of the document was created by Acrobat's built-in OCR capabilities and they didn't bother to check and fix the resulting text. Depending on the options they chose, it may actually look like a mix of text and bitmaps, or it might appear as just the scanned images with the text completely hidden (only used for searching and copy/paste).
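As a sketch of that reasoning: the "Hidden HorzOCR" entry in the font list is the giveaway, on the assumption that it is the font an OCR pass uses for its invisible text layer. A small Python check could flag it in a list of font names copied from the document properties. The function name and the marker list are made up for illustration.

```python
# Assumed marker: a font name associated with an OCR-generated hidden text layer.
OCR_FONT_MARKERS = ("hiddenhorzocr",)


def ocr_fonts(font_names):
    """Return the font names that look like OCR text-layer fonts."""
    hits = []
    for name in font_names:
        # Normalize: ignore case, spaces, and hyphens before comparing.
        key = name.lower().replace(" ", "").replace("-", "")
        if any(marker in key for marker in OCR_FONT_MARKERS):
            hits.append(name)
    return hits


fonts = [
    "Helvetica",
    "Helvetica Bold",
    "Helvetica Oblique",
    "Hidden HorzOCR (embedded)",
    "Magical Medieval (Embedded subset)",
    "Olsen TF Regular",
    "Times Bold",
    "Times Italic",
    "Times Roman",
]
print(ocr_fonts(fonts))  # ['Hidden HorzOCR (embedded)']
```

A hit only suggests that an OCR pass was run at some point; it says nothing about how much of the text came from it or whether anyone proofed the result.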
FWIW: I have not seen this in legitimately published book PDFs, but I can believe that some publishers might not care to fix their work, especially for old books for which they no longer have either the source document (and all of its parts) or the application in which it was created and a system that can run that application (or that predate "modern" desktop publishing). :( (Sorry if I'm flailing a bit; I don't OCR things but I do a lot of other desktop publishing work.)
Last edited by Parody; April 15th, 2016 at 07:03 AM. |
#20 |
|
|