Lone Wolf Development Forums  

Go Back   Lone Wolf Development Forums > Realm Works Forums > Realm Works Discussion
Register FAQ Community Today's Posts Search

Notices

Reply
 
Thread Tools Display Modes
Chemlak
Senior Member
 
Join Date: Aug 2012
Posts: 432

Old April 14th, 2016, 01:26 PM
I bow to Rob's superior understanding of PDFs (which should come as no great surprise to anyone, really).

Chief Calendar Champion Chemlak

Join the unofficial Realm Works IRC channel! Join #realm-works
Chemlak is offline   #11 Reply With Quote
Acenoid
Senior Member
 
Join Date: Dec 2013
Posts: 798

Old April 14th, 2016, 03:31 PM
Maybe this will help?

http://www.wikihow.com/Modify-Font-P...g_the_File_sub

Join the (unofficial) Realm-Works IRC Chat: #realm-works on the Rizon Network (https://wiki.rizon.net/index.php?title=Servers)
-> Browser Client: https://kiwiirc.com/client/irc.rizon.net
Acenoid is offline   #12 Reply With Quote
EightBitz
Senior Member
 
Join Date: May 2013
Posts: 1,458

Old April 14th, 2016, 04:02 PM
Quote:
Originally Posted by rob View Post
Yes, that's correct. A PDF is nothing more than individual letters positioned on a page. That's it. Nothing more.
I copy words
To paste them into RealmWorks, but the spacing's off.
It takes so long
To edit what I've pasted, man this task is rough.

Text on a page.
A PDF's just text on a page.
Copy Paste
Proofing takes forever, and it hurts my eyes
All this text
I think I'll hit the forums, get some good advice

Ooooh, ooooh, ooooooooooooh
*violins*
EightBitz is offline   #13 Reply With Quote
Asandir
Senior Member
 
Join Date: Dec 2010
Location: Virginia, USA
Posts: 335

Old April 14th, 2016, 04:29 PM
Here's my favorite one for this issue: Even printing it and running it through a decent OCR program it has issues.

If non~ ofth~ qu~tions ~aMWf'rw with
li~., and at IU5l On~ qu~.tionwa. an.w,,~wilh a
tm~ altS\<",r, th~ spirits II1iI)' implant a "P"ll in your
mind asArd<-sdlr$$oIkrCOllc.oct~d.. As Ions a.you
~ast at J.e.iil 011" spdI of 5£b In",1 or '-",r wilhin
l-l hours bdore casting Ard~.5af~conlaet,)'ou
immwiat~1y pu-~ a .prll in~of th~aprndw
spell. This n"" spdI is of th" """""' !nod a. th~
aprndw .prll. and is~by tM spirit in
q~n. no! by you: 'fOU nttd. DDt """" know 'M
spell, and in rareG1l56 ~ spdl IIW)'DDt nom be on
~clawiliR. Thccbo5rn spdI is 5Iond in your
mind as though p~in thr: nonn.tl. fashion. 1h~
spdl is 5riU prq»M ........, ifyou are a "P"n~
spdlG1lSt~r, m~anins thallhr .........."'~ spell 5101 can
only be apr~On 1Mrna..,., sprll (though 01M!"
spell MoIsan 1lIUl'Ifft~). If th~ implantw spell
""'I.uiU'S mat~rial compoIl"nts, you still mUSI provid~
thtm in ordu to cut ~ spdl. Thr implantw "P"U
u-mains prq»~ until th" =xt tim,,)'Ou ....t and
rKO\",r opells, and if it ham't!>ttn cast by the ~nd of
that time, it iswast~.

Minutus cantorum, minutus balorum, minutus carborata descendum pantorum.
Asandir is offline   #14 Reply With Quote
daplunk
Senior Member
 
Join Date: Jan 2016
Location: Adelaide, Australia
Posts: 2,294

Old April 14th, 2016, 04:39 PM
Oh wow that's a bad case. I see the odd issue but nothing on this level!
daplunk is offline   #15 Reply With Quote
Asandir
Senior Member
 
Join Date: Dec 2010
Location: Virginia, USA
Posts: 335

Old April 14th, 2016, 04:42 PM
Yeah, it's fantastic isn't it? And it happens with a good chunk of this 3rd party publisher's product.

Minutus cantorum, minutus balorum, minutus carborata descendum pantorum.
Asandir is offline   #16 Reply With Quote
rob
Senior Member
Lone Wolf Staff
 
Join Date: May 2005
Posts: 8,232

Old April 14th, 2016, 05:11 PM
Yikes! More than likely, the 3PP is using a freeware font that doesn't properly support all the things that a professional font would normally include, which then results in the corresponding text just being gibberish.

I've seen this with a few fonts used sparsely in a few PDF products, but nothing remotely as messed up as the example above. Ugh.
rob is offline   #17 Reply With Quote
Parody
Senior Member
 
Join Date: Jan 2013
Location: Rochester, MN
Posts: 1,518

Old April 14th, 2016, 08:09 PM
Along with the possible font issue, I wonder what they're using to generate those PDFs and what application they came from.

If they left it in, you can see that info in Adobe Reader in the Properties window (File/Properties... or Ctrl-D) on the Description tab.


Last edited by Parody; April 14th, 2016 at 08:31 PM.
Parody is offline   #18 Reply With Quote
Asandir
Senior Member
 
Join Date: Dec 2010
Location: Virginia, USA
Posts: 335

Old April 15th, 2016, 05:46 AM
Let's see:

Helvetica
Helvetica Bold
Helvetica Oblique
Hidden HorzOCR (embedded)
Magical Medieval (Embedded subset)
Olsen TF Regular
Times Bold
Times Italic
Times Roman

Minutus cantorum, minutus balorum, minutus carborata descendum pantorum.
Asandir is offline   #19 Reply With Quote
Parody
Senior Member
 
Join Date: Jan 2013
Location: Rochester, MN
Posts: 1,518

Old April 15th, 2016, 06:33 AM
OK then. It looks like some or all of the document was created by Acrobat's built-in OCR capabilities and they didn't bother to check and fix the resulting text. Depending on the options they chose it may actually look like a mix of text and bitmaps, or it might appear as just the scanned images and the text completely hidden (only used for searching and copy/paste).

FWIW: I have not seen this in legitimately published book PDFs but I can believe that some publishers might not care to fix their work, especially for old books for which they no longer have either the source document (and all of its parts) or the application in which it was created and a system that can run that application (or that predate "modern" desktop publishing). :(

(Sorry if I'm flailing a bit; I don't OCR things but I do a lot of other desktop publishing work.)


Last edited by Parody; April 15th, 2016 at 07:03 AM.
Parody is offline   #20 Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -8. The time now is 03:44 PM.


Powered by vBulletin® - Copyright ©2000 - 2024, vBulletin Solutions, Inc.
wolflair.com copyright ©1998-2016 Lone Wolf Development, Inc. View our Privacy Policy here.