Ladyofdragons

September 3rd, 2014, 05:38 AM
My mind is a tad boggled at how you would automate this.

Step 1: you would have to convert the PDF to text. Sometimes this works well, other times not so much, and a lot of editing may be required.

Once it's in text format, you need to divide all that content up into topic chunks. That's not something that can be automated: you'd want to deliberately design the topics, hierarchy, and content/topic types. Even if you wanted zero control over structure, you'd have to break up the content somehow. Publishers use different layout standards, and different games certainly do, so there's no reliable way to decide automatically where to split the content.

Lastly, you'd have to divide all those topics into logical snippets, which also wouldn't be done remotely well if it were automated. The most you could do there is one big text snippet per topic, and then you'd still have to do all the work of splitting it up to make it readable and usable anyway.
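
To make the "one big text snippet per topic" point concrete, here is a rough Python sketch of about the best a naive script could do with already-extracted text. Everything in it is an assumption for illustration: the heading rule (a short, capitalized line with no ending punctuation), the names `looks_like_heading` and `split_into_topics`, and the "Untitled" fallback are all made up, and no real book is guaranteed to follow that rule.

```python
import re

# Assumed heuristic (hypothetical): a heading is a short line made of
# capitalized words with no sentence punctuation. Real layouts vary a lot.
HEADING = re.compile(r"^[A-Z][A-Za-z0-9' ,:&-]{2,60}$")
CONNECTORS = {"of", "the", "and", "a", "to", "in"}


def looks_like_heading(line: str) -> bool:
    """Guess whether a line is a topic heading. This is only a guess."""
    text = line.strip()
    if not HEADING.match(text):
        return False
    # Every word must be capitalized, except small connector words.
    return all(w[0].isupper() or w.lower() in CONNECTORS for w in text.split())


def split_into_topics(text: str) -> list[tuple[str, str]]:
    """Return (title, body) pairs, with one big text blob per guessed topic."""
    topics = []
    title, body = "Untitled", []
    for line in text.splitlines():
        if looks_like_heading(line):
            # Close the previous topic, skipping an empty leading "Untitled".
            if body or title != "Untitled":
                topics.append((title, "\n".join(body).strip()))
            title, body = line.strip(), []
        else:
            body.append(line)
    topics.append((title, "\n".join(body).strip()))
    return topics


sample = "The Dragon Coast\nA windswept shore.\nIt is cold.\n\nTavern of the Boar\nServes ale.\n"
for topic_title, topic_body in split_into_topics(sample):
    print(topic_title, "->", repr(topic_body))
```

Even when the guess is right, each topic comes out as a single undivided blob, and any stray capitalized line (a stat block label, a name alone on a line) gets mistaken for a new topic. The splitting into usable snippets is still manual work.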

My feeling is you'd be much better off converting your PDF to text and doing logical, human-led copy/paste. Or wait for the content to come out on the marketplace.

-------------------------------------
"...You're going to backstab him with a ballista?"