Community

Converting PDFs to text

 
jeffy



Joined: 03 Mar 2003
Posts: 327
Location: Philadelphia

Posted: Mon Oct 13, 2003 7:19 pm    Post subject: Converting PDFs to text

I found this great tool for converting the text in PDFs to plain text:

http://pricelessware.org/2003/PL2003TEXT.htm#P110

It's a command-line program. Call it, for example, like this:

C:\applications\PDF-TXT1.EXE "C:\whatever\My PDF Document.pdf" C:\temp\from_pdf.txt

I have nothing to do with the creation of this program. I just use it and love it. Enjoy.
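
If you want to call the converter from a script instead of typing the command, here is a minimal Python sketch. It assumes the same install location and the same argument order (input PDF first, output text file second) as the example command above; the function names build_command and convert are made up for illustration and are not part of the tool.

```python
import subprocess
from pathlib import Path

# Install path copied from the example command; adjust it for your machine.
CONVERTER = r"C:\applications\PDF-TXT1.EXE"


def build_command(pdf_path, txt_path, converter=CONVERTER):
    """Return the argument list: converter, input PDF, output text file."""
    return [converter, str(pdf_path), str(txt_path)]


def convert(pdf_path, txt_path, converter=CONVERTER):
    """Run the converter; raises CalledProcessError on a non-zero exit code."""
    if not Path(converter).is_file():
        raise FileNotFoundError(f"Converter not found: {converter}")
    subprocess.run(build_command(pdf_path, txt_path, converter), check=True)


# Show the arguments that would be passed, without running anything.
print(build_command(r"C:\whatever\My PDF Document.pdf", r"C:\temp\from_pdf.txt"))
```

Passing the arguments as a list means the PDF path with spaces does not need manual quoting. To actually run the conversion, call convert() with the same two paths.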
lichudang



Joined: 27 Jan 2004
Posts: 5

Posted: Fri Jan 30, 2004 9:08 pm

Thanks, Great!
Fredkc



Joined: 10 Apr 2007
Posts: 6
Location: Riverside, Ca.

Posted: Wed Apr 11, 2007 8:27 am

Another handy command-line tool is PDFToHTML.

It reads a PDF file and does its best to make an HTML file of it. Multi-column pages are not its strong point, but it does a pretty good job.

Freeware:
http://sourceforge.net/projects/pdftohtml/

And no, I don't have a thing to do with this one, either; but yes I use it all the time.
_________________
Life IS mystical. It's just that we're used to it.
dak



Joined: 22 Mar 2007
Posts: 18

Posted: Thu Apr 12, 2007 5:49 am    Post subject: Re: Converting PDFs to text

jeffy wrote:
I found this great tool for converting the text in PDFs to plain text:

http://pricelessware.org/2003/PL2003TEXT.htm#P110

Looks like this utility is no longer available from this site.

It is listed on the front page, but is not available from the Text section.

Cheers,

dak
SteveH



Joined: 03 Apr 2003
Posts: 329
Location: Edinburgh, Scotland

Posted: Thu Apr 12, 2007 11:55 am

It may also be worth trying xPDF. The Windows version includes a utility called pdftotext that will convert a pdf to text and, optionally, preserve formatting.
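
For scripting pdftotext, a rough Python sketch follows. It assumes pdftotext is on your PATH and uses its -layout option for the "preserve formatting" case; the helper names pdftotext_command and pdf_to_text are hypothetical, chosen only for this example.

```python
import shutil
import subprocess


def pdftotext_command(pdf_path, txt_path, keep_layout=False, exe="pdftotext"):
    """Build the pdftotext argument list; options go before the file names."""
    cmd = [exe]
    if keep_layout:
        cmd.append("-layout")  # try to keep the original physical layout
    cmd += [str(pdf_path), str(txt_path)]
    return cmd


def pdf_to_text(pdf_path, txt_path, keep_layout=False):
    """Run pdftotext if it is on PATH; return False when it is not installed."""
    exe = shutil.which("pdftotext")
    if exe is None:
        return False
    subprocess.run(pdftotext_command(pdf_path, txt_path, keep_layout, exe), check=True)
    return True


# Show the arguments for a layout-preserving conversion, without running it.
print(pdftotext_command("in.pdf", "out.txt", keep_layout=True))
```

Without keep_layout the text comes out in reading order, which is usually easier to post-process. With it, columns and tables tend to keep their shape at the cost of extra whitespace.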
proximity4



Joined: 07 Jul 2010
Posts: 1

Posted: Wed Jul 07, 2010 10:13 am

A-PDF Text Extractor is a free utility designed to extract text from Adobe PDF files for use in other applications. It has three text output modes: In PDF Order, Smart Rearrange and With Position.

The program is freeware, which means that you can use it either personally or commercially for free.

The program is a standalone application; Adobe Acrobat is not needed. A command-line version is also available, so you can call it from your own program or script.

If you want to grab images from PDF files, you can check out the A-PDF Image Extractor.