2007/8/3, chen bin <chenbin.sh at gmail.com>: > I try to extract text from pdf files. My algorithm to combine > characters is simple and works well on most situations. Have a look at pdftotext. Best Martin