[Gs-dgs-devel] Re: [gs-devel] Extracting Text from Ghost Script
Rajkumar Sivasamy
rajkumar.siva@wipro.com
Fri, 22 Feb 2002 11:56:26 +0530
This is a multi-part message in MIME format.
------=_NextPartTM-000-19e8e527-2757-11d6-a942-00b0d0d06be8
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Hi Russell,
Thanks for the input. I tried with Pstotext, and it worked fine in
converting postscript doc to text. It will be of more helpful if there
is a way to *tune* ghostscript to return back the text directly.
Pls, mail U'r comments..
Thanks & Regards,
Raj.
> > Rajkumar Wrote:
> > Am trying to implement Accessibility for Ghost View(ggv) application in
> > Linux. Ghost-View is an application which is used for viewing
> > post-script and pdf files. Ghost view makes use of the "ghost script" to
> > render text on the screen. From accessibility perspective the text
> > (Document's Text) has to be exported to the AT (Assistive Technologies
> > like Screen Reader, Screen Magnifier) which will make use of it to
> > replicate the same to user (A screen reader will make use of the text to
> > read out them to the disabled user). Since Ghost View makes use of
> > "ghost script" to render the text on the screen, I believe there could
> > be some ways of getting back the text (ASCII format) from ghost script.
> >
> > I have found many utilities like "pstotext", "ps2ascii", "prescript" to
> > convert the postscript to text or html. But it will be helpful if there
> > is a way by which the ghost script can be directly tuned to return back
> > the text.
> >
> > PS: Is there any "mode" in which Ghost Script can be operated to return
> > back the text to a file?
>
> Russell wrote:
> Not that I know about. The best of the tools you mention
> is probably pstotext. However this can be confused by
> PostScript or PDF files that contain (I think) embedded
> Type 42 (TrueType) fonts.
>
> pstotext can give you either the text with line breaks as
> shown on the page, or it can give you each word separately
> with a bounding box for the word. The latter is what I use
> from GSview, since it allows me to highlight selected words
> or the result of a "find".
>
------=_NextPartTM-000-19e8e527-2757-11d6-a942-00b0d0d06be8
Content-Type: text/plain;
name="Wipro_Disclaimer.txt"
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment;
filename="Wipro_Disclaimer.txt"
**************************Disclaimer************************************
Information contained in this E-MAIL being proprietary to Wipro Limited
is 'privileged' and 'confidential' and intended for use only by the
individual or entity to which it is addressed. You are notified that any
use, copying or dissemination of the information contained in the E-MAIL
in any manner whatsoever is strictly prohibited.
********************************************************************
------=_NextPartTM-000-19e8e527-2757-11d6-a942-00b0d0d06be8--