[gs-bugs] [Bug 685335] PDF interpreter doesn't process ToUnicode

bugs.ghostscript.com-bugzilla-daemon at ghostscript.com
Sun Jul 5 00:29:24 PDT 2009


------- Additional Comments From simengman at yahoo.com.cn  2009-07-05 00:29 -------
Thank you very much! I can use MuPDF to extract the text and GS to
create a JPEG file, but I want GS to do both in a single run, in
order to save time and to obtain some additional information. In gxchar.c, I
added one line (the last line below) to show_proceed(gs_show_enum * penum):
	    switch ((code = get_next_char_glyph((gs_text_enum_t *)penum,
						&chr, &glyph))
		    ) {
		default:	/* error */
			return code;
		case 2:	/* done */
		    return show_finish(penum);
		case 1:	/* font change */
		    pfont = penum->fstack.items[penum->fstack.depth].font;
		    penum->current_font = pfont;
		    pgs->char_tm_valid = false;
		    pair = 0;
		    penum->pair = 0;
		    /* falls through */
		case 0:	/* plain char */
		    /* added: map the current glyph to a Unicode code point */
		    gs_char unicode = pfont->procs.decode_glyph((gs_font *)pfont, glyph);
When I run

    gswin32.exe -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -sOutputFile=out.PDF x.PDF

decode_glyph returns the correct code, but when I run

    gswin32.exe -dProvideUnicodeDecoding -dProvideUnicode -dNOPAUSE -dBATCH -sDEVICE=jpeg -sOutputFile=out.jpg x.PDF

decode_glyph returns an incorrect code. How can I make the JPEG device handle
text, or make decode_glyph work the way it does with the pdfwrite device?
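
To make the question concrete, here is a minimal sketch in plain C of the
guarded lookup I would like to get working. It is not Ghostscript code:
demo_font, demo_decode_with_table, demo_decode_none, demo_glyph_to_unicode and
DEMO_NO_CHAR are all hypothetical stand-ins. I am only assuming that a font
without usable Unicode data behaves like the stub decoder here; I have not
confirmed that this is what happens with the JPEG device.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins; these are NOT the Ghostscript definitions. */
typedef unsigned long demo_char;
typedef unsigned long demo_glyph;

/* Assumed sentinel meaning "no Unicode mapping is known for this glyph". */
#define DEMO_NO_CHAR ((demo_char)~0UL)

typedef struct demo_font_s demo_font;
typedef demo_char (*demo_decode_proc)(const demo_font *font, demo_glyph glyph);

struct demo_font_s {
    demo_decode_proc decode_glyph; /* per-font callback; may be NULL */
    const demo_char *to_unicode;   /* glyph-indexed table, or NULL */
    size_t to_unicode_len;         /* number of entries in to_unicode */
};

/* Decoder for a font that carries a glyph-to-Unicode table. */
static demo_char demo_decode_with_table(const demo_font *font, demo_glyph glyph)
{
    if (font->to_unicode == NULL || glyph >= font->to_unicode_len)
        return DEMO_NO_CHAR;
    return font->to_unicode[glyph];
}

/* Stub decoder for a font that has no Unicode data at all. */
static demo_char demo_decode_none(const demo_font *font, demo_glyph glyph)
{
    (void)font;
    (void)glyph;
    return DEMO_NO_CHAR;
}

/*
 * Guarded lookup: returns 1 and stores the code point in *out when the
 * callback yields a real mapping, and returns 0 (leaving *out untouched)
 * when the callback is missing or reports the sentinel.
 */
static int demo_glyph_to_unicode(const demo_font *font, demo_glyph glyph,
                                 demo_char *out)
{
    demo_char u;

    assert(font != NULL && out != NULL);
    if (font->decode_glyph == NULL)
        return 0;
    u = font->decode_glyph(font, glyph);
    if (u == DEMO_NO_CHAR)
        return 0;
    *out = u;
    return 1;
}
```

With a check of this kind the caller can at least tell "no mapping available"
apart from a real code point, instead of treating whatever decode_glyph
returns as valid text.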

