[gs-bugs] [Bug 691523] New: Text with TrueType fonts in PDF/A files (CIDFonts) will not translate to Unicode

bugzilla-daemon at ghostscript.com bugzilla-daemon at ghostscript.com
Tue Aug 3 15:49:50 UTC 2010


http://bugs.ghostscript.com/show_bug.cgi?id=691523

           Summary: Text with TrueType fonts in PDF/A files (CIDFonts)
                    will not translate to Unicode
           Product: Ghostscript
           Version: 8.71
          Platform: PC
        OS/Version: Windows Vista
            Status: NEW
          Severity: normal
          Priority: P4
         Component: PDF Writer
        AssignedTo: ken.sharp at artifex.com
        ReportedBy: ghostscript_bugs at woessner-it.de
         QAContact: gs-bugs at ghostscript.com
   Estimated Hours: 0.0


Created an attachment (id=6600)
 --> (http://bugs.ghostscript.com/attachment.cgi?id=6600)
PostScript source file

I am converting a PS file with embedded TrueType fonts into PDF/A.
The resulting PDF/A shows no obvious errors, but when copying text from the
file (i.E. with Adobe Reader) the copied chars will not translate properly to
Unicode.

I attach test files which contain the text ABCD-XYZ

Behaviour differs by version:

Up to GhostScript version 8.70, the hyphen will translate to (hex) 0500 (in
this example 5 is the offset of this glyph in the font subset).

In GhostScript version 8.71 the above text translates to (hex) 0100 0200 0300
0400 0500 0600 0700 0800.

If -dPDFA is not defined, all looks fine.

Looks like there is a major problem in the conversion of TrueType fonts to
TrueType CID fonts.

I add a sample PS file as attachment.

Please convert with:

gswin32c.exe -sOutputFile=out.pdf -sDEVICE=pdfwrite -dPDFA -dNOPAUSE -c save
pop .setpdfwrite -f in.ps -c quit

-- 
Configure bugmail: http://bugs.ghostscript.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.


More information about the gs-bugs mailing list