[gs-bugs] [Bug 690648] New: many incorrectly encoded Latin Extended-A characters

bugs.ghostscript.com-bugzilla-daemon at ghostscript.com bugs.ghostscript.com-bugzilla-daemon at ghostscript.com
Thu Jul 23 11:20:42 PDT 2009


http://bugs.ghostscript.com/show_bug.cgi?id=690648

           Summary: many incorrectly encoded Latin Extended-A characters
           Product: Ghostscript
           Version: 8.64
          Platform: PC
        OS/Version: Windows XP
            Status: UNCONFIRMED
          Severity: major
          Priority: P4
         Component: PDF Writer
        AssignedTo: ken.sharp at artifex.com
        ReportedBy: rpr.nospam at gmail.com
         QAContact: gs-bugs at ghostscript.com


This is the procedure that demonstrates the problem:

(1) Open http://en.wikipedia.org/wiki/Latin_Extended-A_unicode_block
and print it to a postscript file (I used the HP Universal Print Driver
ver. 4.7 installed in Windows XP Pro. SP3).

(2) Convert the PS file to a PDF file using the following command:
ps2pdf14 test1.ps test1.pdf

(3) Open the PDF file in a PDF reader (I've tried Adobe Reader 8),
select the whole text and copy it to a text processing application
(I've tried the OpenOffice.org Writer 3.1 and MS Word 2003 SP3) as
unformatted text.

The problem is that the following letters are not shown correctly in
the pasted text (although the Adobe Reader displays and prints them
correctly):

U+010A Ċ Latin Capital Letter C with dot above
U+010B ċ Latin Small Letter C with dot above
U+0110 Đ Latin Capital Letter D with stroke
U+0111 đ Latin Small Letter D with stroke
U+0116 Ė Latin Capital Letter E with dot above
U+0117 ė Latin Small Letter E with dot above
U+0120 Ġ Latin Capital Letter G with dot above
U+0121 ġ Latin Small Letter G with dot above
U+0122 Ģ Latin Capital Letter G with cedilla
U+0123 ģ Latin Small Letter G with cedilla
U+0130 İ Latin Capital Letter I with dot above
U+0136 Ķ Latin Capital Letter K with cedilla
U+0137 ķ Latin Small Letter K with cedilla
U+013B Ļ Latin Capital Letter L with cedilla
U+013C ļ Latin Small Letter L with cedilla
U+0145 Ņ Latin Capital Letter N with cedilla
U+0146 ņ Latin Small Letter N with cedilla
U+0150 Ő Latin Capital Letter O with double acute
U+0151 ő Latin Small Letter O with double acute
U+0156 Ŗ Latin Capital Letter R with cedilla
U+0157 ŗ Latin Small Letter R with cedilla
U+015E Ş Latin Capital Letter S with cedilla
U+015F ş Latin Small Letter S with cedilla
U+0162 Ţ Latin Capital Letter T with cedilla
U+0163 ţ Latin Small Letter T with cedilla
U+0170 Ű Latin Capital Letter U with double acute
U+0171 ű Latin Small Letter U with double acute
U+017B Ż Latin Capital Letter Z with dot above
U+017C ż Latin Small Letter Z with dot above

I'd say that the letters get encoded incorrectly in the PDF file.

-- rpr.



------- You are receiving this mail because: -------
You are the QA contact for the bug, or are watching the QA contact.



More information about the gs-bugs mailing list