[gs-bugs] [Bug 690648] New: many incorrectly encoded Latin
Extended-A characters
bugs.ghostscript.com-bugzilla-daemon at ghostscript.com
Thu Jul 23 11:20:42 PDT 2009
http://bugs.ghostscript.com/show_bug.cgi?id=690648
Summary: many incorrectly encoded Latin Extended-A characters
Product: Ghostscript
Version: 8.64
Platform: PC
OS/Version: Windows XP
Status: UNCONFIRMED
Severity: major
Priority: P4
Component: PDF Writer
AssignedTo: ken.sharp at artifex.com
ReportedBy: rpr.nospam at gmail.com
QAContact: gs-bugs at ghostscript.com
This is the procedure that demonstrates the problem:
(1) Open http://en.wikipedia.org/wiki/Latin_Extended-A_unicode_block
and print it to a PostScript file (I used the HP Universal Print Driver
ver. 4.7 on Windows XP Pro SP3).
(2) Convert the PS file to a PDF file using the following command:
ps2pdf14 test1.ps test1.pdf
(3) Open the PDF file in a PDF reader (I tried Adobe Reader 8),
select all the text, and paste it as unformatted text into a word
processor (I tried OpenOffice.org Writer 3.1 and MS Word 2003 SP3).
The following letters come out wrong in the pasted text, although
Adobe Reader displays and prints them correctly:
U+010A Ċ Latin Capital Letter C with dot above
U+010B ċ Latin Small Letter C with dot above
U+0110 Đ Latin Capital Letter D with stroke
U+0111 đ Latin Small Letter D with stroke
U+0116 Ė Latin Capital Letter E with dot above
U+0117 ė Latin Small Letter E with dot above
U+0120 Ġ Latin Capital Letter G with dot above
U+0121 ġ Latin Small Letter G with dot above
U+0122 Ģ Latin Capital Letter G with cedilla
U+0123 ģ Latin Small Letter G with cedilla
U+0130 İ Latin Capital Letter I with dot above
U+0136 Ķ Latin Capital Letter K with cedilla
U+0137 ķ Latin Small Letter K with cedilla
U+013B Ļ Latin Capital Letter L with cedilla
U+013C ļ Latin Small Letter L with cedilla
U+0145 Ņ Latin Capital Letter N with cedilla
U+0146 ņ Latin Small Letter N with cedilla
U+0150 Ő Latin Capital Letter O with double acute
U+0151 ő Latin Small Letter O with double acute
U+0156 Ŗ Latin Capital Letter R with cedilla
U+0157 ŗ Latin Small Letter R with cedilla
U+015E Ş Latin Capital Letter S with cedilla
U+015F ş Latin Small Letter S with cedilla
U+0162 Ţ Latin Capital Letter T with cedilla
U+0163 ţ Latin Small Letter T with cedilla
U+0170 Ű Latin Capital Letter U with double acute
U+0171 ű Latin Small Letter U with double acute
U+017B Ż Latin Capital Letter Z with dot above
U+017C ż Latin Small Letter Z with dot above
Since the letters display correctly but paste incorrectly, I'd say
they get encoded incorrectly in the PDF file.
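To check the pasted text without comparing it by eye, a small Python
sketch along these lines could help. It is not part of Ghostscript; the
names AFFECTED and missing_from are made up for illustration. It assumes
the pasted text was saved as UTF-8 and that a correctly extracted letter
appears as its single precomposed code point (a decomposed form such as
"C" followed by a combining dot would be reported as missing).

```python
import sys
import unicodedata

# Code points taken from the list above (Latin Extended-A letters that
# were pasted incorrectly). The list name is hypothetical.
AFFECTED = [
    0x010A, 0x010B, 0x0110, 0x0111, 0x0116, 0x0117, 0x0120, 0x0121,
    0x0122, 0x0123, 0x0130, 0x0136, 0x0137, 0x013B, 0x013C, 0x0145,
    0x0146, 0x0150, 0x0151, 0x0156, 0x0157, 0x015E, 0x015F, 0x0162,
    0x0163, 0x0170, 0x0171, 0x017B, 0x017C,
]


def missing_from(text):
    """Return (label, Unicode name) for each listed letter absent from text."""
    return [
        (f"U+{cp:04X}", unicodedata.name(chr(cp)))
        for cp in AFFECTED
        if chr(cp) not in text
    ]


if __name__ == "__main__":
    # Assumption: the pasted text is piped in on stdin as UTF-8.
    pasted = sys.stdin.read()
    for label, name in missing_from(pasted):
        print(label, name)
```

Saved as, say, check_paste.py (a hypothetical file name), it could be
run as "python check_paste.py < pasted.txt". Any letter it prints was
not found in the pasted text.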
-- rpr.