iconv(1) DG/UX R4.11 iconv(1)
NAME
iconv - code set conversion
SYNOPSIS
iconv -f fromcode -t tocode [-m mode] [-v] [file(s)]
DESCRIPTION
Iconv converts the encoding of characters in file(s) from one code
set to another and writes the results to standard output.
The required arguments fromcode and tocode identify the input and
output code sets, respectively. The optional argument mode provides
a further distinction between multiple code set maps for the same
fromcode and tocode. The option -v verifies that the database is
organised correctly; no action is taken. If no file(s) arguments are
specified on the command line, iconv reads the standard input.
Iconv uses the database file /usr/lib/kbd/iconvdata. This file has
4 required fields fromcode, tocode, table, file and one optional
field mode. The order of the database fields is as named above. The
database fields are separated by spaces or tabs, and the database
rows are separated by newlines.
Iconv matches the required arguments fromcode and tocode and the
optional argument mode to the corresponding fields in the database.
The field mode does not have to be uniformly included or excluded
from the database, i.e. it may be included in some rows and not in
others. If the argument mode is not included in the iconv command
line, iconv will match the first row found that contains the correct
fromcode and tocode fields, ignoring any mode fields.
The naming conventions in the database are left entirely up to the
user. However, absolute pathnames are required for the file fields
not located in /usr/lib/kbd, as kbdpipe assumes that any file in the
"-f file" argument that does not begin with "/" will be found in
/usr/lib/kbd.
The codeset conversions supported in the supplied database are given
in the table below.
+---------------------------------------------+
| Code Set Conversions Supported |
|fromcode tocode modes comment |
+---------------------------------------------+
|ASCII 88591 d b e p |
|88591 ASCII d b e p |
|6937 88591 Teletext |
|88591 6937 Teletext |
|646 88591 d US Ascii |
|646DE 88591 d German |
|646DK 88591 d Danish |
|646GB 88591 d English Ascii |
|646ES 88591 d Spanish |
|646FR 88591 d French |
|646IT 88591 d Italian |
|646NO 88591 d Norwegian |
|646SE 88591 d Swedish |
|646PT 88591 d Portugese |
|646YU 88592 d Serbo Croation |
|88591 646 d b e p 7 bit Ascii |
|88592 646 d b e |
|88591 646DE d b e p German |
|88591 646DK d b e p Danish |
|88591 646GB d b e p English Ascii |
|88591 646ES d b e p Spanish |
|88591 646FR d b e p French |
|88591 646IT d b e p Italian |
|88591 646NO d b e p Norwegian |
|88591 646SE d b e p Swedish |
|88591 646PT d b e Portugese |
|88592 646YU d Serbo Croatian |
|PC437 88591 d b e p |
|88591 PC437 d b e p |
|PC850 88591 d b e |
|88591 PC850 d b e p |
|PC860 88591 d b e |
|88591 PC860 d b e |
|PC863 88591 d b e |
|88591 PC863 d b e |
|PC865 88591 d b e |
|88591 PC865 d b e |
+-+---------------------------------------------+-+
| Code Set Conversions Supported |
|fromcode tocode modes comment |
+-------------------------------------------------+
|PC437 ASCII d b e |
|PC850 ASCII d b e |
|PC860 ASCII d b e |
|PC863 ASCII d b e |
|PC865 ASCII d b e |
|ASCII 88591 d b e |
|ASCII EBCDIC d |
|EBCDIC ASCII d |
|ASCII IBM_EBCDIC d |
|PC437 EBCDIC d |
|EBCDIC PC437 d |
|PC437 IBM_EBCDIC d |
|88591 ROMAN8 d HP LaserJet II |
|88591 VT220 d b e |
|VT220 88591 d b e |
|88592 646 d b e |
|88593 646 d b e |
|88594 646 d b e |
|88595 646 d b e |
|88597 646 d b e |
|dgi 88591 d b e p |
|88591 dgi d b e p |
+-------------------------------------------------+
The fromcodes and tocodes 88591, 646 and 6937 correspond to the
International Standards ISO 8859-1, ISO 646 and ISO 6937
respectively.
The optional modes, d, b, e and p, have the following meaning:
d default
Any character that cannot be represented is mapped to the
"Ultimate fall back character" which in the tables supplied
is the underscore character '_'.
b best fit with no expansion
Characters are where possible, mapped to the closest
approximation of that character but always without
expansion, ie., all the character mappings are one-to-one.
This will be important, for example, when using curses-
based applications where any expansion of a character
representation would affect the screen
management. [If such code set mapping are
performed by the STREAMS-module in the TTY sub-system then
such mappings will be transparent and the application will
have no knowledge that these mappings take place.]
e best fit with expansion
Characters of the source code set are, where possible,
mapped to the closest approximation of that character in
the target code set. Where necessary the character in the
source code set is expanded to a sequence of characters in
the target code set.
p Printer mode - with overstriking.
If there is a non-destructive backspace, as exists on many
printers, then some characters that are not available can
be displayed by overstriking. In this way many accented
characters can be displayed.
EXAMPLES
An example of a database for iconv is below, with the following
fields:
fromcode tocode table file mode
(the above field names are not included in the database).
----------------------------------------------------------
88591 6937 88591.6937.b pubfile b
88591 6937 pubtable 88591.6937.d.t d
646 646DE togerman /mydir/togerman
Using the above database, the following converts the contents of
files mail1 and mail2 from code set 88591 to 6937 using b mode and
stores the results in file mail.local.
iconv -f 88591 -t 6937 -m b mail1 mail2 > mail.local
The following will accomplish the same result as above, as the b mode
from code set 88591 to 6937 will be the first row found containing
the correct match.
iconv -f 88591 -t 6937 mail1 mail2 > mail.local
FILES
/usr/lib/kbd/iconvdata default database
EXIT CODES
The exit status will be set with 0 upon successful completion, 1
otherwise.
SEE ALSO
kbdcomp(1M), kbdset(1), kbdload(1M), kbdpipe(1), conv(3C),
attkbd(7).
Licensed material--property of copyright holder(s)