Museum

Home

Lab Overview

Retrotechnology Articles

Online Manuals

⇒ jconv(1J) — NEWS-os 4.2.1R

Media Vault

Software Library

Restoration Projects

Artifacts Sought

Related Articles

cat(1)

JCONV(1J)  —  NEWS-OS Programmer’s Manual

NAME

jconv − convert Kanji codes

SYNOPSIS

jconv [ option ] [ file ] ... 
jis file

DESCRIPTION

jconv reads the specified files in order, converts them into another Kanji code and outputs to the standard out. When the file is not specified, the standard input is used.

If no option is specified, jconv will automatically decide the input Kanji code and convert it into the internal Kanji code specified by the environment variable LC_CTYPE or LANG. 

The option is either −IO or −O, whose description is given below:

I  input code
sShift-JIS code
eExtended Unix code (EUC)
jJIS code (any desired escape sequence)
Noneany code (SJIS/EUC/JIS)
 O  output code
sShift-JIS code
eExtended Unix code (EUC)
jJIS code  (default  Kanji:ESC$B  Alphanumeric:ESC(J
               Kana:8bits  Gaiji:ESC$(D  )
j[KE][N[G]]
K = Kanji (double byte code)
BESC$B  JIS X0208-1983
@ESC$@  JIS C6226-1978
+ESC&@ESC$B  JIS X0208-1990
E = Alphanumeric (single byte code)
JESC(J  JIS X0201-1976 Roman
BESC(B  ASCII
HESC(H  Swedish name
N = Kana (single byte code)
8JIS X0201 8 bit code (0xa1-0xdf)
7JIS X0201 7 bit code with SO/SI
IESC(I  JIS X0201-1976 Katakana
kESC(I  JIS X0201-1976 Katakana
G = Gaiji (double byte code)
DESC$(D  JIS X0212-1990 Hojo kanji
CESC$(C  KS C5601-1987 Hangul
?ESC$(?  another gaiji

Options s, e, j, k can be in uppercase (S, E, J, K). 

The jis command converts the JIS code into the internal Kanji code.  jis is linked to jconv in this version. 

EXAMPLE

jconv −je Converts JIS code into EUC

jconv −sj7 Converts Shift-JIS code into JIS 7 bit code

jconv −ejBBI Converts EUC into JIS code (JIS-83 Kanji, ASCII, and JIS Katakana character-set)

jconv −j Converts any code into JIS 8 bit code

NOTES

The −t and −v options used in NEWS-OS Release 3.2 are no longer available.  Use cat(1), instead. 

The escape sequences generated by the options j and J , which specify the output code, have been changed to ESC$B and ESC(J, respectively. 

Options i, I, n, N, b, and B are retained as before for compatibility.  For this reason, −jI is equivalent to −js.  When you want to specify −jBJI, use this option or −jk.  −jBB can be substituted by −b. 

ESC(H is used to designate the character-set for Swedish names.  ESC(H is retained for the compatibility of those terminarls using the sequence as the JIS Roman character-set.  Normally, do not use ESC(H. 

When the output is specified to be converted into JIS code and the Kanji character-set is designated, the escape sequence designating Roman characters is sent before sending the control codes such as a newline character (0x0a), tab (0x09), space (0x20) and delete (0x7f).  This is a verbose process, but it improves the operativeness with the Unix commands, which process text by a line at a time. 

In EUC, Gaiji area corresponds to the code set 3 (three bytes starting from SS3): i.e. from 0x8fa1a1 to 0x8ffefe (8836 (94 x 94) characters), while in Shift-JIS, from 0xf040 to 0xfcfc (only 2444 (26 x 94) characters).  Therefore, when converting EUC codes to Shift-JIS code, some codes whose corresponding codes does not exist, are converted to 0xfcfc. 

SEE ALSO

cat(1)

NEWS-OSRelease 4.2.1R

Typewritten Software • bear@typewritten.org • Edmonds, WA 98026