123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158 |
- .TH ASCII 1
- .SH NAME
- ascii, unicode \- interpret ASCII, Unicode characters
- .SH SYNOPSIS
- .B ascii
- [
- .B -8cnt
- ]
- [
- .B -dox
- |
- .B -b
- .I n
- ]
- [
- .I text
- ]
- .PP
- .B unicode
- .IB hexmin - hexmax
- .PP
- .B unicode
- [
- .B -t
- ]
- .I hex
- [
- \&...
- ]
- .PP
- .B unicode
- [
- .B -n
- ]
- .I characters
- .PP
- .B look
- .I hex
- .B /lib/unicode
- .SH DESCRIPTION
- .I Ascii
- prints the
- .SM ASCII
- values corresponding to characters and
- .I vice
- .IR versa ;
- under the
- .B -8
- option, the
- .SM ISO
- Latin-1 extensions (codes 0200-0377) are included.
- The values are interpreted in a settable numeric base;
- .B -o
- specifies octal,
- .B -d
- decimal,
- .B -x
- hexadecimal (the default), and
- .BI -b n
- base
- .IR n .
- .PP
- With no arguments,
- .I ascii
- prints a table of the character set in the specified base.
- Characters of
- .I text
- are converted to their
- .SM ASCII
- values, one per line. If, however, the first
- .I text
- argument is a valid number in the specified base, conversion
- goes the opposite way.
- Control characters are printed as two- or three-character mnemonics.
- Other options are:
- .TP
- .B -n
- Force numeric output.
- .TP
- .B -c
- Force character output.
- .TP
- .B -t
- Convert from numbers to running text; do not interpret
- control characters or insert newlines.
- .PP
- .I Unicode
- is similar; it converts between
- .SM UTF
- and character values from the Unicode Standard (see
- .IR utf (6)).
- If given a range of hexadecimal numbers,
- .I unicode
- prints a table of the specified Unicode characters \(em their values and
- .SM UTF
- representations.
- Otherwise it translates from
- .SM UTF
- to numeric value or vice versa,
- depending on the appearance of the supplied text;
- the
- .B -n
- option forces numeric output to avoid ambiguity with numeric characters.
- If converting to
- .SM UTF ,
- the characters are printed one per line unless the
- .B -t
- flag is set, in which case the output is a single string
- containing only the specified characters.
- Unlike
- .IR ascii ,
- .I unicode
- treats no characters specially.
- .PP
- The output of
- .I ascii
- and
- .I unicode
- may be unhelpful if the characters printed are not available in the current font.
- .PP
- The file
- .B /lib/unicode
- contains a
- table of characters and descriptions, sorted in hexadecimal order,
- suitable for
- .IR look (1)
- on the lower case
- .I hex
- values of characters.
- .SH EXAMPLES
- .TP
- .B "ascii -d"
- Print the
- .SM ASCII
- table base 10.
- .TP
- .B "unicode p"
- Print the hex value of `p'.
- .TP
- .B "unicode 2200-22f1"
- Print a table of miscellaneous mathematical symbols.
- .TP
- .B "look 039 /lib/unicode"
- See the start of the Greek alphabet's encoding in the Unicode Standard.
- .SH FILES
- .TF \fL/lib/unicode
- .TP
- .B /lib/unicode
- table of characters and descriptions.
- .SH SOURCE
- .B /sys/src/cmd/ascii.c
- .br
- .B /sys/src/cmd/unicode.c
- .SH "SEE ALSO"
- .IR look (1),
- .IR tcs (1),
- .IR utf (6),
- .IR font (6)
|