Monday, January 17, 2011

ASCII Table and Description

ASCII Table and Description

ASCII stands for American Standard Code for Information Interchange. Computers can only understand numbers, so an ASCII code is the numerical representation of a character such as 'a' or '@' or an action of some sort. ASCII was developed a long time ago and now the non-printing characters are rarely used for their original purpose. Below is the ASCII character table and this includes descriptions of the first 32 non-printing characters. ASCII was actually designed for use with teletypes and so the descriptions are somewhat obscure. If someone says they want your CV however in ASCII format, all this means is they want 'plain' text with no formatting such as tabs, bold or underscoring - the raw format that any computer can understand. This is usually so they can easily import the file into their own applications without issues. Notepad.exe creates ASCII text, or in MS 


BinaryOctDecHexAbbr[t 1][t 2][t 3]Description
000 0000000000NUL^@\0Null character
000 0001001101SOH^AStart of Header
000 0010002202STX^BStart of Text
000 0011003303ETX^CEnd of Text
000 0100004404EOT^DEnd of Transmission
000 0101005505ENQ^EEnquiry
000 0110006606ACK^FAcknowledgment
000 0111007707BEL^G\aBell
000 1000010808BS^H\bBackspace[t 4][t 5]
000 1001011909HT^I\tHorizontal Tab[t 6]
000 1010012100ALF^J\nLine feed
000 1011013110BVT^K\vVertical Tab
000 1100014120CFF^L\fForm feed
000 1101015130DCR^M\rCarriage return[t 7]
000 1110016140ESO^NShift Out
000 1111017150FSI^OShift In
001 00000201610DLE^PData Link Escape
001 00010211711DC1^QDevice Control 1 (oft. XON)
001 00100221812DC2^RDevice Control 2
001 00110231913DC3^SDevice Control 3 (oft. XOFF)
001 01000242014DC4^TDevice Control 4
001 01010252115NAK^UNegative Acknowledgement
001 01100262216SYN^VSynchronous idle
001 01110272317ETB^WEnd of Transmission Block
001 10000302418CAN^XCancel
001 10010312519EM^YEnd of Medium
001 1010032261ASUB^ZSubstitute
001 1011033271BESC^[\e[t 8]Escape[t 9]
001 1100034281CFS^\File Separator
001 1101035291DGS^]Group Separator
001 1110036301ERS^^[t 10]Record Separator
001 1111037311FUS^_Unit Separator
111 11111771277FDEL^?Delete[t 11][t 5]

  1. ^ The Unicode characters from the area U+2400 to U+2421 reserved for representing control characters when it is necessary to print or display them rather than have them perform their intended function. Some browsers may not display these properly.
  2. ^ Caret notation often used to represent control characters. This also indicates the key sequence to input the character traditionally on most text terminals: The caret (^) that begins these sequences represents holding down the "Ctrl" key while typing the second character.
  3. ^ Character Escape Codes in C programming language and many other languages influenced by it, such as Java and Perl (though not all implementations necessarily support all escape codes).
  4. ^ The Backspace character can also be entered by pressing the "Backspace", "Bksp", or ← key on some systems.
  5. a b The ambiguity of Backspace is due to early terminals designed assuming the main use of the keyboard would be to manually punch paper tape while not connected to a computer. To delete the previous character you had to back up the paper tape punch, which for mechanical and simplicity reasons was a button on the punch itself and not the keyboard, then type the rubout character. They therefore placed a key producing rubout at the location used on typewriters for backspace. When systems used these terminals and provided command-line editing, they had to use the "rubout" code to perform a backspace, and often did not interpret the backspace character (they might echo "^H" for backspace). Other terminals not designed for paper tape made the key at this location produce Backspace, and systems designed for these used that character to back up. Since the delete code often produced a backspace effect, this also forced terminal manufacturers to make any "Delete" key produce something other than the Delete character.
  6. ^ The Tab character can also be entered by pressing the "Tab" key on most systems.
  7. ^ The Carriage Return character can also be entered by pressing the "Return", "Ret", "Enter", or  key on most systems.
  8. ^ The '\e' escape sequence is not part of ISO C and many other language specifications. However, it is understood by several compilers.
  9. ^ The Escape character can also be entered by pressing the "Escape" or "Esc" key on some systems.
  10. ^ ^^ means Control-Caret (pressing the "Ctrl" and "^" keys), not Control-Control.
  11. ^ The Delete character can sometimes be entered by pressing the "Backspace", "Bksp", or ← key on some systems.

[edit]ASCII printable characters

Codes 0x21 to 0x7E, known as the printable characters, represent letters, digits, punctuation marks, and a few miscellaneous symbols.
Code 0x20, the space character, denotes the space between words, as produced by the space-bar of a keyboard. Since the space character is considered an invisible graphic (rather than a control character)[8] and thus would not normally be visible, it is represented here by Unicode character U+2420 "␠"; Unicode characters U+2422 "␢" or U+2423 "␣" are also available for use when a visible representation of a space is necessary.
Code 0x7F corresponds to the non-printable "Delete" (DEL) control character and is therefore omitted from this chart; it is covered in the previous section's chart.
BinaryOctDecHexGlyph
010 00000403220
010 00010413321!
010 00100423422"
010 00110433523#
010 01000443624$
010 01010453725%
010 01100463826&
010 01110473927'
010 10000504028(
010 10010514129)
010 1010052422A*
010 1011053432B+
010 1100054442C,
010 1101055452D-
010 1110056462E.
010 1111057472F/
011 000006048300
011 000106149311
011 001006250322
011 001106351333
011 010006452344
011 010106553355
011 011006654366
011 011106755377
011 100007056388
011 100107157399
011 1010072583A:
011 1011073593B;
011 1100074603C<
011 1101075613D=
011 1110076623E>
011 1111077633F?
BinaryOctDecHexGlyph
100 00001006440@
100 00011016541A
100 00101026642B
100 00111036743C
100 01001046844D
100 01011056945E
100 01101067046F
100 01111077147G
100 10001107248H
100 10011117349I
100 1010112744AJ
100 1011113754BK
100 1100114764CL
100 1101115774DM
100 1110116784EN
100 1111117794FO
101 00001208050P
101 00011218151Q
101 00101228252R
101 00111238353S
101 01001248454T
101 01011258555U
101 01101268656V
101 01111278757W
101 10001308858X
101 10011318959Y
101 1010132905AZ
101 1011133915B[
101 1100134925C\
101 1101135935D]
101 1110136945E^
101 1111137955F_
BinaryOctDecHexGlyph
110 00001409660`
110 00011419761a
110 00101429862b
110 00111439963c
110 010014410064d
110 010114510165e
110 011014610266f
110 011114710367g
110 100015010468h
110 100115110569i
110 10101521066Aj
110 10111531076Bk
110 11001541086Cl
110 11011551096Dm
110 11101561106En
110 11111571116Fo
111 000016011270p
111 000116111371q
111 001016211472r
111 001116311573s
111 010016411674t
111 010116511775u
111 011016611876v
111 011116711977w
111 100017012078x
111 100117112179y
111 10101721227Az
111 10111731237B{
111 11001741247C|
111 11011751257D}
111 11101761267E~

Extended ASCII Codes

EBCDIC and IBM Scan Codes

No comments:

Post a Comment