Unicode

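"Unicode provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language."
From the Korean edition of "What is Unicode?", Unicode.org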

Unicode is a character encoding standard designed so that every writing system in the world can be processed on computers. The standard is defined and maintained by the Unicode Consortium, and as of October 15, 2023 the latest version is Unicode 15.1.
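As a small illustration of the "unique number for every character" idea, the sketch below prints the code point of a few characters using Python's built-in `ord`. The helper name `code_point_label` is made up for this example and is not part of any standard API.

```python
def code_point_label(ch: str) -> str:
    """Return the conventional U+XXXX label for a single character."""
    # ord() gives the code point, which is the same on every platform
    # and does not depend on how the text is later encoded to bytes.
    return f"U+{ord(ch):04X}"


for ch in "A한😀":
    print(ch, code_point_label(ch))
```

The labels use at least four hexadecimal digits, matching the notation in the block list.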

List of Unicode blocks

Unicode 15.1 defines 328 blocks.

Unicode blocks (as of 15.1)
Code range Block name Characters/Block size
U+0000 ~ U+FFFF Basic Multilingual Plane (BMP) 55,639/65,520[1]
U+0000 ~ U+007F Basic Latin 128/128
U+0080 ~ U+00FF Latin-1 Supplement 128/128
U+0100 ~ U+017F Latin Extended-A 128/128
U+0180 ~ U+024F Latin Extended-B 208/208
U+0250 ~ U+02AF IPA Extensions 96/96
U+02B0 ~ U+02FF Spacing Modifier Letters 80/80
U+0300 ~ U+036F Combining Diacritical Marks 112/112
U+0370 ~ U+03FF Greek and Coptic 135/144
U+0400 ~ U+04FF Cyrillic 256/256
U+0500 ~ U+052F Cyrillic Supplement 48/48
U+0530 ~ U+058F Armenian 91/96
U+0590 ~ U+05FF Hebrew 88/112
U+0600 ~ U+06FF Arabic 256/256
U+0700 ~ U+074F Syriac 77/80
U+0750 ~ U+077F Arabic Supplement 48/48
U+0780 ~ U+07BF Thaana 50/64
U+07C0 ~ U+07FF NKo 62/64
U+0800 ~ U+083F Samaritan 61/64
U+0840 ~ U+085F Mandaic 29/32
U+0860 ~ U+086F Syriac Supplement 11/16
U+0870 ~ U+089F Arabic Extended-B 41/48
U+08A0 ~ U+08FF Arabic Extended-A 96/96
U+0900 ~ U+097F Devanagari 128/128
U+0980 ~ U+09FF Bengali 96/128
U+0A00 ~ U+0A7F Gurmukhi 80/128
U+0A80 ~ U+0AFF Gujarati 91/128
U+0B00 ~ U+0B7F Oriya 91/128
U+0B80 ~ U+0BFF Tamil 72/128
U+0C00 ~ U+0C7F Telugu 100/128
U+0C80 ~ U+0CFF Kannada 91/128
U+0D00 ~ U+0D7F Malayalam 118/128
U+0D80 ~ U+0DFF Sinhala 91/128
U+0E00 ~ U+0E7F Thai 87/128
U+0E80 ~ U+0EFF Lao 83/128
U+0F00 ~ U+0FFF Tibetan 211/256
U+1000 ~ U+109F Myanmar 160/160
U+10A0 ~ U+10FF Georgian 88/96
U+1100 ~ U+11FF Hangul Jamo 256/256
U+1200 ~ U+137F Ethiopic 358/384
U+1380 ~ U+139F Ethiopic Supplement 26/32
U+13A0 ~ U+13FF Cherokee 92/96
U+1400 ~ U+167F Unified Canadian Aboriginal Syllabics 640/640
U+1680 ~ U+169F Ogham 29/32
U+16A0 ~ U+16FF Runic 89/96
U+1700 ~ U+171F Tagalog 23/32
U+1720 ~ U+173F Hanunoo 23/32
U+1740 ~ U+175F Buhid 20/32
U+1760 ~ U+177F Tagbanwa 18/32
U+1780 ~ U+17FF Khmer 114/128
U+1800 ~ U+18AF Mongolian 158/176
U+18B0 ~ U+18FF Unified Canadian Aboriginal Syllabics Extended 70/80
U+1900 ~ U+194F Limbu 68/80
U+1950 ~ U+197F Tai Le 35/48
U+1980 ~ U+19DF New Tai Lue 83/96
U+19E0 ~ U+19FF Khmer Symbols 32/32
U+1A00 ~ U+1A1F Buginese 30/32
U+1A20 ~ U+1AAF Tai Tham 127/144
U+1AB0 ~ U+1AFF Combining Diacritical Marks Extended 31/80
U+1B00 ~ U+1B7F Balinese 124/128
U+1B80 ~ U+1BBF Sundanese 64/64
U+1BC0 ~ U+1BFF Batak 56/64
U+1C00 ~ U+1C4F Lepcha 74/80
U+1C50 ~ U+1C7F Ol Chiki 48/48
U+1C80 ~ U+1C8F Cyrillic Extended-C 9/16
U+1C90 ~ U+1CBF Georgian Extended 46/48
U+1CC0 ~ U+1CCF Sundanese Supplement 8/16
U+1CD0 ~ U+1CFF Vedic Extensions 43/48
U+1D00 ~ U+1D7F Phonetic Extensions 128/128
U+1D80 ~ U+1DBF Phonetic Extensions Supplement 64/64
U+1DC0 ~ U+1DFF Combining Diacritical Marks Supplement 64/64
U+1E00 ~ U+1EFF Latin Extended Additional 256/256
U+1F00 ~ U+1FFF Greek Extended 233/256
U+2000 ~ U+206F General Punctuation 111/112
U+2070 ~ U+209F Superscripts and Subscripts 42/48
U+20A0 ~ U+20CF Currency Symbols 33/48
U+20D0 ~ U+20FF Combining Diacritical Marks for Symbols 33/48
U+2100 ~ U+214F Letterlike Symbols 80/80
U+2150 ~ U+218F Number Forms 60/64
U+2190 ~ U+21FF Arrows 112/112
U+2200 ~ U+22FF Mathematical Operators 256/256
U+2300 ~ U+23FF Miscellaneous Technical 256/256
U+2400 ~ U+243F Control Pictures 39/64
U+2440 ~ U+245F Optical Character Recognition 11/32
U+2460 ~ U+24FF Enclosed Alphanumerics 160/160
U+2500 ~ U+257F Box Drawing 128/128
U+2580 ~ U+259F Block Elements 32/32
U+25A0 ~ U+25FF Geometric Shapes 96/96
U+2600 ~ U+26FF Miscellaneous Symbols 256/256
U+2700 ~ U+27BF Dingbats 192/192
U+27C0 ~ U+27EF Miscellaneous Mathematical Symbols-A 48/48
U+27F0 ~ U+27FF Supplemental Arrows-A 16/16
U+2800 ~ U+28FF Braille Patterns 256/256
U+2900 ~ U+297F Supplemental Arrows-B 128/128
U+2980 ~ U+29FF Miscellaneous Mathematical Symbols-B 128/128
U+2A00 ~ U+2AFF Supplemental Mathematical Operators 256/256
U+2B00 ~ U+2BFF Miscellaneous Symbols and Arrows 253/256
U+2C00 ~ U+2C5F Glagolitic 96/96
U+2C60 ~ U+2C7F Latin Extended-C 32/32
U+2C80 ~ U+2CFF Coptic 123/128
U+2D00 ~ U+2D2F Georgian Supplement 40/48
U+2D30 ~ U+2D7F Tifinagh 59/80
U+2D80 ~ U+2DDF Ethiopic Extended 79/96
U+2DE0 ~ U+2DFF Cyrillic Extended-A 32/32
U+2E00 ~ U+2E7F Supplemental Punctuation 94/128
U+2E80 ~ U+2EFF CJK Radicals Supplement 115/128
U+2F00 ~ U+2FDF Kangxi Radicals 214/224
U+2FF0 ~ U+2FFF Ideographic Description Characters 16/16
U+3000 ~ U+303F CJK Symbols and Punctuation 64/64
U+3040 ~ U+309F Hiragana 93/96
U+30A0 ~ U+30FF Katakana 96/96
U+3100 ~ U+312F Bopomofo 43/48
U+3130 ~ U+318F Hangul Compatibility Jamo 94/96
U+3190 ~ U+319F Kanbun 16/16
U+31A0 ~ U+31BF Bopomofo Extended 32/32
U+31C0 ~ U+31EF CJK Strokes 37/48
U+31F0 ~ U+31FF Katakana Phonetic Extensions 16/16
U+3200 ~ U+32FF Enclosed CJK Letters and Months 255/256
U+3300 ~ U+33FF CJK Compatibility 256/256
U+3400 ~ U+4DBF CJK Unified Ideographs Extension A 6,592/6,592
U+4DC0 ~ U+4DFF Yijing Hexagram Symbols 64/64
U+4E00 ~ U+9FFF CJK Unified Ideographs 20,992/20,992
U+A000 ~ U+A48F Yi Syllables 1,165/1,168
U+A490 ~ U+A4CF Yi Radicals 55/64
U+A4D0 ~ U+A4FF Lisu 48/48
U+A500 ~ U+A63F Vai 300/320
U+A640 ~ U+A69F Cyrillic Extended-B 96/96
U+A6A0 ~ U+A6FF Bamum 88/96
U+A700 ~ U+A71F Modifier Tone Letters 32/32
U+A720 ~ U+A7FF Latin Extended-D 193/224
U+A800 ~ U+A82F Syloti Nagri 45/48
U+A830 ~ U+A83F Common Indic Number Forms 10/16
U+A840 ~ U+A87F Phags-pa 56/64
U+A880 ~ U+A8DF Saurashtra 82/96
U+A8E0 ~ U+A8FF Devanagari Extended 32/32
U+A900 ~ U+A92F Kayah Li 48/48
U+A930 ~ U+A95F Rejang 37/48
U+A960 ~ U+A97F Hangul Jamo Extended-A 29/32
U+A980 ~ U+A9DF Javanese 91/96
U+A9E0 ~ U+A9FF Myanmar Extended-B 31/32
U+AA00 ~ U+AA5F Cham 83/96
U+AA60 ~ U+AA7F Myanmar Extended-A 32/32
U+AA80 ~ U+AADF Tai Viet 72/96
U+AAE0 ~ U+AAFF Meetei Mayek Extensions 23/32
U+AB00 ~ U+AB2F Ethiopic Extended-A 32/48
U+AB30 ~ U+AB6F Latin Extended-E 60/64
U+AB70 ~ U+ABBF Cherokee Supplement 80/80
U+ABC0 ~ U+ABFF Meetei Mayek 56/64
U+AC00 ~ U+D7AF Hangul Syllables 11,172/11,184
U+D7B0 ~ U+D7FF Hangul Jamo Extended-B 72/80
U+D800 ~ U+DB7F High Surrogates 0/896
U+DB80 ~ U+DBFF High Private Use Surrogates 0/128
U+DC00 ~ U+DFFF Low Surrogates 0/1,024
U+E000 ~ U+F8FF Private Use Area 6,400/6,400
U+F900 ~ U+FAFF CJK Compatibility Ideographs 472/512
U+FB00 ~ U+FB4F Alphabetic Presentation Forms 58/80
U+FB50 ~ U+FDFF Arabic Presentation Forms-A 631/688
U+FE00 ~ U+FE0F Variation Selectors 16/16
U+FE10 ~ U+FE1F Vertical Forms 10/16
U+FE20 ~ U+FE2F Combining Half Marks 16/16
U+FE30 ~ U+FE4F CJK Compatibility Forms 32/32
U+FE50 ~ U+FE6F Small Form Variants 26/32
U+FE70 ~ U+FEFF Arabic Presentation Forms-B 141/144
U+FF00 ~ U+FFEF Halfwidth and Fullwidth Forms 225/240
U+FFF0 ~ U+FFFF Specials 5/16
U+10000 ~ U+1FFFF Supplementary Multilingual Plane (SMP) 23,276/26,160
U+10000 ~ U+1007F Linear B Syllabary 88/128
U+10080 ~ U+100FF Linear B Ideograms 123/128
U+10100 ~ U+1013F Aegean Numbers 57/64
U+10140 ~ U+1018F Ancient Greek Numbers 79/80
U+10190 ~ U+101CF Ancient Symbols 14/64
U+101D0 ~ U+101FF Phaistos Disc 46/48
U+10280 ~ U+1029F Lycian 29/32
U+102A0 ~ U+102DF Carian 49/64
U+102E0 ~ U+102FF Coptic Epact Numbers 28/32
U+10300 ~ U+1032F Old Italic 39/48
U+10330 ~ U+1034F Gothic 27/32
U+10350 ~ U+1037F Old Permic 43/48
U+10380 ~ U+1039F Ugaritic 31/32
U+103A0 ~ U+103DF Old Persian 50/64
U+10400 ~ U+1044F Deseret 80/80
U+10450 ~ U+1047F Shavian 48/48
U+10480 ~ U+104AF Osmanya 40/48
U+104B0 ~ U+104FF Osage 72/80
U+10500 ~ U+1052F Elbasan 40/48
U+10530 ~ U+1056F Caucasian Albanian 53/64
U+10570 ~ U+105BF Vithkuqi 70/80
U+10600 ~ U+1077F Linear A 341/384
U+10780 ~ U+107BF Latin Extended-F 57/64
U+10800 ~ U+1083F Cypriot Syllabary 55/64
U+10840 ~ U+1085F Imperial Aramaic 31/32
U+10860 ~ U+1087F Palmyrene 32/32
U+10880 ~ U+108AF Nabataean 40/48
U+108E0 ~ U+108FF Hatran 26/32
U+10900 ~ U+1091F Phoenician 29/32
U+10920 ~ U+1093F Lydian 27/32
U+10980 ~ U+1099F Meroitic Hieroglyphs 32/32
U+109A0 ~ U+109FF Meroitic Cursive 90/96
U+10A00 ~ U+10A5F Kharoshthi 68/96
U+10A60 ~ U+10A7F Old South Arabian 32/32
U+10A80 ~ U+10A9F Old North Arabian 32/32
U+10AC0 ~ U+10AFF Manichaean 51/64
U+10B00 ~ U+10B3F Avestan 61/64
U+10B40 ~ U+10B5F Inscriptional Parthian 30/32
U+10B60 ~ U+10B7F Inscriptional Pahlavi 27/32
U+10B80 ~ U+10BAF Psalter Pahlavi 29/48
U+10C00 ~ U+10C4F Old Turkic 73/80
U+10C80 ~ U+10CFF Old Hungarian 108/128
U+10D00 ~ U+10D3F Hanifi Rohingya 50/64
U+10E60 ~ U+10E7F Rumi Numeral Symbols 31/32
U+10E80 ~ U+10EBF Yezidi 47/64
U+10EC0 ~ U+10EFF Arabic Extended-C 3/64
U+10F00 ~ U+10F2F Old Sogdian 40/48
U+10F30 ~ U+10F6F Sogdian 42/64
U+10F70 ~ U+10FAF Old Uyghur 26/64
U+10FB0 ~ U+10FDF Chorasmian 28/48
U+10FE0 ~ U+10FFF Elymaic 23/32
U+11000 ~ U+1107F Brahmi 115/128
U+11080 ~ U+110CF Kaithi 68/80
U+110D0 ~ U+110FF Sora Sompeng 35/48
U+11100 ~ U+1114F Chakma 71/80
U+11150 ~ U+1117F Mahajani 39/48
U+11180 ~ U+111DF Sharada 96/96
U+111E0 ~ U+111FF Sinhala Archaic Numbers 20/32
U+11200 ~ U+1124F Khojki 65/80
U+11280 ~ U+112AF Multani 38/48
U+112B0 ~ U+112FF Khudawadi 69/80
U+11300 ~ U+1137F Grantha 86/128
U+11400 ~ U+1147F Newa 97/128
U+11480 ~ U+114DF Tirhuta 82/96
U+11580 ~ U+115FF Siddham 92/128
U+11600 ~ U+1165F Modi 79/96
U+11660 ~ U+1167F Mongolian Supplement 13/32
U+11680 ~ U+116CF Takri 68/80
U+11700 ~ U+1174F Ahom 65/80
U+11800 ~ U+1184F Dogra 60/80
U+118A0 ~ U+118FF Warang Citi 84/96
U+11900 ~ U+1195F Dives Akuru 72/96
U+119A0 ~ U+119FF Nandinagari 65/96
U+11A00 ~ U+11A4F Zanabazar Square 72/80
U+11A50 ~ U+11AAF Soyombo 83/96
U+11AB0 ~ U+11ABF Unified Canadian Aboriginal Syllabics Extended-A 16/16
U+11AC0 ~ U+11AFF Pau Cin Hau 57/64
U+11B00 ~ U+11B5F Devanagari Extended-A 10/96
U+11C00 ~ U+11C6F Bhaiksuki 97/112
U+11C70 ~ U+11CBF Marchen 68/80
U+11D00 ~ U+11D5F Masaram Gondi 75/96
U+11D60 ~ U+11DAF Gunjala Gondi 63/80
U+11EE0 ~ U+11EFF Makasar 25/32
U+11F00 ~ U+11F5F Kawi 86/96
U+11FB0 ~ U+11FBF Lisu Supplement 1/16
U+11FC0 ~ U+11FFF Tamil Supplement 51/64
U+12000 ~ U+123FF Cuneiform 922/1,024
U+12400 ~ U+1247F Cuneiform Numbers and Punctuation 116/128
U+12480 ~ U+1254F Early Dynastic Cuneiform 196/208
U+12F90 ~ U+12FFF Cypro-Minoan 99/112
U+13000 ~ U+1342F Egyptian Hieroglyphs 1,072/1,072
U+13430 ~ U+1345F Egyptian Hieroglyph Format Controls 38/48
U+14400 ~ U+1467F Anatolian Hieroglyphs 583/640
U+16800 ~ U+16A3F Bamum Supplement 569/576
U+16A40 ~ U+16A6F Mro 43/48
U+16A70 ~ U+16ACF Tangsa 89/96
U+16AD0 ~ U+16AFF Bassa Vah 36/48
U+16B00 ~ U+16B8F Pahawh Hmong 127/144
U+16E40 ~ U+16E9F Medefaidrin 91/96
U+16F00 ~ U+16F9F Miao 149/160
U+16FE0 ~ U+16FFF Ideographic Symbols and Punctuation 7/32
U+17000 ~ U+187FF Tangut 6,136/6,144
U+18800 ~ U+18AFF Tangut Components 768/768
U+18B00 ~ U+18CFF Khitan Small Script 470/512
U+18D00 ~ U+18D7F Tangut Supplement 9/128
U+1AFF0 ~ U+1AFFF Kana Extended-B 13/16
U+1B000 ~ U+1B0FF Kana Supplement 256/256
U+1B100 ~ U+1B12F Kana Extended-A 35/48
U+1B130 ~ U+1B16F Small Kana Extension 9/64
U+1B170 ~ U+1B2FF Nushu 396/400
U+1BC00 ~ U+1BC9F Duployan 143/160
U+1BCA0 ~ U+1BCAF Shorthand Format Controls 4/16
U+1CF00 ~ U+1CFCF Znamenny Musical Notation 185/208
U+1D000 ~ U+1D0FF Byzantine Musical Symbols 246/256
U+1D100 ~ U+1D1FF Musical Symbols 233/256
U+1D200 ~ U+1D24F Ancient Greek Musical Notation 70/80
U+1D2C0 ~ U+1D2DF Kaktovik Numerals 20/32
U+1D2E0 ~ U+1D2FF Mayan Numerals 20/32
U+1D300 ~ U+1D35F Tai Xuan Jing Symbols 87/96
U+1D360 ~ U+1D37F Counting Rod Numerals 25/32
U+1D400 ~ U+1D7FF Mathematical Alphanumeric Symbols 996/1,024
U+1D800 ~ U+1DAAF Sutton SignWriting 672/688
U+1DF00 ~ U+1DFFF Latin Extended-G 37/256
U+1E000 ~ U+1E02F Glagolitic Supplement 38/48
U+1E030 ~ U+1E08F Cyrillic Extended-D 63/96
U+1E100 ~ U+1E14F Nyiakeng Puachue Hmong 71/80
U+1E290 ~ U+1E2BF Toto 31/48
U+1E2C0 ~ U+1E2FF Wancho 59/64
U+1E4D0 ~ U+1E4FF Nag Mundari 42/48
U+1E7E0 ~ U+1E7FF Ethiopic Extended-B 28/32
U+1E800 ~ U+1E8DF Mende Kikakui 213/224
U+1E900 ~ U+1E95F Adlam 88/96
U+1EC70 ~ U+1ECBF Indic Siyaq Numbers 68/80
U+1ED00 ~ U+1ED4F Ottoman Siyaq Numbers 61/80
U+1EE00 ~ U+1EEFF Arabic Mathematical Alphabetic Symbols 143/256
U+1F000 ~ U+1F02F Mahjong Tiles 44/48
U+1F030 ~ U+1F09F Domino Tiles 100/112
U+1F0A0 ~ U+1F0FF Playing Cards 82/96
U+1F100 ~ U+1F1FF Enclosed Alphanumeric Supplement 200/256
U+1F200 ~ U+1F2FF Enclosed Ideographic Supplement 64/256
U+1F300 ~ U+1F5FF Miscellaneous Symbols and Pictographs 768/768
U+1F600 ~ U+1F64F Emoticons 80/80
U+1F650 ~ U+1F67F Ornamental Dingbats 48/48
U+1F680 ~ U+1F6FF Transport and Map Symbols 118/128
U+1F700 ~ U+1F77F Alchemical Symbols 124/128
U+1F780 ~ U+1F7FF Geometric Shapes Extended 103/128
U+1F800 ~ U+1F8FF Supplemental Arrows-C 150/256
U+1F900 ~ U+1F9FF Supplemental Symbols and Pictographs 256/256
U+1FA00 ~ U+1FA6F Chess Symbols 98/112
U+1FA70 ~ U+1FAFF Symbols and Pictographs Extended-A 107/144
U+1FB00 ~ U+1FBFF Symbols for Legacy Computing 212/256
U+20000 ~ U+2FFFF Supplementary Ideographic Plane (SIP) 61,495/61,536
U+20000 ~ U+2A6DF CJK Unified Ideographs Extension B 42,720/42,720
U+2A700 ~ U+2B73F CJK Unified Ideographs Extension C 4,154/4,160
U+2B740 ~ U+2B81F CJK Unified Ideographs Extension D 222/224
U+2B820 ~ U+2CEAF CJK Unified Ideographs Extension E 5,762/5,776
U+2CEB0 ~ U+2EBEF CJK Unified Ideographs Extension F 7,473/7,488
U+2EBF0 ~ U+2EE5F CJK Unified Ideographs Extension I 622/624
U+2F800 ~ U+2FA1F CJK Compatibility Ideographs Supplement 542/544
U+30000 ~ U+3FFFF Tertiary Ideographic Plane (TIP) 9,131/9,136
U+30000 ~ U+3134F CJK Unified Ideographs Extension G 4,939/4,944
U+31350 ~ U+323AF CJK Unified Ideographs Extension H 4,192/4,192
U+E0000 ~ U+EFFFF Supplementary Special-purpose Plane (SSP) 337/368
U+E0000 ~ U+E007F Tags 97/128
U+E0100 ~ U+E01EF Variation Selectors Supplement 240/240
U+F0000 ~ U+FFFFF Supplementary Private Use Area-A plane (PUA-A)
U+F0000 ~ U+FFFFF Supplementary Private Use Area-A 65,534/65,536
U+100000 ~ U+10FFFF Supplementary Private Use Area-B plane (PUA-B)
U+100000 ~ U+10FFFF Supplementary Private Use Area-B 65,534/65,536
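The table groups blocks by plane, and each plane spans 65,536 code points, so the plane number is the code point shifted right by 16 bits. The following is a minimal sketch of that lookup and of how a block size follows from its range. The names `PLANE_NAMES`, `plane_of`, and `block_size` are hypothetical helpers written for this example, and the abbreviations are the ones used in the table.

```python
# Plane abbreviations as they appear in the block list.
PLANE_NAMES = {
    0: "BMP", 1: "SMP", 2: "SIP", 3: "TIP",
    14: "SSP", 15: "PUA-A", 16: "PUA-B",
}


def plane_of(code_point: int) -> str:
    """Return the abbreviation of the plane that contains code_point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the Unicode code space")
    plane = code_point >> 16  # each plane holds 0x10000 code points
    return PLANE_NAMES.get(plane, f"unassigned plane {plane}")


def block_size(first: int, last: int) -> int:
    """Number of code points in an inclusive range such as U+0180 ~ U+024F."""
    return last - first + 1


print(plane_of(0xAC00), plane_of(0x1F600), plane_of(0x20000))
print(block_size(0x0180, 0x024F))  # Latin Extended-B
```

The second number in each table row is this range size. The first number is the count of assigned characters, which comes from the Unicode character data and cannot be derived from the range.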

Unicode encodings

UTF-8

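UTF-8 is currently the most widely used encoding. It is a variable-width encoding: a character takes 1 to 4 bytes depending on its code point range. Characters in the Basic Latin block are encoded in a single byte, so UTF-8 is fully compatible with ASCII.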
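The relationship between code point range and byte count can be sketched as follows. `utf8_length` is a hypothetical helper written for this example. It applies the standard UTF-8 range boundaries and is checked against Python's own encoder.

```python
def utf8_length(code_point: int) -> int:
    """Number of bytes UTF-8 uses for a Unicode scalar value."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if code_point < 0x80:      # Basic Latin: same bytes as ASCII
        return 1
    if code_point < 0x800:
        return 2
    if code_point < 0x10000:   # rest of the BMP
        return 3
    return 4                   # supplementary planes


for ch in "Aé한😀":
    encoded = ch.encode("utf-8")
    # The built-in encoder agrees with the range-based rule.
    assert len(encoded) == utf8_length(ord(ch))
    print(f"U+{ord(ch):04X} -> {len(encoded)} byte(s): {encoded.hex()}")
```

For any Basic Latin character the UTF-8 bytes are identical to the ASCII bytes, which is the compatibility described above.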

UTF-16

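UTF-16 is a variable-width encoding that uses 2 bytes for characters in the BMP and 4 bytes (a surrogate pair) for all other characters. A well-known user is the Windows NT family of operating systems, which stores text internally in 16-bit code units.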
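The 2-byte versus 4-byte split can be illustrated with a short sketch of the surrogate-pair arithmetic. `utf16_code_units` is a hypothetical helper for this example. It shows how a code point outside the BMP is split across the High Surrogates and Low Surrogates ranges listed in the block table.

```python
def utf16_code_units(code_point: int) -> list:
    """Return the 16-bit code units UTF-16 uses for one code point."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if code_point < 0x10000:
        return [code_point]               # BMP: one unit, 2 bytes
    offset = code_point - 0x10000         # 20 bits remain
    high = 0xD800 + (offset >> 10)        # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)       # bottom 10 bits
    return [high, low]                    # surrogate pair, 4 bytes


for ch in "한😀":
    units = utf16_code_units(ord(ch))
    print(f"U+{ord(ch):04X} ->", " ".join(f"{u:04X}" for u in units))
```

The surrogate ranges hold no characters of their own, which is why those blocks show 0 assigned characters in the table.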

Unicode and Hangul

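Unicode includes both the 11,172 precomposed Hangul syllables and the conjoining Hangul jamo (individual letters). Because text written as jamo sequences takes up more space, the jamo are used only for special purposes such as writing Old Hangul, and ordinary Hangul text is encoded with precomposed syllables.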
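The size difference between the two representations can be observed with Unicode normalization: NFC yields precomposed syllables and NFD yields conjoining jamo. The sketch below uses Python's standard `unicodedata` module. `hangul_sizes` is a hypothetical helper name chosen for this example.

```python
import unicodedata


def hangul_sizes(text: str, form: str) -> tuple:
    """Return (code point count, UTF-8 byte count) after normalizing to form."""
    normalized = unicodedata.normalize(form, text)
    return len(normalized), len(normalized.encode("utf-8"))


# NFC keeps one code point per syllable; NFD splits each into its jamo.
print("NFC:", hangul_sizes("한글", "NFC"))  # (2, 6)
print("NFD:", hangul_sizes("한글", "NFD"))  # (6, 18)
```

Both forms display the same text. In this example the decomposed form is three times as large because each syllable has three jamo.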

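Before Unicode was adopted, South Korea used the EUC-KR encoding, which is based on the KS X 1001 precomposed (Wansung) character set. That set contains only 2,350 Hangul syllables, which is not enough to write even standard Korean in full. Microsoft placed the remaining syllables in CP949 (Unified Hangul Code), but because they were squeezed into areas separate from the KS X 1001 region, the resulting code layout is rather untidy.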

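Unicode has supported Hangul since version 1.0. However, version 1.0 included only the 2,350 syllables of KS X 1001, and version 1.1 added 4,306 more, so only 6,656 syllables could be represented. The South Korean delegation therefore asked that all 11,172 modern Hangul syllables be added anew when Unicode 2.0 was drafted, and version 2.0 allocated the full set of precomposed syllables in the arrangement used today.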
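The figure 11,172 follows from the structure of modern Hangul: 19 initial consonants, 21 vowels, and 28 final positions (27 final consonants plus "no final"). The sketch below assumes the arithmetic layout that the Unicode Standard documents for the Hangul Syllables block starting at U+AC00. The constant and function names are chosen for this example.

```python
S_BASE = 0xAC00                            # first code point of Hangul Syllables
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28     # initials, vowels, finals (incl. none)


def compose_syllable(initial: int, vowel: int, final: int = 0) -> str:
    """Build a precomposed syllable from zero-based jamo indices."""
    if not (0 <= initial < L_COUNT and 0 <= vowel < V_COUNT and 0 <= final < T_COUNT):
        raise ValueError("jamo index out of range")
    return chr(S_BASE + (initial * V_COUNT + vowel) * T_COUNT + final)


print(L_COUNT * V_COUNT * T_COUNT)   # total number of modern syllables
print(2350 + 4306)                   # syllables available before Unicode 2.0
print(compose_syllable(0, 0), compose_syllable(18, 0, 4), compose_syllable(18, 20, 27))
```

Because the syllables are laid out in this regular order, the block's 11,172 characters run contiguously from U+AC00 to U+D7A3.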

Footnotes

  1. Character count excluding the Private Use Area.