Monday, February 6, 2012

Unicode Code Ranges and Their Corresponding Character Sets

0000-007F: C0 Controls and Basic Latin
0080-00FF: C1 Controls and Latin-1 Supplement
0100-017F: Latin Extended-A
0180-024F: Latin Extended-B
0250-02AF: IPA Extensions
02B0-02FF: Spacing Modifier Letters
0300-036F: Combining Diacritical Marks
0370-03FF: Greek and Coptic
0400-04FF: Cyrillic
0500-052F: Cyrillic Supplement
0530-058F: Armenian
0590-05FF: Hebrew
0600-06FF: Arabic
0700-074F: Syriac
0750-077F: Arabic Supplement
0780-07BF: Thaana (Maldivian)
07C0-07FF: N'Ko (West African script)
0800-085F: Avestan and Pahlavi
0860-087F: Mandaic
0880-08AF: Samaritan
0900-097F: Devanagari
0980-09FF: Bengali
0A00-0A7F: Gurmukhi
0A80-0AFF: Gujarati
0B00-0B7F: Oriya
0B80-0BFF: Tamil
0C00-0C7F: Telugu
0C80-0CFF: Kannada
0D00-0D7F: Malayalam
0D80-0DFF: Sinhala
0E00-0E7F: Thai
0E80-0EFF: Lao
0F00-0FFF: Tibetan
1000-109F: Myanmar (Burmese)
10A0-10FF: Georgian
1100-11FF: Hangul Jamo
1200-137F: Ethiopic
1380-139F: Ethiopic Supplement
13A0-13FF: Cherokee
1400-167F: Unified Canadian Aboriginal Syllabics
1680-169F: Ogham
16A0-16FF: Runic
1700-171F: Tagalog
1720-173F: Hanunóo
1740-175F: Buhid
1760-177F: Tagbanwa
1780-17FF: Khmer
1800-18AF: Mongolian
18B0-18FF: Cham
1900-194F: Limbu
1950-197F: Tai Le (Dehong Dai)
1980-19DF: New Tai Lue
19E0-19FF: Khmer Symbols
1A00-1A1F:Buginese
1A20-1A5F:Batak
1A80-1AEF:Lanna
1B00-1B7F: Balinese
1B80-1BBF: Sundanese
1BC0-1BFF: Pahawh Hmong
1C00-1C4F: Lepcha
1C50-1C7F: Ol Chiki
1C80-1CDF: Meithei/Manipuri
1D00-1D7F: Phonetic Extensions
1D80-1DBF: Phonetic Extensions Supplement
1DC0-1DFF: Combining Diacritical Marks Supplement
1E00-1EFF: Latin Extended Additional
1F00-1FFF: Greek Extended
2000-206F: General Punctuation
2070-209F: Superscripts and Subscripts
20A0-20CF: Currency Symbols
20D0-20FF: Combining Diacritical Marks for Symbols
2100-214F: Letterlike Symbols
2150-218F: Number Forms
2190-21FF: Arrows
2200-22FF: Mathematical Operators
2300-23FF: Miscellaneous Technical
2400-243F: Control Pictures
2440-245F: Optical Character Recognition
2460-24FF: Enclosed Alphanumerics
2500-257F: Box Drawing
2580-259F: Block Elements
25A0-25FF: Geometric Shapes
2600-26FF: Miscellaneous Symbols
2700-27BF: Dingbats
27C0-27EF: Miscellaneous Mathematical Symbols-A
27F0-27FF: Supplemental Arrows-A
2800-28FF: Braille Patterns
2900-297F: Supplemental Arrows-B
2980-29FF: Miscellaneous Mathematical Symbols-B
2A00-2AFF: Supplemental Mathematical Operators
2B00-2BFF: Miscellaneous Symbols and Arrows
2C00-2C5F: Glagolitic
2C60-2C7F: Latin Extended-C
2C80-2CFF: Coptic
2D00-2D2F: Georgian Supplement
2D30-2D7F: Tifinagh
2D80-2DDF: Ethiopic Extended
2E00-2E7F: Supplemental Punctuation
2E80-2EFF: CJK Radicals Supplement
2F00-2FDF: Kangxi Radicals
2FF0-2FFF: Ideographic Description Characters
3000-303F: CJK Symbols and Punctuation
3040-309F: Hiragana
30A0-30FF: Katakana
3100-312F: Bopomofo
3130-318F: Hangul Compatibility Jamo
3190-319F: Kanbun
31A0-31BF: Bopomofo Extended
31C0-31EF: CJK Strokes
31F0-31FF: Katakana Phonetic Extensions
3200-32FF: Enclosed CJK Letters and Months
3300-33FF: CJK Compatibility
3400-4DBF: CJK Unified Ideographs Extension A
4DC0-4DFF: Yijing Hexagram Symbols
4E00-9FBF: CJK Unified Ideographs
A000-A48F: Yi Syllables
A490-A4CF: Yi Radicals
A500-A61F: Vai
A660-A6FF: Unified Canadian Aboriginal Syllabics Supplement
A700-A71F: Modifier Tone Letters
A720-A7FF: Latin Extended-D
A800-A82F: Syloti Nagri
A840-A87F: Phags-pa
A880-A8DF: Saurashtra
A900-A97F: Javanese
A980-A9DF: Chakma
AA00-AA3F: Varang Kshiti
AA40-AA6F: Sorang Sompeng
AA80-AADF: Newari
AB00-AB5F: Tai Viet (Việt Thái)
AB80-ABA0: Kayah Li
AC00-D7AF: Hangul Syllables
D800-DBFF: High-half zone of UTF-16 (high surrogates)
DC00-DFFF: Low-half zone of UTF-16 (low surrogates)
E000-F8FF: Private Use Area
F900-FAFF: CJK Compatibility Ideographs
FB00-FB4F: Alphabetic Presentation Forms
FB50-FDFF: Arabic Presentation Forms-A
FE00-FE0F: Variation Selectors
FE10-FE1F: Vertical Forms
FE20-FE2F: Combining Half Marks
FE30-FE4F: CJK Compatibility Forms
FE50-FE6F: Small Form Variants
FE70-FEFF: Arabic Presentation Forms-B
FF00-FFEF: Halfwidth and Fullwidth Forms
FFF0-FFFF: Specials
10300-1032F: Old Italic
10330-1034F: Gothic
10400-1044F: Deseret
1D000-1D0FF: Byzantine Musical Symbols
1D100-1D1FF: Musical Symbols
1D400-1D7FF: Mathematical Alphanumeric Symbols
20000-2A6D6: CJK Unified Ideographs Extension B
2F800-2FA1F: CJK Compatibility Ideographs Supplement
E0000-E007F: Tags
F0000-FFFFD: Private Use
100000-10FFFD: Private Use
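
A table like this is typically used to find out which range a given character falls into. Below is a minimal Python sketch of that lookup, assuming only a small excerpt of the ranges listed above. The names `BLOCKS`, `STARTS`, and `block_of` are hypothetical and chosen for illustration; a real program would load the full table.

```python
from bisect import bisect_right

# Excerpt of the table above as (first, last, name), sorted by first code point.
# Only a few ranges are included here; this is an illustrative subset.
BLOCKS = [
    (0x0000, 0x007F, "C0 Controls and Basic Latin"),
    (0x0370, 0x03FF, "Greek and Coptic"),
    (0x3040, 0x309F, "Hiragana"),
    (0x4E00, 0x9FBF, "CJK Unified Ideographs"),
    (0xAC00, 0xD7AF, "Hangul Syllables"),
    (0x20000, 0x2A6D6, "CJK Unified Ideographs Extension B"),
]

# Start points only, so a binary search can locate the candidate range.
STARTS = [first for first, _, _ in BLOCKS]


def block_of(ch):
    """Return the name of the range containing the character, or None."""
    cp = ord(ch)
    # Index of the last range whose start is <= cp.
    i = bisect_right(STARTS, cp) - 1
    if i >= 0:
        first, last, name = BLOCKS[i]
        if cp <= last:
            return name
    return None


print(block_of("A"))   # C0 Controls and Basic Latin
print(block_of("中"))  # CJK Unified Ideographs
print(block_of("あ"))  # Hiragana
print(block_of("é"))   # None (U+00E9 is not in this excerpt)
```

Because the ranges do not overlap and are sorted, a binary search over the start points is enough; a linear scan would also work for a table of this size.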