

GB 18030

Figure: GB 18030 encoding layout. "Half codes" indicates codes used in pairs as four-byte codes.

MIME / IANA: GB18030
Alias(es): Code page 54936
Language(s): International, but primarily meant for Chinese
Standard: GB 18030-2022, GB 18030-2005, GB 18030-2000
Classification: Unicode Transformation Format, extended ASCII,[a] variable-width encoding, CJK encoding
Extends: EUC-CN, GBK
Transforms / Encodes: ISO 10646 (Unicode)
Preceded by: GBK, GB2312

  a. ^ Not in the strictest sense of the term, as ASCII bytes can appear as trail bytes.

GB 18030 is a Chinese government standard, titled "Information Technology — Chinese coded character set", that defines the language and character support required of software in China. GB18030 is the registered Internet name for the official character set of the People's Republic of China (PRC), superseding GB2312.[1] As a Unicode Transformation Format[a] (i.e. an encoding of all Unicode code points), GB18030 supports both simplified and traditional Chinese characters. It is also compatible with legacy encodings including GB/T 2312, CP936,[b] and GBK 1.0.
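
The properties described above (variable width, compatibility with GB2312 and GBK, and ASCII bytes appearing as trail bytes) can be observed with a small sketch. It assumes Python's built-in `gb18030`, `gbk` and `gb2312` codecs, which may not reflect the 2022 revision of the standard; the helper names `gb18030_widths` and `has_ascii_trail_byte` are hypothetical and chosen only for illustration.

```python
def gb18030_widths(text: str) -> list[int]:
    """Return the number of GB 18030 bytes used by each character of text."""
    return [len(ch.encode("gb18030")) for ch in text]


def has_ascii_trail_byte(ch: str) -> bool:
    """True if a multi-byte encoding of ch has a byte below 0x80 after the lead byte."""
    encoded = ch.encode("gb18030")
    return len(encoded) > 1 and any(b < 0x80 for b in encoded[1:])


# One ASCII letter, one common Chinese character, one supplementary-plane emoji.
sample = "A中😀"

# Variable width: each character may take a different number of bytes.
widths = gb18030_widths(sample)

# Encoding and decoding the sample should give back the original string.
round_trip = sample.encode("gb18030").decode("gb18030")

# Legacy compatibility: a GB2312 character keeps the same bytes in GBK and GB 18030.
same_as_legacy = (
    "中".encode("gb18030") == "中".encode("gbk") == "中".encode("gb2312")
)

print(widths, round_trip == sample, same_as_legacy)
print(has_ascii_trail_byte("😀"))
```

The trail-byte check is why GB 18030 is "extended ASCII" only in a loose sense: a decoder cannot treat every byte below 0x80 as a standalone ASCII character without tracking its position in a multi-byte sequence.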

The Unicode Consortium has warned implementers that the latest version of this standard, GB 18030-2022, introduces what it describes as "disruptive changes" from the previous version, GB 18030-2005, "involving 33 different characters and 55 code positions".[2] GB 18030-2022 was enforced from 1 August 2023.[3] It has been implemented in ICU 73.2 and in Java 21,[4] and has been backported to the older Java LTS releases 8, 11 and 17, as well as to 20.0.2.[5]

In addition to the encoding method, the standard contains requirements about which additional scripts and languages should be represented, and to whom the standard applies.[6] It does not, however, define the official forms of Chinese characters; these are standardised in the List of Commonly Used Standard Chinese Characters.

  1. ^ Anthony Fok (2002-03-15). "Application of IANA Charset Registration for GB18030". IANA Character Set Registrations. Retrieved 2016-12-05.
  2. ^ "Disruptive Changes in GB 18030-2022" (PDF). www.unicode.org. Retrieved 2024-02-12.
  3. ^ "[JDK-8301119] Support for GB18030-2022 - Java Bug System". bugs.openjdk.org. Retrieved 2023-08-14.
  4. ^ "JDK 21 Release Notes". jdk.java.net. Retrieved 2023-08-14.
  5. ^ "[JDK-8307340] Release Note: Support for GB18030-2022 - Java Bug System". bugs.openjdk.org. Retrieved 2023-08-30.
  6. ^ Cite error: The named reference CESI-FAQ was invoked but never defined (see the help page).

