5645
INFORMATIONAL
Update to the Language Subtag Registry
Authors: D. Ewell
Date: September 2009
Area: app
Working Group: ltru
Stream: IETF
Network Working Group D. Ewell, Ed.
Request for Comments: 5645 Consultant
Category: Informational September 2009
<span class="h1">Update to the Language Subtag Registry</span>
Abstract
This memo defines the procedure used to update the IANA Language
Subtag Registry, in conjunction with the publication of <a href="./rfc5646">RFC 5646</a>, for
use in forming tags for identifying languages.
Status of This Memo
This memo provides information for the Internet community. It does
not specify an Internet standard of any kind. Distribution of this
memo is unlimited.
Copyright Notice
Copyright (c) 2009 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a> and the IETF Trust's Legal
Provisions Relating to IETF Documents in effect on the date of
publication of this document (<a href="http://trustee.ietf.org/license-info">http://trustee.ietf.org/license-info</a>).
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document.
Table of Contents
<a href="#section-1">1</a>. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . <a href="#page-2">2</a>
<a href="#section-2">2</a>. Updating the Registry . . . . . . . . . . . . . . . . . . . . <a href="#page-2">2</a>
<a href="#section-2.1">2.1</a>. Starting Point . . . . . . . . . . . . . . . . . . . . . . <a href="#page-2">2</a>
<a href="#section-2.2">2.2</a>. New Language Subtags . . . . . . . . . . . . . . . . . . . <a href="#page-4">4</a>
<a href="#section-2.3">2.3</a>. Modified Language Subtags . . . . . . . . . . . . . . . . <a href="#page-5">5</a>
<a href="#section-2.4">2.4</a>. New Region Subtags . . . . . . . . . . . . . . . . . . . . <a href="#page-6">6</a>
<a href="#section-2.5">2.5</a>. Grandfathered and Redundant Tags . . . . . . . . . . . . . <a href="#page-6">6</a>
<a href="#section-2.6">2.6</a>. Preferred-Value Changes . . . . . . . . . . . . . . . . . <a href="#page-9">9</a>
<a href="#section-2.7">2.7</a>. Additional Changes . . . . . . . . . . . . . . . . . . . . <a href="#page-9">9</a>
<a href="#section-3">3</a>. Updated Registry Contents . . . . . . . . . . . . . . . . . . <a href="#page-10">10</a>
<a href="#section-4">4</a>. Security Considerations . . . . . . . . . . . . . . . . . . . <a href="#page-10">10</a>
<a href="#section-5">5</a>. IANA Considerations . . . . . . . . . . . . . . . . . . . . . <a href="#page-11">11</a>
<a href="#section-6">6</a>. References . . . . . . . . . . . . . . . . . . . . . . . . . . <a href="#page-11">11</a>
<a href="#section-6.1">6.1</a>. Normative References . . . . . . . . . . . . . . . . . . . <a href="#page-11">11</a>
<a href="#section-6.2">6.2</a>. Informative References . . . . . . . . . . . . . . . . . . <a href="#page-12">12</a>
<a href="#appendix-A">Appendix A</a>. Acknowledgements . . . . . . . . . . . . . . . . . . <a href="#page-13">13</a>
<span class="grey">Ewell Informational [Page 1]</span>
<span id="page-2" ></span>
<span class="grey"><a href="./rfc5645">RFC 5645</a> Update to the Language Subtag Registry September 2009</span>
<span class="h2"><a class="selflink" id="section-1" href="#section-1">1</a>. Introduction</span>
[<a id="ref-RFC4646">RFC4646</a>] provides for a Language Subtag Registry and describes its
format. The initial contents of the registry and rules for
determining them are specified in [<a href="./rfc4645" title=""Initial Language Subtag Registry"">RFC4645</a>].
[<a id="ref-RFC5646">RFC5646</a>] expands on [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] by adding support for approximately
7,500 new primary and extended language subtags based on [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>]
and [<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>] alpha-3 code elements, and seven new region subtags
based on [<a href="#ref-ISO3166-1" title=""ISO 3166- 1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes"">ISO3166-1</a>] exceptionally reserved code elements. This memo
describes the process of updating the registry to include these
additional subtags and to make secondary changes to the registry that
result from adding the new subtags and from other decisions made by
the Language Tag Registry Update (LTRU) Working Group.
As part of preparing this document, a complete replacement of the
contents of the Language Subtag Registry was provided to the Internet
Assigned Numbers Authority (IANA) to record the necessary updates.
The format of the Language Subtag Registry as well as the definition
and intended purpose of each of the fields are described in
[<a href="./rfc5646" title=""Tags for Identifying Languages"">RFC5646</a>].
The registry is expected to change over time, as new subtags are
registered and existing subtags are modified or deprecated. The
process of updating the registry is described in <a href="./rfc5646#section-3">Section 3 of
[RFC5646]</a>.
Many of the subtags defined in the Language Subtag Registry are based
on code elements defined in [<a href="#ref-ISO639-1" title=""ISO 639- 1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code"">ISO639-1</a>], [<a href="#ref-ISO639-2" title=""ISO 639- 2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code"">ISO639-2</a>], [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>],
[<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>], [<a href="#ref-ISO3166-1" title=""ISO 3166- 1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes"">ISO3166-1</a>], [<a href="#ref-ISO15924" title=""ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts"">ISO15924</a>], and [<a href="#ref-UN_M.49" title=""Standard Country or Area Codes for Statistical Use"">UN_M.49</a>]. The registry is
not a mirror of the code lists defined by these standards and should
not be used as one.
<span class="h2"><a class="selflink" id="section-2" href="#section-2">2</a>. Updating the Registry</span>
This section describes the process for determining the updated
contents of the Language Subtag Registry.
<span class="h3"><a class="selflink" id="section-2.1" href="#section-2.1">2.1</a>. Starting Point</span>
The version of the Language Subtag Registry that was current at the
time of IESG approval of this memo served as the starting point for
this update. This version was created according to the process
described in [<a href="./rfc4645" title=""Initial Language Subtag Registry"">RFC4645</a>] and maintained according to the process
described in [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>].
<span id="page-3" ></span>
The source data for [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] used for this update consisted of
three files, available from the official site of the ISO 639-3
Registration Authority. (Note that these files are updated from time
to time. The versions used in the preparation of this memo were those
in place on February 24, 2009.)
o [<a href="#ref-iso-639-3_20090210">iso-639-3_20090210</a>] is a list of all language code elements in
[<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>], including the alpha-3 code element and reference name
for each code element. For example, the entry for the Dari
language contained the code element 'prs' and the name "Dari"
(among other information).
o [<a href="#ref-iso-639-3_Name_Index_20090210">iso-639-3_Name_Index_20090210</a>] is a list containing all names
associated with each language according to [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>], including
both inverted and non-inverted forms where appropriate. An
"inverted" name is one that is altered from the usual English-
language order by moving adjectival qualifiers to the end, after
the main language name and separated by a comma. A code element
may have more than one entry in this file; the reference name and
its inverted form are usually, but not always, given in the first
entry. For example, this file contained an entry for the code
element 'prs' with the name "Dari" (twice) and another entry with
the names "Eastern Farsi" and "Farsi, Eastern".
o [<a href="#ref-iso-639-3-macrolanguages_20090120">iso-639-3-macrolanguages_20090120</a>] is a list of all alpha-3 code
elements for languages that are encompassed by a macrolanguage in
[<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>], together with the alpha-3 code element for the
macrolanguage. For example, a line containing the code elements
'fas' and 'prs' indicated that the macrolanguage "Persian"
encompasses the individual language "Dari". (Note that these
alpha-3 code elements may not have corresponded directly to
subtags in the registry, which uses 2-letter subtags derived from
[<a href="#ref-ISO639-1" title=""ISO 639- 1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code"">ISO639-1</a>] when possible.)
The source data for [<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>] used for this update consisted of one
file, available from the official site of the ISO 639-5 Registration
Authority. (Note that this file is updated from time to time. The
version used in the preparation of this memo was the one in place on
February 24, 2009.)
o [<a href="#ref-iso639-5.tab.txt">iso639-5.tab.txt</a>] is a list of all language code elements in
[<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>], including the alpha-3 code elements and English name
for each code element. For example, this file included an entry
containing the code element 'ira' and the name "Iranian languages"
(among other information).
<span id="page-4" ></span>
Language code elements that were already retired in all of the source
standards prior to IESG approval of this memo were not listed in
these files and, consequently, were not considered in this update.
The values of the File-Date field, the Added date for each new subtag
record, and the Deprecated date for each existing grandfathered or
redundant tag deprecated by this update were set to a date as near as
practical to the date this memo was approved for publication by the IESG.
<span class="h3"><a class="selflink" id="section-2.2" href="#section-2.2">2.2</a>. New Language Subtags</span>
For each language in [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] that was not already represented by a
language subtag in the Language Subtag Registry, a new language
subtag was added to the registry, using the [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] code element
as the value for the Subtag field and using each of the non-inverted
[<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] names as a separate Description field. The [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>]
reference name is represented by the first Description field.
If the language was encompassed by one of the [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>]
macrolanguages 'ar' (Arabic), 'kok' (Konkani), 'ms' (Malay), 'sw'
(Swahili), 'uz' (Uzbek), or 'zh' (Chinese), as determined by
[<a href="#ref-iso-639-3-macrolanguages_20090120">iso-639-3-macrolanguages_20090120</a>], an extended language subtag was
also added, with the primary language subtag of the macrolanguage as
the value for the Prefix field. These macrolanguage subtags were
already present in the Language Subtag Registry and were chosen
because they were determined by the LTRU Working Group to have been
used to represent a single dominant language as well as the
macrolanguage as a whole, making the extended language mechanism
suitable for languages encompassed by the macrolanguage.
If the name of the language included the word "Sign", an extended
language subtag was added, with the string "sgn" as the value for the
Prefix field. This is a special case that treats the existing
primary language subtag for "Sign languages" as if it were a
macrolanguage encompassing all sign languages.
All extended language subtags were added with a Preferred-Value equal
to the corresponding primary language subtag.
If the language was encompassed by a macrolanguage, as determined by
[<a href="#ref-iso-639-3-macrolanguages_20090120">iso-639-3-macrolanguages_20090120</a>], a Macrolanguage field was added
for the encompassed language, with a value equal to the subtag of the
macrolanguage. (Note that 'sgn' is defined as a "collection code" by
[<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] and hence is not included in that standard; therefore, no
Macrolanguage field was added for sign language subtags.)
If the language was assigned a "Scope" value of 'M' (Macrolanguage)
in [<a href="#ref-iso-639-3_20090210">iso-639-3_20090210</a>], a Scope value of "macrolanguage" was added
<span id="page-5" ></span>
for the language. Otherwise, if the language was assigned a "Scope"
value of 'S' (Special), a Scope value of "special" was added. Most
languages in [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] have scope 'I' (Individual) and thus were not
assigned a Scope value in the registry.
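The rules above for languages taken from ISO 639-3 can be sketched
roughly as follows. This is an illustrative sketch only, not the tool
used to prepare the registry; the function name build_records, its
parameters, and the dictionary-based record shape are assumptions made
for this example. Added dates and any fields not mentioned in the text
above are omitted.

```python
# Macrolanguages whose encompassed languages also received extlang subtags
# (list taken from the text of this section).
EXTLANG_MACROLANGUAGES = {"ar", "kok", "ms", "sw", "uz", "zh"}

# ISO 639-3 scope codes that produce a Scope field in the registry.
# Scope 'I' (Individual) produces no Scope field.
SCOPE_VALUES = {"M": "macrolanguage", "S": "special"}


def build_records(code, names, scope="I", macrolanguage=None):
    """Return the new registry records for one ISO 639-3 language.

    code          -- ISO 639-3 alpha-3 code element
    names         -- non-inverted names, reference name first
    scope         -- ISO 639-3 scope code ('I', 'M', or 'S')
    macrolanguage -- registry subtag of the encompassing macrolanguage,
                     or None (hypothetical parameter for this sketch)
    """
    language = {"Type": "language", "Subtag": code, "Description": list(names)}
    if macrolanguage is not None:
        language["Macrolanguage"] = macrolanguage
    if scope in SCOPE_VALUES:
        language["Scope"] = SCOPE_VALUES[scope]
    records = [language]

    # Decide whether an extlang record is also needed, and with which Prefix.
    prefix = None
    if macrolanguage in EXTLANG_MACROLANGUAGES:
        prefix = macrolanguage
    elif any("Sign" in name.split() for name in names):
        prefix = "sgn"  # special case: 'sgn' is treated like a macrolanguage

    if prefix is not None:
        records.append({
            "Type": "extlang",
            "Subtag": code,
            "Description": list(names),
            "Preferred-Value": code,  # always the primary language subtag
            "Prefix": prefix,
        })
    return records
```

Under these assumptions, build_records("cmn", ["Mandarin Chinese"],
macrolanguage="zh") yields both a language record and an extlang
record with Prefix "zh", while build_records("prs", ["Dari", "Eastern
Farsi"], macrolanguage="fa") yields only a language record.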
For each language in [<a href="#ref-iso639-5.tab.txt">iso639-5.tab.txt</a>] that was not already
represented by a language subtag in the Language Subtag Registry, a
new language subtag was added to the registry, using the [<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>]
code element as the value for the Subtag field and using the "English
name" field as the Description field. Each of these languages was
assigned a Scope value of "collection" in the registry.
All subtags were added to the registry maintaining alphabetical order
within each type of subtag: all 2-letter "language" subtags first,
then all 3-letter "language" subtags, and finally all "extlang"
subtags. Some existing records were moved to ensure this order.
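The ordering described above can be expressed as a sort key. The
following is a minimal sketch, assuming each record is a dictionary
with "Type" and "Subtag" entries; the name registry_sort_key is
hypothetical.

```python
def registry_sort_key(record):
    """Order records: 2-letter languages, other languages, then extlangs."""
    subtag = record["Subtag"]
    if record["Type"] == "language":
        group = 0 if len(subtag) == 2 else 1
    else:  # "extlang"
        group = 2
    # Within each group, records are ordered alphabetically by subtag.
    return (group, subtag)


records = [
    {"Type": "extlang", "Subtag": "cmn"},
    {"Type": "language", "Subtag": "zh"},
    {"Type": "language", "Subtag": "cmn"},
    {"Type": "language", "Subtag": "ar"},
]
ordered = sorted(records, key=registry_sort_key)
```

With this key, the language records 'ar' and 'zh' precede the language
record 'cmn', which in turn precedes the extlang record 'cmn'.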
<span class="h3"><a class="selflink" id="section-2.3" href="#section-2.3">2.3</a>. Modified Language Subtags</span>
For each language in [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] that was already represented by a
language subtag in the Language Subtag Registry, Description fields
were added as necessary to reflect all non-inverted names listed for
that language in [<a href="#ref-iso-639-3_Name_Index_20090210">iso-639-3_Name_Index_20090210</a>]. Any existing
Description fields that reflected inverted names or that represented
a strict subset of the information provided by the [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] name
were deleted. An example of the latter was the name "Ainu" for the
subtag 'ain', which provided less information than the [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>]
name "Ainu (Japan)".
The order of Description fields was adjusted to ensure that the
reference name from [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] was listed first, followed by other
names from [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] in the order presented by that standard,
followed by any other names already existing in the registry. In
some cases, this resulted in a reordering of Description fields for
existing entries, even when no new values were added.
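The treatment of Description fields for existing language subtags can
be sketched as follows. This is an approximation under stated
assumptions: the function name merge_descriptions and its parameters
are hypothetical, and the "strict subset" test is reduced to a simple
heuristic (an ISO 639-3 name that extends the old name with a
parenthetical), whereas the actual determination was an editorial
judgment.

```python
def merge_descriptions(iso_names, inverted_names, existing):
    """Return the updated Description list for an existing language subtag.

    iso_names      -- non-inverted ISO 639-3 names, reference name first
    inverted_names -- inverted forms listed for the same code element
    existing       -- Description values currently in the registry
    """
    def is_superseded(name):
        # Heuristic stand-in for "strict subset of the information":
        # some ISO 639-3 name is the old name plus a parenthetical.
        return any(iso.startswith(name + " (") for iso in iso_names)

    # Keep only pre-existing names that are neither duplicated by, nor
    # an inverted form of, nor superseded by an ISO 639-3 name.
    kept = [
        name for name in existing
        if name not in iso_names
        and name not in inverted_names
        and not is_superseded(name)
    ]
    # ISO 639-3 names come first, in the order given; others follow.
    return list(iso_names) + kept
```

For the 'ain' example above, merge_descriptions(["Ainu (Japan)"], [],
["Ainu"]) returns only "Ainu (Japan)".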
For each language that was encompassed by a macrolanguage in
[<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>], a Macrolanguage field was added, with a value equal to
the subtag of the macrolanguage.
For each language in [<a href="#ref-iso639-5.tab.txt">iso639-5.tab.txt</a>] that was already represented
in the Language Subtag Registry, the Description field was adjusted
as necessary to match the "English name" field in [<a href="#ref-iso639-5.tab.txt">iso639-5.tab.txt</a>].
Names in inverted form were rearranged to remove the inversion. Each
of these languages was assigned a Scope value of "collection".
Existing language subtags whose code elements were assigned prior to
the publication of [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] or [<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>] and that were identified
by the [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] Registration Authority as representing collections
<span id="page-6" ></span>
were also assigned a Scope value of "collection", even though they
are not listed as such in [<a href="#ref-iso639-5.tab.txt">iso639-5.tab.txt</a>].
Note in particular that the change from [<a href="#ref-ISO639-2" title=""ISO 639- 2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code"">ISO639-2</a>] names such as
"Afro-Asiatic (Other)" to [<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>] names such as "Afro-Asiatic
languages" implies a broadening of scope for some of these subtags,
designated "remainder groups" in [<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>]. While
[<a href="#ref-iso639-5.tab.txt">iso639-5.tab.txt</a>] includes a field indicating which code elements
are designated as "groups" or "remainder groups" in [<a href="#ref-ISO639-2" title=""ISO 639- 2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code"">ISO639-2</a>],
[<a href="./rfc5646" title=""Tags for Identifying Languages"">RFC5646</a>] does not make this distinction, and consequently this field
was not used in updating the Language Subtag Registry.
A Scope value of "private-use" was added for the unique record with
Subtag value 'qaa..qtz'. This record has a Description of "Private
use" (changed from "PRIVATE USE") and corresponds to a range of code
elements that is reserved for private use in [<a href="#ref-ISO639-2" title=""ISO 639- 2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code"">ISO639-2</a>]. The
Description fields for script and region private-use subtags were
also capitalized as "Private use".
<span class="h3"><a class="selflink" id="section-2.4" href="#section-2.4">2.4</a>. New Region Subtags</span>
[<a id="ref-RFC5646">RFC5646</a>] expands the scope of region subtags by adding subtags based
on code elements defined as "exceptionally reserved" in [<a href="#ref-ISO3166-1" title=""ISO 3166- 1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes"">ISO3166-1</a>].
These code elements are reserved by the ISO 3166 Maintenance Agency
"at the request of national ISO member bodies, governments and
international organizations". At the time of IESG approval of this
memo, ISO 3166/MA had defined nine exceptionally reserved code
elements, all of which were added to the Language Subtag Registry
except for the following:
o 'FX' (Metropolitan France) was already present in the Language
Subtag Registry because it was an assigned [<a href="#ref-ISO3166-1" title=""ISO 3166- 1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes"">ISO3166-1</a>] code
element from 1993 to 1997, but was deprecated with a Preferred-
Value of "FR".
o 'UK' (United Kingdom) was not added because it is associated with
the same UN M.49 code (826) as the existing region subtag 'GB'.
<a href="./rfc5646#section-3.4">[RFC5646], Section 3.4</a>, item 15 (D) states that a new region
subtag is not added to the Language Subtag Registry if it carries
the same meaning as an existing region subtag.
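The two exclusions above amount to a simple test applied to each
candidate code element. The sketch below is illustrative only: the
function name should_add_region is hypothetical, the registry is
modeled as a mapping from existing region subtags to UN M.49 codes,
and None is used here to mean that no M.49 code is recorded in the
example data.

```python
def should_add_region(code, m49, registry):
    """Decide whether a candidate region code element gets a new subtag.

    code     -- candidate ISO 3166-1 code element, e.g. "UK"
    m49      -- UN M.49 code associated with the candidate, or None
    registry -- mapping of existing region subtags to M.49 codes (or None)
    """
    if code in registry:
        return False  # already present, as with 'FX'
    if m49 is not None and m49 in registry.values():
        return False  # same meaning as an existing subtag, as with 'UK'
    return True


# Example data limited to the cases discussed in the text.
existing = {"GB": "826", "FX": None}
```

With this data, should_add_region("UK", "826", existing) is False
because 'GB' already carries M.49 code 826.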
<span class="h3"><a class="selflink" id="section-2.5" href="#section-2.5">2.5</a>. Grandfathered and Redundant Tags</span>
As stated in [<a href="./rfc5646" title=""Tags for Identifying Languages"">RFC5646</a>], "grandfathered" and "redundant" tags are
complete tags in the Language Subtag Registry that were registered
under [<a href="./rfc1766" title=""Tags for the Identification of Languages"">RFC1766</a>] or [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] and remain valid. Grandfathered tags
cannot be generated from a valid combination of subtags, while
redundant tags can be.
<span id="page-7" ></span>
Under certain conditions, registration of a subtag under [<a href="./rfc5646" title=""Tags for Identifying Languages"">RFC5646</a>]
may cause a grandfathered tag to be reclassified as redundant. It
may also enable the creation of a generative tag with the same
meaning as a grandfathered or redundant tag; in that case, the
grandfathered or redundant tag is marked as Deprecated, and the
generative tag (including the new subtag) becomes its Preferred-
Value.
As a result of adding the new subtags in this update, each of the
following grandfathered tags became composable, was reclassified as
redundant, and was deprecated with the indicated generative tag
serving as the Preferred-Value:
zh-cmn (Preferred-Value: cmn)
zh-cmn-Hans (Preferred-Value: cmn-Hans)
zh-cmn-Hant (Preferred-Value: cmn-Hant)
zh-gan (Preferred-Value: gan)
zh-wuu (Preferred-Value: wuu)
zh-yue (Preferred-Value: yue)
The following grandfathered tags were deprecated, with the indicated
generative tag serving as the Preferred-Value:
i-ami (Preferred-Value: ami)
i-bnn (Preferred-Value: bnn)
i-pwn (Preferred-Value: pwn)
i-tao (Preferred-Value: tao)
i-tay (Preferred-Value: tay)
i-tsu (Preferred-Value: tsu)
zh-hakka (Preferred-Value: hak)
zh-min (no Preferred-Value; see below)
zh-min-nan (Preferred-Value: nan)
zh-xiang (Preferred-Value: hsn)
<span id="page-8" ></span>
The tag "zh-min", originally registered under [<a href="./rfc1766" title=""Tags for the Identification of Languages"">RFC1766</a>], is a special
case: it represents a small class of Chinese languages, but is not a
true macrolanguage. The string "min" could never be used to tag
these languages since the [<a href="#ref-ISO639-3" title=""ISO 639- 3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages"">ISO639-3</a>] code element 'min' is assigned
to an individual language (Minangkabau) that is not related to
Chinese ('zh'). Because the tag "zh-min" is not believed to represent
a useful linguistic entity for tagging purposes, it was deprecated
without a Preferred-Value.
The following grandfathered and redundant sign language tags were
deprecated, with the indicated generative tag serving as the
Preferred-Value:
sgn-BE-FR (Preferred-Value: sfb)
sgn-BE-NL (Preferred-Value: vgt)
sgn-BR (Preferred-Value: bzs)
sgn-CH-DE (Preferred-Value: sgg)
sgn-CO (Preferred-Value: csn)
sgn-DE (Preferred-Value: gsg)
sgn-DK (Preferred-Value: dsl)
sgn-ES (Preferred-Value: ssp)
sgn-FR (Preferred-Value: fsl)
sgn-GB (Preferred-Value: bfi)
sgn-GR (Preferred-Value: gss)
sgn-IE (Preferred-Value: isg)
sgn-IT (Preferred-Value: ise)
sgn-JP (Preferred-Value: jsl)
sgn-MX (Preferred-Value: mfs)
sgn-NI (Preferred-Value: ncs)
sgn-NL (Preferred-Value: dse)
sgn-NO (Preferred-Value: nsl)
<span id="page-9" ></span>
sgn-PT (Preferred-Value: psr)
sgn-SE (Preferred-Value: swl)
sgn-US (Preferred-Value: ase)
sgn-ZA (Preferred-Value: sfs)
No change was made to the Description field(s) for any of the
grandfathered or redundant tags. For example, the redundant tag
"sgn-US" continues to carry the Description "American Sign Language".
The sign language tags registered prior to [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] remain an
exception to the general principle that the meaning of a non-
grandfathered tag can be derived from its component subtags.
In previous versions of the registry, grandfathered tags that had
been deprecated as a result of adding an ISO 639-based language
subtag included a Comments field, with a value of the form "replaced
by ISO code xxx", where 'xxx' represented the new language subtag.
These comments duplicated the information contained within the
Preferred-Value field and were deleted as part of this update. No
changes were made to other Comments fields.
<span class="h3"><a class="selflink" id="section-2.6" href="#section-2.6">2.6</a>. Preferred-Value Changes</span>
<a href="./rfc5646#section-3.1.7">[RFC5646], Section 3.1.7</a> provides for the value of Preferred-Value
fields to be updated as necessary to reflect changes in one of the
source standards. Accordingly, the Preferred-Value fields for the
following deprecated tags were changed:
i-hak (changed from zh-hakka to hak)
zh-guoyu (changed from zh-cmn to cmn)
This makes it unnecessary for consumers of the Language Subtag
Registry to follow a "chain" of Preferred-Values in order to arrive
at a non-deprecated subtag.
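The effect of this change can be illustrated with a short sketch that
collapses chains of Preferred-Values. The function names and the
dictionary representation are assumptions made for the example; the
mappings shown are the ones named in this section and in Section 2.5.

```python
def resolve_preferred(tag, preferred):
    """Follow Preferred-Value links until reaching a tag that has none."""
    seen = {tag}
    while tag in preferred:
        tag = preferred[tag]
        if tag in seen:
            raise ValueError("cycle in Preferred-Value mappings")
        seen.add(tag)
    return tag


def flatten(preferred):
    """Return a mapping in which every tag points at its final value."""
    return {tag: resolve_preferred(tag, preferred) for tag in preferred}


# Preferred-Values as they would stand if the two fields were not changed.
before = {
    "i-hak": "zh-hakka",
    "zh-hakka": "hak",
    "zh-guoyu": "zh-cmn",
    "zh-cmn": "cmn",
}
after = flatten(before)
```

After flattening, "i-hak" maps directly to "hak" and "zh-guoyu" maps
directly to "cmn", so a consumer needs only a single lookup.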
<span class="h3"><a class="selflink" id="section-2.7" href="#section-2.7">2.7</a>. Additional Changes</span>
For consistency with the handling of alternative names in language
subtags, Description fields for script subtags taken from [<a href="#ref-ISO15924" title=""ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts"">ISO15924</a>]
that represent alternative names were converted to multiple
Description fields. For example, the Description "Han (Hanzi, Kanji,
Hanja)" was converted to four separate Description fields. Some
Description fields for script subtags contained parenthetical
material that was explanatory, rather than identifying alternative
names; these fields were not altered.
<span id="page-10" ></span>
This situation does not apply to region subtags taken from
[<a href="#ref-ISO3166-1" title=""ISO 3166- 1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes"">ISO3166-1</a>] and [<a href="#ref-UN_M.49" title=""Standard Country or Area Codes for Statistical Use"">UN_M.49</a>] because those standards do not provide
freely available alternative names for code elements.
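The conversion of script Description fields described above can be
sketched as follows. The function name split_alternative_names is
hypothetical. Deciding whether a parenthetical lists alternative names
or is merely explanatory was an editorial judgment, so this sketch
would be applied only to fields already identified as listing
alternative names.

```python
def split_alternative_names(description):
    """Split 'Name (Alt1, Alt2)' into ['Name', 'Alt1', 'Alt2'].

    Descriptions without a trailing parenthetical are returned unchanged
    as a one-element list.
    """
    if not description.endswith(")") or " (" not in description:
        return [description]
    main, _, rest = description.partition(" (")
    # Drop the closing parenthesis, then split the comma-separated names.
    alternatives = [alt.strip() for alt in rest[:-1].split(",")]
    return [main] + alternatives
```

Applied to "Han (Hanzi, Kanji, Hanja)", the function returns four
values, one for each resulting Description field.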
Description fields in inverted form for script and region subtags
were rearranged to remove the inversion, for consistency with the
handling of language subtags in Sections <a href="#section-2.2">2.2</a> and <a href="#section-2.3">2.3</a>. For example,
the Description field "Korea, Republic of" was changed to "Republic
of Korea".
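For simple cases, removing the inversion is a matter of moving the
trailing qualifier back in front of the main name. The sketch below
assumes a single ", " separator; the function name uninvert is
hypothetical, and names containing additional commas or parenthetical
material would need manual review.

```python
def uninvert(name):
    """Move a trailing qualifier back in front of the main name."""
    main, sep, qualifier = name.partition(", ")
    if not sep:
        return name  # no comma: nothing to rearrange
    return f"{qualifier} {main}"
```

With this sketch, uninvert("Korea, Republic of") gives "Republic of
Korea", and a name without a comma is returned unchanged.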
The capitalization of the Subtag fields for certain grandfathered and
redundant tags (sgn-BE-fr, sgn-BE-nl, sgn-CH-de, and yi-latn) was
modified to conform with the capitalization conventions described in
<a href="./rfc5646#section-2.1.1">[RFC5646], Section 2.1.1</a>. This has no effect on the validity or
meaning of these tags.
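The capitalization conventions can be approximated in a few lines. The
sketch below reflects one reading of the RFC 5646 conventions, namely
that all subtags are lowercase except that 2-letter subtags are
uppercase and 4-letter subtags are titlecase when they neither begin
the tag nor follow a singleton. The function name canonical_case is
hypothetical.

```python
def canonical_case(tag):
    """Apply the conventional capitalization to a language tag."""
    result = []
    seen_singleton = False
    for index, subtag in enumerate(tag.lower().split("-")):
        if len(subtag) == 1:
            # Everything after a singleton (extension or private use)
            # stays lowercase.
            seen_singleton = True
        elif index > 0 and not seen_singleton:
            if len(subtag) == 2:
                subtag = subtag.upper()       # region-style subtag
            elif len(subtag) == 4:
                subtag = subtag.capitalize()  # script-style subtag
        result.append(subtag)
    return "-".join(result)
```

Under this reading, "sgn-BE-fr" becomes "sgn-BE-FR" and "yi-latn"
becomes "yi-Latn", matching the changes described above.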
The Description field for subtag 'sgn' was capitalized as "Sign
languages" to match the capitalization used for other languages in
[<a href="#ref-ISO639-5" title=""ISO 639- 5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups"">ISO639-5</a>], even though this capitalization does not exactly match
that used for code element 'sgn' in any of the ISO 639 parts.
The Deprecated field for the region subtag TP was modified from
2002-11-15 to 2002-05-20, to correct a clerical error. The corrected date
reflects the actual date the code element TP was withdrawn in
[<a href="#ref-ISO3166-1" title=""ISO 3166- 1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes"">ISO3166-1</a>].
The order of fields within records in the registry was adjusted as
necessary to match the order in which these fields are described in
<a href="./rfc5646#section-3.1.2">[RFC5646], Section 3.1.2</a>. This ordering is not required by [<a href="./rfc5646" title=""Tags for Identifying Languages"">RFC5646</a>]
and may not necessarily be reflected in future additions or
modifications to the registry.
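A reordering of this kind could be sketched as follows. The `FIELD_ORDER` list is an assumption based on the order in which [RFC5646], Section 3.1.2 is understood to list the fields, and should be checked against that section. The function name `reorder_fields` and the representation of a record as a list of (name, body) pairs are hypothetical.

```python
from __future__ import annotations

# Assumed field order; verify against [RFC5646], Section 3.1.2.
FIELD_ORDER = [
    "Type", "Subtag", "Tag", "Description", "Added", "Deprecated",
    "Preferred-Value", "Prefix", "Suppress-Script", "Macrolanguage",
    "Scope", "Comments",
]


def reorder_fields(record: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Sort the fields of one record into FIELD_ORDER.

    sorted() is stable, so repeated fields (such as several
    Description fields) keep their relative order. Unknown field
    names are placed last.
    """
    rank = {name: position for position, name in enumerate(FIELD_ORDER)}
    return sorted(record, key=lambda field: rank.get(field[0], len(FIELD_ORDER)))


# Partial record using only values mentioned in this memo.
record = [("Deprecated", "2002-05-20"), ("Subtag", "TP"), ("Type", "region")]
for name, body in reorder_fields(record):
    print(f"{name}: {body}")
```

Since the ordering is not required, a consumer of the registry would be safer looking fields up by name than relying on their position within a record.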
<span class="h2"><a class="selflink" id="section-3" href="#section-3">3</a>. Updated Registry Contents</span>
IANA has updated the Language Subtag Registry according to the
provided replacement contents. The replacement contents were listed
in the working draft of this document, but were deleted prior to
publication as an RFC to avoid potential confusion with the registry
itself. The Language Subtag Registry is available from the IANA
website, <<a href="http://www.iana.org">http://www.iana.org</a>>.
<span class="h2"><a class="selflink" id="section-4" href="#section-4">4</a>. Security Considerations</span>
For security considerations relevant to the Language Subtag Registry
and the use of language tags, see [<a href="./rfc5646" title=""Tags for Identifying Languages"">RFC5646</a>].
<span id="page-11" ></span>
<span class="h2"><a class="selflink" id="section-5" href="#section-5">5</a>. IANA Considerations</span>
IANA has updated the Language Subtag Registry, which can be found via
<<a href="http://www.iana.org">http://www.iana.org</a>>. For details on the procedures for the format
and ongoing maintenance of this registry, see <a href="./rfc5646">RFC 5646</a>.
<span class="h2"><a class="selflink" id="section-6" href="#section-6">6</a>. References</span>
<span class="h3"><a class="selflink" id="section-6.1" href="#section-6.1">6.1</a>. Normative References</span>
[<a id="ref-ISO639-3">ISO639-3</a>] International Organization for Standardization, "ISO 639-
3:2007. Codes for the representation of names of
languages -- Part 3: Alpha-3 code for comprehensive
coverage of languages", February 2007.
[<a id="ref-ISO639-5">ISO639-5</a>] International Organization for Standardization, "ISO 639-
5:2008. Codes for the representation of names of
languages -- Part 5: Alpha-3 code for language families
and groups", May 2008.
[<a id="ref-RFC5646">RFC5646</a>] Phillips, A., Ed. and M. Davis, Ed., "Tags for
Identifying Languages", <a href="./rfc5646">RFC 5646</a>, September 2009.
[<a id="ref-iso-639-3-macrolanguages_20090120">iso-639-3-macrolanguages_20090120</a>]
International Organization for Standardization, "ISO
639-3 Macrolanguage Mappings", January 2009, <<a href="http://www.sil.org/iso639-3/iso-639-3-macrolanguages_20090120.tab">http://</a>
<a href="http://www.sil.org/iso639-3/iso-639-3-macrolanguages_20090120.tab">www.sil.org/iso639-3/</a>
<a href="http://www.sil.org/iso639-3/iso-639-3-macrolanguages_20090120.tab">iso-639-3-macrolanguages_20090120.tab</a>>.
[<a id="ref-iso-639-3_20090210">iso-639-3_20090210</a>]
International Organization for Standardization, "ISO
639-3 Code Set", February 2009,
<<a href="http://www.sil.org/iso639-3/iso-639-3_20090210.tab">http://www.sil.org/iso639-3/iso-639-3_20090210.tab</a>>.
[<a id="ref-iso-639-3_Name_Index_20090210">iso-639-3_Name_Index_20090210</a>]
International Organization for Standardization, "ISO
639-3 Language Names Index", February 2009,
<<a href="http://www.sil.org/iso639-3/iso-639-3_Name_Index_20090210.tab">http://www.sil.org/</a>
<a href="http://www.sil.org/iso639-3/iso-639-3_Name_Index_20090210.tab">iso639-3/iso-639-3_Name_Index_20090210.tab</a>>.
[<a id="ref-iso639-5.tab.txt">iso639-5.tab.txt</a>]
International Organization for Standardization, "ISO
639-5 code list, Tab-delimited text", February 2009,
<<a href="http://www.loc.gov/standards/iso639-5/iso639-5.tab.txt">http://www.loc.gov/standards/iso639-5/iso639-5.tab.txt</a>>.
<span id="page-12" ></span>
<span class="h3"><a class="selflink" id="section-6.2" href="#section-6.2">6.2</a>. Informative References</span>
[<a id="ref-ISO15924">ISO15924</a>] International Organization for Standardization, "ISO
15924:2004. Information and documentation -- Codes for
the representation of names of scripts", January 2004.
[<a id="ref-ISO3166-1">ISO3166-1</a>] International Organization for Standardization, "ISO
3166-1:2006. Codes for the representation of names of
countries and their subdivisions -- Part 1: Country
codes", November 2006.
[<a id="ref-ISO639-1">ISO639-1</a>] International Organization for Standardization, "ISO 639-
1:2002. Codes for the representation of names of
languages -- Part 1: Alpha-2 code", July 2002.
[<a id="ref-ISO639-2">ISO639-2</a>] International Organization for Standardization, "ISO 639-
2:1998. Codes for the representation of names of
languages -- Part 2: Alpha-3 code", October 1998.
[<a id="ref-RFC1766">RFC1766</a>] Alvestrand, H., "Tags for the Identification of
Languages", <a href="./rfc1766">RFC 1766</a>, March 1995.
[<a id="ref-RFC3066">RFC3066</a>] Alvestrand, H., "Tags for the Identification of
Languages", <a href="./rfc3066">RFC 3066</a>, January 2001.
[<a id="ref-RFC4645">RFC4645</a>] Ewell, D., "Initial Language Subtag Registry", <a href="./rfc4645">RFC 4645</a>,
September 2006.
[<a id="ref-RFC4646">RFC4646</a>] Phillips, A. and M. Davis, "Tags for Identifying
Languages", <a href="https://www.rfc-editor.org/bcp/bcp47">BCP 47</a>, <a href="./rfc4646">RFC 4646</a>, September 2006.
[<a id="ref-UN_M.49">UN_M.49</a>] Statistics Division, United Nations, "Standard Country or
Area Codes for Statistical Use", Revision 4 (United
Nations publication, Sales No. 98.XVII.9), June 1999.
<span id="page-13" ></span>
<span class="h2"><a class="selflink" id="appendix-A" href="#appendix-A">Appendix A</a>. Acknowledgements</span>
This memo is a collaborative work of the Language Tag Registry Update
(LTRU) Working Group. All of its members have made significant
contributions to this memo and to its predecessor, [<a href="./rfc4645" title=""Initial Language Subtag Registry"">RFC4645</a>].
Specific contributions to this memo were made by Stephane Bortzmeyer,
John Cowan, Mark Davis, Martin Duerst, Frank Ellermann, Debbie
Garside, Kent Karlsson, Gerard Lang, Addison Phillips, Randy Presuhn,
and CE Whitehead.
Author's Address
Doug Ewell (editor)
Consultant
EMail: [email protected]
URI: <a href="http://www.ewellic.org">http://www.ewellic.org</a>