4645
INFORMATIONAL
Initial Language Subtag Registry
Authors: D. Ewell
Date: September 2006
Area: app
Working Group: ltru
Stream: IETF
Abstract
This memo defined the initial contents of the IANA Language Subtag Registry for use in forming tags for the identification of languages. Since the contents of this memo only served as a starting point for the registry, its actual contents have been removed before publication to avoid confusion. This memo provides information for the Internet community.
RFC 4645
INFORMATIONAL
Errata Exist
Network Working Group D. Ewell, Ed.
Request for Comments: 4645 Consultant
Category: Informational September 2006
<span class="h1">Initial Language Subtag Registry</span>
Status of This Memo
This memo provides information for the Internet community. It does
not specify an Internet standard of any kind. Distribution of this
memo is unlimited.
Copyright Notice
Copyright (C) The Internet Society (2006).
Abstract
This memo defined the initial contents of the IANA Language Subtag
Registry for use in forming tags for the identification of languages.
Since the contents of this memo only served as a starting point for
the registry, its actual contents have been removed before
publication to avoid confusion.
Table of Contents
<a href="#section-1">1</a>. Introduction ....................................................<a href="#page-2">2</a>
<a href="#section-2">2</a>. Initialization of the Registry ..................................<a href="#page-2">2</a>
<a href="#section-3">3</a>. Initial Registry Contents .......................................<a href="#page-5">5</a>
<a href="#section-4">4</a>. Omitted Code Elements ...........................................<a href="#page-5">5</a>
<a href="#section-5">5</a>. Security Considerations .........................................<a href="#page-5">5</a>
<a href="#section-6">6</a>. IANA Considerations .............................................<a href="#page-5">5</a>
<a href="#section-7">7</a>. References ......................................................<a href="#page-6">6</a>
<a href="#section-7.1">7.1</a>. Normative References .......................................<a href="#page-6">6</a>
<a href="#section-7.2">7.2</a>. Informative References .....................................<a href="#page-6">6</a>
<span class="grey">Ewell Informational [Page 1]</span>
<span id="page-2" ></span>
<span class="grey"><a href="./rfc4645">RFC 4645</a> Initial Language Subtag Registry September 2006</span>
<span class="h2"><a class="selflink" id="section-1" href="#section-1">1</a>. Introduction</span>
[<a id="ref-RFC4646">RFC4646</a>] provides for a Language Subtag Registry and describes its
format. This memo defines the initial contents of the IANA Language
Subtag Registry, using the criteria described in <a href="#section-2">Section 2</a>.
The Language Subtag Registry is formatted in a modified record-jar
text format, as described in [<a href="#ref-record-jar" title=""The Art of Unix Programming"">record-jar</a>]. The specific format of
the registry, and the definition and intended purpose of each of the
fields, are described in [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>].
The registry is expected to change over time, as new subtags are
registered and existing subtags are modified or deprecated. The
process of updating the registry is described in <a href="./rfc4646#section-3">Section 3 of
[RFC4646]</a>. This memo does not define the permanent contents of the
registry and should not be represented as doing so.
Many of the subtags defined in this registry are based on code
elements defined in [<a href="#ref-ISO639-1" title=""ISO 639-1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code"">ISO639-1</a>], [<a href="#ref-ISO639-2" title=""ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition"">ISO639-2</a>], [<a href="#ref-ISO15924" title=""ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts"">ISO15924</a>], [<a href="#ref-ISO3166-1" title=""ISO 3166:1988. Codes for the representation of names of countries, 3rd edition"">ISO3166-1</a>],
and [<a href="#ref-UN_M.49" title=""Standard Country or Area Codes for Statistical Use"">UN_M.49</a>]. This registry is not a mirror of the code lists
defined by these standards and should not be used as one.
<span class="h2"><a class="selflink" id="section-2" href="#section-2">2</a>. Initialization of the Registry</span>
<a href="./rfc4646#section-3.7">Section 3.7 of [RFC4646]</a> requires that the LTRU working group create
an initial version of the Language Subtag Registry and populate it
with the initial set of subtags. This involves converting the
entries from the existing IANA language tag registry defined by
[<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] to the new format, as well as defining valid subtags from
various source standards. This section describes the process that
was used to create the initial registry entries.
The initial set of records was based on the following standards:
[<a href="#ref-ISO639-1" title=""ISO 639-1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code"">ISO639-1</a>], [<a href="#ref-ISO639-2" title=""ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition"">ISO639-2</a>], [<a href="#ref-ISO15924" title=""ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts"">ISO15924</a>], and [<a href="#ref-ISO3166-1" title=""ISO 3166:1988. Codes for the representation of names of countries, 3rd edition"">ISO3166-1</a>]. The following
criteria were used to select and format the records of the subtags
included in the initial Language Subtag Registry (hereafter "ILSR"):
1. For each source standard, the date of the standard referenced
in [<a href="./rfc1766" title=""Tags for the Identification of Languages"">RFC1766</a>] was selected as the starting date. Code elements
that were valid on that date in the selected standard were
added to the ILSR. Code elements that were previously
assigned, but that were vacated or withdrawn before that date,
were not added to the ILSR.
2. For each successive change to the standard, any additional
assignments up to the date of the adoption of [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] were
added to the ILSR. Values that have been withdrawn are marked
as deprecated, but not removed. Changes in meaning or
<span class="grey">Ewell Informational [Page 2]</span>
<span id="page-3" ></span>
<span class="grey"><a href="./rfc4645">RFC 4645</a> Initial Language Subtag Registry September 2006</span>
assignment of a subtag were permitted during this process (for
example, the [<a href="#ref-ISO3166-1" title=""ISO 3166:1988. Codes for the representation of names of countries, 3rd edition"">ISO3166-1</a>] code element 'CS' was originally
assigned to Czechoslovakia and is now assigned to Serbia and
Montenegro).
Code elements from [<a href="#ref-UN_M.49" title=""Standard Country or Area Codes for Statistical Use"">UN_M.49</a>] were also included in the ILSR using the
criteria above, with the following additional rules:
3. UN numeric code elements assigned to "macro-geographical
(continental)" regions as of the date of adoption of [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>]
were added to the ILSR and thereby made valid for use in
language tags.
4. The UN numeric code elements for "economic groupings" or
"other groupings," and the alphanumeric code elements in
<a href="#appendix-X">Appendix X</a> of the UN document, were not added to the ILSR.
5. The UN numeric code elements for countries or areas not
associated with an assigned [<a href="#ref-ISO3166-1" title=""ISO 3166:1988. Codes for the representation of names of countries, 3rd edition"">ISO3166-1</a>] alpha-2 code element
were not added to the ILSR. These values are listed in
<a href="#section-4">Section 4</a> and may be requested for registration by individuals
using the process defined in [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] and according to the
rules described therein. Listing of these code elements in
this section is not a guarantee of future registration.
6. Code elements that were withdrawn, vacated, or deprecated from
[<a href="#ref-UN_M.49" title=""Standard Country or Area Codes for Statistical Use"">UN_M.49</a>] as of the date of adoption of [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] were not
added to the ILSR.
Using the initial set of subtags described above, the tags in the
[<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] registry were evaluated as follows:
7. Tags in the [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] registry that were not deprecated,
consisted entirely of subtags already in this document, and
have the correct form and format for tags defined by [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>]
were converted to records of type "redundant" in the ILSR.
For example, "zh-Hant" is now defined by [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] because
'zh' is an [<a href="#ref-ISO639-1" title=""ISO 639-1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code"">ISO639-1</a>] code element and 'Hant' is an [<a href="#ref-ISO15924" title=""ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts"">ISO15924</a>]
code element, and both are defined as subtags in the ILSR.
8. Tags in the [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] registry that contained one or more
subtags that either did not match the valid registration
pattern or were not otherwise defined by [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] were
converted to corresponding records of type "grandfathered" in
the ILSR. These records cannot become type "redundant" except
by revision of [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>], but may have a "Deprecated" and
<span class="grey">Ewell Informational [Page 3]</span>
<span id="page-4" ></span>
<span class="grey"><a href="./rfc4645">RFC 4645</a> Initial Language Subtag Registry September 2006</span>
"Preferred-Value" field added to them if a subsequent subtag
assignment or combination of assignments renders the tag
obsolete.
9. Tags in the [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] registry that had a notation that they
were deprecated were converted to records of type
"grandfathered" in the ILSR. The record for the grandfathered
entry contains a "Deprecated" field with the most appropriate
date that can be determined for when the [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] record was
deprecated. The "Comments" field may optionally contain a
reason for the deprecation. The "Preferred-Value" field
contains a tag that replaces the value. For example, the
[<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] tag "art-lojban" is deprecated and thus appears as a
grandfathered tag in the ILSR. Its "Deprecated" field
contains the deprecation date (in this case "2003-09-02") and
the "Preferred-Value" field the value "jbo".
10. The remaining tags in the [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] registry are not
deprecated and have a format consistent with language tags as
defined by [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] but contain subtags that are not defined
in the ILSR. These subtags are eligible for registration as
variants. The ILSR contains appropriate variant records for
the following list of subtags, and the registered [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>]
tags containing these subtags were entered into the ILSR as
type "redundant":
1901 (use with Prefix: de)
1996 (use with Prefix: de)
nedis (use with Prefix: sl)
rozaj (use with Prefix: sl)
11. All remaining [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] registered tags were converted to
records of type "grandfathered" in the ILSR. Interested
parties may use the registration process in [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>] to
attempt to register the variant subtags not already present in
the Language Subtag Registry. If all of the subtags in the
original tag become fully defined by the resulting
registrations, then the original tag is superseded. Such tags
will have their record changed from type "grandfathered" to
type "redundant" in the registry. Note that previous approval
of a tag under [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] is not a guarantee of approval of a
variant subtag under [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>]. The existing [<a href="./rfc3066" title=""Tags for the Identification of Languages"">RFC3066</a>] tag
maintains its validity, but the original reason for its
registration might have become obsolete.
<span class="grey">Ewell Informational [Page 4]</span>
<span id="page-5" ></span>
<span class="grey"><a href="./rfc4645">RFC 4645</a> Initial Language Subtag Registry September 2006</span>
<span class="h2"><a class="selflink" id="section-3" href="#section-3">3</a>. Initial Registry Contents</span>
The remainder of this section specified the initial set of records
for the registry. This material was deleted on publication of this
memo, to avoid any potential confusion with the registry itself. The
IANA language subtag registry can be found at
<<a href="http://www.iana.org/numbers.html">http://www.iana.org/numbers.html</a>> under "Language Tags".
<span class="h2"><a class="selflink" id="section-4" href="#section-4">4</a>. Omitted Code Elements</span>
The following code elements from [<a href="#ref-UN_M.49" title=""Standard Country or Area Codes for Statistical Use"">UN_M.49</a>] were not associated with
[<a href="#ref-ISO3166-1" title=""ISO 3166:1988. Codes for the representation of names of countries, 3rd edition"">ISO3166-1</a>] alpha-2 code elements. Consequently, they were not
assigned as subtags in the initial Language Subtag Registry, but were
valid candidates for registration as region subtags, using the
process in [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>]:
830 Channel Islands
831 Guernsey
832 Jersey
833 Isle of Man
The last three became ineligible for registration in April, 2006,
when the [<a href="#ref-ISO3166-1" title=""ISO 3166:1988. Codes for the representation of names of countries, 3rd edition"">ISO3166-1</a>] code elements GG, JE, and IM were assigned as
region subtags.
<span class="h2"><a class="selflink" id="section-5" href="#section-5">5</a>. Security Considerations</span>
This document specifies the initial contents to be used by IANA in
populating the Language Subtag Registry. For security considerations
relevant to that registry and the use of language tags, see
[<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>].
<span class="h2"><a class="selflink" id="section-6" href="#section-6">6</a>. IANA Considerations</span>
This document points to the initial content for the Language Subtag
Registry which is maintained by the IANA. The IANA language subtag
registry can be found at <<a href="http://www.iana.org/numbers.html">http://www.iana.org/numbers.html</a>> under
"Language Tags". For details on the procedures for the format and
ongoing maintenance of this registry, see [<a href="./rfc4646" title=""Tags for Identifying Languages"">RFC4646</a>].
<span class="grey">Ewell Informational [Page 5]</span>
<span id="page-6" ></span>
<span class="grey"><a href="./rfc4645">RFC 4645</a> Initial Language Subtag Registry September 2006</span>
<span class="h2"><a class="selflink" id="section-7" href="#section-7">7</a>. References</span>
<span class="h3"><a class="selflink" id="section-7.1" href="#section-7.1">7.1</a>. Normative References</span>
[<a id="ref-RFC4646">RFC4646</a>] Phillips, A., Ed. and M. Davis, Ed., "Tags for
Identifying Languages", <a href="https://www.rfc-editor.org/bcp/bcp47">BCP 47</a>, <a href="./rfc4646">RFC 4646</a>, September
2006.
<span class="h3"><a class="selflink" id="section-7.2" href="#section-7.2">7.2</a>. Informative References</span>
[<a id="ref-ISO15924">ISO15924</a>] International Organization for Standardization, "ISO
15924:2004. Information and documentation -- Codes for
the representation of names of scripts", January 2004.
[<a id="ref-ISO3166-1">ISO3166-1</a>] International Organization for Standardization, "ISO
3166:1988. Codes for the representation of names of
countries, 3rd edition", August 1988.
[<a id="ref-ISO639-1">ISO639-1</a>] International Organization for Standardization, "ISO
639-1:2002. Codes for the representation of names of
languages -- Part 1: Alpha-2 code", 2002.
[<a id="ref-ISO639-2">ISO639-2</a>] International Organization for Standardization, "ISO
639-2:1998. Codes for the representation of names of
languages -- Part 2: Alpha-3 code, first edition",
1998.
[<a id="ref-RFC1766">RFC1766</a>] Alvestrand, H., "Tags for the Identification of
Languages", <a href="./rfc1766">RFC 1766</a>, March 1995.
[<a id="ref-RFC3066">RFC3066</a>] Alvestrand, H., "Tags for the Identification of
Languages", <a href="https://www.rfc-editor.org/bcp/bcp47">BCP 47</a>, <a href="./rfc3066">RFC 3066</a>, January 2001.
[<a id="ref-UN_M.49">UN_M.49</a>] Statistics Division, United Nations, "Standard Country
or Area Codes for Statistical Use", UN Standard Country
or Area Codes for Statistical Use, Revision 4 (United
Nations publication, Sales No. 98.XVII.9, June 1999.
[<a id="ref-record-jar">record-jar</a>] Raymond, E., "The Art of Unix Programming", 2003.
Author's Address
Doug Ewell (Editor)
Consultant
EMail: [email protected]
URI: <a href="http://users.adelphia.net/~dewell">http://users.adelphia.net/~dewell</a>
<span class="grey">Ewell Informational [Page 6]</span>
<span id="page-7" ></span>
<span class="grey"><a href="./rfc4645">RFC 4645</a> Initial Language Subtag Registry September 2006</span>
Full Copyright Statement
Copyright (C) The Internet Society (2006).
This document is subject to the rights, licenses and restrictions
contained in <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a>, and except as set forth therein, the authors
retain all their rights.
This document and the information contained herein are provided on an
"AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET
ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE
INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Intellectual Property
The IETF takes no position regarding the validity or scope of any
Intellectual Property Rights or other rights that might be claimed to
pertain to the implementation or use of the technology described in
this document or the extent to which any license under such rights
might or might not be available; nor does it represent that it has
made any independent effort to identify any such rights. Information
on the procedures with respect to rights in RFC documents can be
found in <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a> and <a href="https://www.rfc-editor.org/bcp/bcp79">BCP 79</a>.
Copies of IPR disclosures made to the IETF Secretariat and any
assurances of licenses to be made available, or the result of an
attempt made to obtain a general license or permission for the use of
such proprietary rights by implementers or users of this
specification can be obtained from the IETF on-line IPR repository at
<a href="http://www.ietf.org/ipr">http://www.ietf.org/ipr</a>.
The IETF invites any interested party to bring to its attention any
copyrights, patents or patent applications, or other proprietary
rights that may cover technology that may be required to implement
this standard. Please address the information to the IETF at
[email protected].
Acknowledgement
Funding for the RFC Editor function is provided by the IETF
Administrative Support Activity (IASA).
Ewell Informational [Page 7]
Annotations
Select text to annotate