blob: ee4eadaab50a465ffe68e7b6ccc4153e4115da4f (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
|
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html401/loose.dtd">
<html>
<!-- Created on October, 16 2024 by texi2html 1.78a -->
<!--
Written by: Lionel Cons <Lionel.Cons@cern.ch> (original author)
Karl Berry <karl@freefriends.org>
Olaf Bachmann <obachman@mathematik.uni-kl.de>
and many others.
Maintained by: Many creative people.
Send bugs and suggestions to <texi2html-bug@nongnu.org>
-->
<head>
<title>GNU libunistring: GNU libunistring</title>
<meta name="description" content="GNU libunistring: GNU libunistring">
<meta name="keywords" content="GNU libunistring: GNU libunistring">
<meta name="resource-type" content="document">
<meta name="distribution" content="global">
<meta name="Generator" content="texi2html 1.78a">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<style type="text/css">
<!--
a.summary-letter {text-decoration: none}
pre.display {font-family: serif}
pre.format {font-family: serif}
pre.menu-comment {font-family: serif}
pre.menu-preformatted {font-family: serif}
pre.smalldisplay {font-family: serif; font-size: smaller}
pre.smallexample {font-size: smaller}
pre.smallformat {font-family: serif; font-size: smaller}
pre.smalllisp {font-size: smaller}
span.roman {font-family:serif; font-weight:normal;}
span.sansserif {font-family:sans-serif; font-weight:normal;}
ul.toc {list-style: none}
-->
</style>
</head>
<body lang="en" bgcolor="#FFFFFF" text="#000000" link="#0000FF" vlink="#800080" alink="#FF0000">
<table cellpadding="1" cellspacing="1" border="0">
<tr><td valign="middle" align="left">[<a href="#SEC_Top" title="Cover (top) of document">Top</a>]</td>
<td valign="middle" align="left">[<a href="#SEC_Contents" title="Table of contents">Contents</a>]</td>
<td valign="middle" align="left">[<a href="libunistring_21.html#SEC94" title="Index">Index</a>]</td>
<td valign="middle" align="left">[<a href="libunistring_abt.html#SEC_About" title="About (help)"> ? </a>]</td>
</tr></table>
<h1 class="settitle">GNU libunistring</h1>
<a name="SEC_Contents"></a>
<h1>Table of Contents</h1>
<div class="contents">
<ul class="toc">
<li><a name="TOC1" href="libunistring_1.html#SEC1">1. Introduction</a>
<ul class="toc">
<li><a name="TOC2" href="libunistring_1.html#SEC2">1.1 Unicode</a></li>
<li><a name="TOC3" href="libunistring_1.html#SEC3">1.2 Unicode and Internationalization</a></li>
<li><a name="TOC4" href="libunistring_1.html#SEC4">1.3 Locale encodings</a></li>
<li><a name="TOC5" href="libunistring_1.html#SEC5">1.4 Choice of in-memory representation of strings</a></li>
<li><a name="TOC6" href="libunistring_1.html#SEC6">1.5 ‘<samp>char *</samp>’ strings</a></li>
<li><a name="TOC7" href="libunistring_1.html#SEC7">1.6 Unicode strings</a></li>
</ul></li>
<li><a name="TOC8" href="libunistring_2.html#SEC8">2. Conventions</a></li>
<li><a name="TOC9" href="libunistring_3.html#SEC9">3. Elementary types <code><unitypes.h></code></a></li>
<li><a name="TOC10" href="libunistring_4.html#SEC10">4. Elementary Unicode string functions <code><unistr.h></code></a>
<ul class="toc">
<li><a name="TOC11" href="libunistring_4.html#SEC11">4.1 Elementary string checks</a></li>
<li><a name="TOC12" href="libunistring_4.html#SEC12">4.2 Elementary string conversions</a></li>
<li><a name="TOC13" href="libunistring_4.html#SEC13">4.3 Elementary string functions</a>
<ul class="toc">
<li><a name="TOC14" href="libunistring_4.html#SEC14">4.3.1 Iterating over a Unicode string</a></li>
<li><a name="TOC15" href="libunistring_4.html#SEC15">4.3.2 Creating Unicode strings one character at a time</a></li>
<li><a name="TOC16" href="libunistring_4.html#SEC16">4.3.3 Copying Unicode strings</a></li>
<li><a name="TOC17" href="libunistring_4.html#SEC17">4.3.4 Comparing Unicode strings</a></li>
<li><a name="TOC18" href="libunistring_4.html#SEC18">4.3.5 Searching for a character in a Unicode string</a></li>
<li><a name="TOC19" href="libunistring_4.html#SEC19">4.3.6 Counting the characters in a Unicode string</a></li>
</ul></li>
<li><a name="TOC20" href="libunistring_4.html#SEC20">4.4 Elementary string functions with memory allocation</a></li>
<li><a name="TOC21" href="libunistring_4.html#SEC21">4.5 Elementary string functions on NUL terminated strings</a>
<ul class="toc">
<li><a name="TOC22" href="libunistring_4.html#SEC22">4.5.1 Iterating over a NUL terminated Unicode string</a></li>
<li><a name="TOC23" href="libunistring_4.html#SEC23">4.5.2 Length of a NUL terminated Unicode string</a></li>
<li><a name="TOC24" href="libunistring_4.html#SEC24">4.5.3 Copying a NUL terminated Unicode string</a></li>
<li><a name="TOC25" href="libunistring_4.html#SEC25">4.5.4 Comparing NUL terminated Unicode strings</a></li>
<li><a name="TOC26" href="libunistring_4.html#SEC26">4.5.5 Duplicating a NUL terminated Unicode string</a></li>
<li><a name="TOC27" href="libunistring_4.html#SEC27">4.5.6 Searching for a character in a NUL terminated Unicode string</a></li>
<li><a name="TOC28" href="libunistring_4.html#SEC28">4.5.7 Searching for a substring in a NUL terminated Unicode string</a></li>
<li><a name="TOC29" href="libunistring_4.html#SEC29">4.5.8 Tokenizing a NUL terminated Unicode string</a></li>
</ul>
</li>
</ul></li>
<li><a name="TOC30" href="libunistring_5.html#SEC30">5. Conversions between Unicode and encodings <code><uniconv.h></code></a></li>
<li><a name="TOC31" href="libunistring_6.html#SEC31">6. Output with Unicode strings <code><unistdio.h></code></a></li>
<li><a name="TOC32" href="libunistring_7.html#SEC32">7. Names of Unicode characters <code><uniname.h></code></a></li>
<li><a name="TOC33" href="libunistring_8.html#SEC33">8. Unicode character classification and properties <code><unictype.h></code></a>
<ul class="toc">
<li><a name="TOC34" href="libunistring_8.html#SEC34">8.1 General category</a>
<ul class="toc">
<li><a name="TOC35" href="libunistring_8.html#SEC35">8.1.1 The object oriented API for general category</a></li>
<li><a name="TOC36" href="libunistring_8.html#SEC36">8.1.2 The bit mask API for general category</a></li>
</ul></li>
<li><a name="TOC37" href="libunistring_8.html#SEC37">8.2 Canonical combining class</a></li>
<li><a name="TOC38" href="libunistring_8.html#SEC38">8.3 Bidi class</a></li>
<li><a name="TOC39" href="libunistring_8.html#SEC39">8.4 Decimal digit value</a></li>
<li><a name="TOC40" href="libunistring_8.html#SEC40">8.5 Digit value</a></li>
<li><a name="TOC41" href="libunistring_8.html#SEC41">8.6 Numeric value</a></li>
<li><a name="TOC42" href="libunistring_8.html#SEC42">8.7 Mirrored character</a></li>
<li><a name="TOC43" href="libunistring_8.html#SEC43">8.8 Arabic shaping</a>
<ul class="toc">
<li><a name="TOC44" href="libunistring_8.html#SEC44">8.8.1 Joining type of Arabic characters</a></li>
<li><a name="TOC45" href="libunistring_8.html#SEC45">8.8.2 Joining group of Arabic characters</a></li>
</ul></li>
<li><a name="TOC46" href="libunistring_8.html#SEC46">8.9 Properties</a>
<ul class="toc">
<li><a name="TOC47" href="libunistring_8.html#SEC47">8.9.1 Properties as objects – the object oriented API</a></li>
<li><a name="TOC48" href="libunistring_8.html#SEC48">8.9.2 Properties as functions – the functional API</a></li>
</ul></li>
<li><a name="TOC49" href="libunistring_8.html#SEC49">8.10 Other attributes</a>
<ul class="toc">
<li><a name="TOC50" href="libunistring_8.html#SEC50">8.10.1 Indic conjunct break</a></li>
</ul></li>
<li><a name="TOC51" href="libunistring_8.html#SEC51">8.11 Scripts</a></li>
<li><a name="TOC52" href="libunistring_8.html#SEC52">8.12 Blocks</a></li>
<li><a name="TOC53" href="libunistring_8.html#SEC53">8.13 ISO C and Java syntax</a></li>
<li><a name="TOC54" href="libunistring_8.html#SEC54">8.14 Classifications like in ISO C</a></li>
</ul></li>
<li><a name="TOC55" href="libunistring_9.html#SEC55">9. Display width <code><uniwidth.h></code></a></li>
<li><a name="TOC56" href="libunistring_10.html#SEC56">10. Grapheme cluster breaks in strings <code><unigbrk.h></code></a>
<ul class="toc">
<li><a name="TOC57" href="libunistring_10.html#SEC57">10.1 Grapheme cluster breaks in a string</a></li>
<li><a name="TOC58" href="libunistring_10.html#SEC58">10.2 Grapheme cluster break property</a></li>
</ul></li>
<li><a name="TOC59" href="libunistring_11.html#SEC59">11. Word breaks in strings <code><uniwbrk.h></code></a>
<ul class="toc">
<li><a name="TOC60" href="libunistring_11.html#SEC60">11.1 Word breaks in a string</a></li>
<li><a name="TOC61" href="libunistring_11.html#SEC61">11.2 Word break property</a></li>
</ul></li>
<li><a name="TOC62" href="libunistring_12.html#SEC62">12. Line breaking <code><unilbrk.h></code></a></li>
<li><a name="TOC63" href="libunistring_13.html#SEC63">13. Normalization forms (composition and decomposition) <code><uninorm.h></code></a>
<ul class="toc">
<li><a name="TOC64" href="libunistring_13.html#SEC64">13.1 Decomposition of Unicode characters</a></li>
<li><a name="TOC65" href="libunistring_13.html#SEC65">13.2 Composition of Unicode characters</a></li>
<li><a name="TOC66" href="libunistring_13.html#SEC66">13.3 Normalization of strings</a></li>
<li><a name="TOC67" href="libunistring_13.html#SEC67">13.4 Normalizing comparisons</a></li>
<li><a name="TOC68" href="libunistring_13.html#SEC68">13.5 Normalization of streams of Unicode characters</a></li>
</ul></li>
<li><a name="TOC69" href="libunistring_14.html#SEC69">14. Case mappings <code><unicase.h></code></a>
<ul class="toc">
<li><a name="TOC70" href="libunistring_14.html#SEC70">14.1 Case mappings of characters</a></li>
<li><a name="TOC71" href="libunistring_14.html#SEC71">14.2 Case mappings of strings</a></li>
<li><a name="TOC72" href="libunistring_14.html#SEC72">14.3 Case mappings of substrings</a></li>
<li><a name="TOC73" href="libunistring_14.html#SEC73">14.4 Case insensitive comparison</a></li>
<li><a name="TOC74" href="libunistring_14.html#SEC74">14.5 Case detection</a></li>
</ul></li>
<li><a name="TOC75" href="libunistring_15.html#SEC75">15. Regular expressions <code><uniregex.h></code></a></li>
<li><a name="TOC76" href="libunistring_16.html#SEC76">16. Using the library</a>
<ul class="toc">
<li><a name="TOC77" href="libunistring_16.html#SEC77">16.1 Installation</a></li>
<li><a name="TOC78" href="libunistring_16.html#SEC78">16.2 Compiler options</a></li>
<li><a name="TOC79" href="libunistring_16.html#SEC79">16.3 Include files</a></li>
<li><a name="TOC80" href="libunistring_16.html#SEC80">16.4 Autoconf macro</a></li>
<li><a name="TOC81" href="libunistring_16.html#SEC81">16.5 Reporting problems</a></li>
</ul></li>
<li><a name="TOC82" href="libunistring_17.html#SEC82">17. More advanced functionality</a></li>
<li><a name="TOC83" href="libunistring_18.html#SEC83">A. The <code>wchar_t</code> mess</a></li>
<li><a name="TOC84" href="libunistring_19.html#SEC84">B. The <code>char32_t</code> problem</a></li>
<li><a name="TOC85" href="libunistring_20.html#SEC85">C. Licenses</a>
<ul class="toc">
<li><a name="TOC86" href="libunistring_20.html#SEC86">C.1 GNU GENERAL PUBLIC LICENSE</a></li>
<li><a name="TOC87" href="libunistring_20.html#SEC91">C.2 GNU LESSER GENERAL PUBLIC LICENSE</a></li>
<li><a name="TOC88" href="libunistring_20.html#SEC92">C.3 GNU Free Documentation License</a></li>
</ul></li>
<li><a name="TOC89" href="libunistring_21.html#SEC94">Index</a></li>
</ul>
</div>
<a name="Top"></a>
<a name="SEC_Top"></a>
<hr size="1">
<table cellpadding="1" cellspacing="1" border="0">
<tr><td valign="middle" align="left">[<a href="#SEC_Top" title="Cover (top) of document">Top</a>]</td>
<td valign="middle" align="left">[<a href="#SEC_Contents" title="Table of contents">Contents</a>]</td>
<td valign="middle" align="left">[<a href="libunistring_21.html#SEC94" title="Index">Index</a>]</td>
<td valign="middle" align="left">[<a href="libunistring_abt.html#SEC_About" title="About (help)"> ? </a>]</td>
</tr></table>
<p>
<font size="-1">
This document was generated by <em>Bruno Haible</em> on <em>October, 16 2024</em> using <a href="https://www.nongnu.org/texi2html/"><em>texi2html 1.78a</em></a>.
</font>
<br>
</p>
</body>
</html>
|