
Chapter 1. Unicode 0.1 preview 4

Mathias Gaunard

Distributed under the Boost Software License, Version 1.0. (See accompanying file LICENSE_1_0.txt or copy at http://www.boost.org/LICENSE_1_0.txt)

Table of Contents

Preface
Motivation
Introduction to Unicode
Notion of character
Character set
Encodings
Combining character sequences
Grapheme clusters
Normalization
Other operations
Character properties
Overview
Components
Organization
Linking
Converters and Segmenters
Concepts
Converting and segmenting
Combining converters
Codecvt facets
User's Guide
UTF converters and segmenters
Composition and Normalization
String searching algorithms
Examples
convert
characters
compose
search
source_input
Iterator/Range reference
Concepts
Header <boost/iterator/codecvt_converter.hpp>
Header <boost/iterator/convert_iterator.hpp>
Header <boost/iterator/convert_iterator_fwd.hpp>
Header <boost/iterator/converter_codecvt.hpp>
Header <boost/iterator/converter_concept.hpp>
Header <boost/iterator/segment_iterator.hpp>
Header <boost/iterator/segment_iterator_fwd.hpp>
Header <boost/iterator/segmenter_concept.hpp>
Unicode reference
Header <boost/cuchar.hpp>
Header <boost/unicode/cat.hpp>
Header <boost/unicode/codecvt.hpp>
Header <boost/unicode/combining.hpp>
Header <boost/unicode/compose.hpp>
Header <boost/unicode/ucd/compose.hpp>
Header <boost/unicode/compose_fwd.hpp>
Header <boost/unicode/graphemes.hpp>
Header <boost/unicode/hangul.hpp>
Header <boost/unicode/search.hpp>
Header <boost/unicode/static_utf.hpp>
Header <boost/unicode/static_utf_codecs.hpp>
Header <boost/unicode/string_cp.hpp>
Header <boost/unicode/surrogates.hpp>
Header <boost/unicode/ucd/block_types.hpp>
Header <boost/unicode/ucd/properties.hpp>
Header <boost/unicode/ucd/properties_types.hpp>
Header <boost/unicode/utf.hpp>
Header <boost/unicode/utf_codecs.hpp>
Appendices
Appendix A: Unicode in source files
Appendix B: Rationale
Appendix C: Future Work
Appendix D: Acknowledgements

Unicode is the industry standard to consistently represent and manipulate text across most of the world's writing systems.

Description

This library aims to provide the foundational tools to accurately represent and process natural text in C++ in a portable and robust manner, enabling internationalized applications, by implementing parts of the Unicode Standard.

This library is environment-independent and deliberately chooses not to rely on the standard C++ locale facilities or the standard string facilities, which were judged ill-suited for Unicode. It does, however, provide tools that can be used with them.

The current version is locale-agnostic, but a subsystem for tailored locale behaviour may be added in the future.

Warning

Boost.Unicode is a library in development and is not part of Boost.

How to use this manual

Icons mark certain topics to indicate their relevance. An icon preceding a passage has the following meaning:

Table 1.1. Icons

Icon     Name     Meaning
note     Note     Information provided is auxiliary but will give the reader a deeper insight into a specific topic. May be skipped.
alert    Alert    Information provided is of utmost importance.
tip      Tip      A potentially useful and helpful piece of information.


Last revised: August 01, 2010 at 21:53:15 GMT

