unicode.h File Reference

Unicode and UTF-8 related classes and functions.

Classes

class  Xapian::Utf8Iterator
 An iterator which returns Unicode character values from a UTF-8 encoded string.
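
 To illustrate what "returning Unicode character values from a UTF-8 encoded string" involves, here is a minimal standalone sketch. It does not use or reproduce Xapian::Utf8Iterator; the function name decode_utf8 is hypothetical, and the choice to pass malformed bytes through as single code points is an assumption of this sketch rather than documented library behaviour.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, not part of Xapian: decode a UTF-8 string into a
// vector of Unicode code points.
//
// Only basic checks are made: the lead byte must be in a valid range and
// every following byte must be a continuation byte (10xxxxxx). Overlong
// three- and four-byte forms, surrogates and values above U+10FFFF are
// not rejected. A byte that fails the checks is emitted as a code point
// equal to its own value (an assumption made for this sketch).
std::vector<unsigned> decode_utf8(const std::string& s) {
    std::vector<unsigned> out;
    std::size_t i = 0;
    while (i < s.size()) {
        unsigned char b0 = static_cast<unsigned char>(s[i]);
        std::size_t len = 1;
        unsigned cp = b0;
        if (b0 >= 0xC2 && b0 <= 0xDF) {
            len = 2;
            cp = b0 & 0x1F;
        } else if (b0 >= 0xE0 && b0 <= 0xEF) {
            len = 3;
            cp = b0 & 0x0F;
        } else if (b0 >= 0xF0 && b0 <= 0xF4) {
            len = 4;
            cp = b0 & 0x07;
        }
        // Fold in the continuation bytes, six payload bits at a time.
        bool ok = (i + len <= s.size());
        for (std::size_t k = 1; ok && k < len; ++k) {
            unsigned char b = static_cast<unsigned char>(s[i + k]);
            if ((b & 0xC0) != 0x80) {
                ok = false;
            } else {
                cp = (cp << 6) | (b & 0x3F);
            }
        }
        if (!ok) {
            // Truncated or malformed sequence: fall back to the raw byte.
            cp = b0;
            len = 1;
        }
        out.push_back(cp);
        i += len;
    }
    return out;
}
```

 An iterator differs from this sketch mainly in producing one value per increment instead of materialising the whole vector.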
 

Namespaces

 Xapian
 The Xapian namespace contains public interfaces for the Xapian library.
 
 Xapian::Unicode
 Functions associated with handling Unicode characters.
 

Enumerations

enum  Xapian::Unicode::category {
  UNASSIGNED, UPPERCASE_LETTER, LOWERCASE_LETTER, TITLECASE_LETTER,
  MODIFIER_LETTER, OTHER_LETTER, NON_SPACING_MARK, ENCLOSING_MARK,
  COMBINING_SPACING_MARK, DECIMAL_DIGIT_NUMBER, LETTER_NUMBER, OTHER_NUMBER,
  SPACE_SEPARATOR, LINE_SEPARATOR, PARAGRAPH_SEPARATOR, CONTROL,
  FORMAT, PRIVATE_USE, SURROGATE, CONNECTOR_PUNCTUATION,
  DASH_PUNCTUATION, OPEN_PUNCTUATION, CLOSE_PUNCTUATION, INITIAL_QUOTE_PUNCTUATION,
  FINAL_QUOTE_PUNCTUATION, OTHER_PUNCTUATION, MATH_SYMBOL, CURRENCY_SYMBOL,
  MODIFIER_SYMBOL, OTHER_SYMBOL
}
 Each Unicode character is in exactly one of these categories.
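
 Because every character has exactly one category and there are 30 enumerators, a set of categories fits in a 32-bit mask. The sketch below shows that idea with a local copy of the enumerator list so that it compiles without Xapian. The namespace sketch and the helpers bit, is_letter and is_number are hypothetical, and the groupings are illustrative only; they are not claimed to match how any Xapian function classifies characters.

```cpp
namespace sketch {

// Local copy of the enumerator list above, in the same order. Real code
// would use Xapian::Unicode::category instead.
enum category {
    UNASSIGNED, UPPERCASE_LETTER, LOWERCASE_LETTER, TITLECASE_LETTER,
    MODIFIER_LETTER, OTHER_LETTER, NON_SPACING_MARK, ENCLOSING_MARK,
    COMBINING_SPACING_MARK, DECIMAL_DIGIT_NUMBER, LETTER_NUMBER, OTHER_NUMBER,
    SPACE_SEPARATOR, LINE_SEPARATOR, PARAGRAPH_SEPARATOR, CONTROL,
    FORMAT, PRIVATE_USE, SURROGATE, CONNECTOR_PUNCTUATION,
    DASH_PUNCTUATION, OPEN_PUNCTUATION, CLOSE_PUNCTUATION,
    INITIAL_QUOTE_PUNCTUATION, FINAL_QUOTE_PUNCTUATION, OTHER_PUNCTUATION,
    MATH_SYMBOL, CURRENCY_SYMBOL, MODIFIER_SYMBOL, OTHER_SYMBOL
};

// One bit per category; 30 enumerators fit in an unsigned 32-bit value.
constexpr unsigned bit(category c) { return 1u << c; }

// Illustrative groupings (assumptions of this sketch).
constexpr unsigned LETTER_MASK =
    bit(UPPERCASE_LETTER) | bit(LOWERCASE_LETTER) | bit(TITLECASE_LETTER) |
    bit(MODIFIER_LETTER) | bit(OTHER_LETTER);

constexpr unsigned NUMBER_MASK =
    bit(DECIMAL_DIGIT_NUMBER) | bit(LETTER_NUMBER) | bit(OTHER_NUMBER);

// Test membership of a category in a group with a single AND.
constexpr bool is_letter(category c) { return (bit(c) & LETTER_MASK) != 0; }
constexpr bool is_number(category c) { return (bit(c) & NUMBER_MASK) != 0; }

}  // namespace sketch
```

 With the real library, the argument to such a helper would be the value returned by Xapian::Unicode::get_category.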
 

Functions

unsigned Xapian::Unicode::nonascii_to_utf8 (unsigned ch, char *buf)
 Convert a single non-ASCII Unicode character to UTF-8.
 
unsigned Xapian::Unicode::to_utf8 (unsigned ch, char *buf)
 Convert a single Unicode character to UTF-8.
 
void Xapian::Unicode::append_utf8 (std::string &s, unsigned ch)
 Append the UTF-8 representation of a single Unicode character to a std::string.
 
category Xapian::Unicode::get_category (unsigned ch)
 Return the category which a given Unicode character falls into.
 
bool Xapian::Unicode::is_wordchar (unsigned ch)
 Test if a given Unicode character is a "word character".
 
bool Xapian::Unicode::is_whitespace (unsigned ch)
 Test if a given Unicode character is a whitespace character.
 
bool Xapian::Unicode::is_currency (unsigned ch)
 Test if a given Unicode character is a currency symbol.
 
unsigned Xapian::Unicode::tolower (unsigned ch)
 Convert a Unicode character to lowercase.
 
unsigned Xapian::Unicode::toupper (unsigned ch)
 Convert a Unicode character to uppercase.
 
std::string Xapian::Unicode::tolower (const std::string &term)
 Convert a UTF-8 std::string to lowercase.
 
std::string Xapian::Unicode::toupper (const std::string &term)
 Convert a UTF-8 std::string to uppercase.
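
 The conversion functions listed above take a code point and a character buffer and return an unsigned value. As a rough standalone illustration of that shape, the sketch below encodes one code point into a caller-supplied buffer and returns the number of bytes written, then builds a string-appending helper on top of it. The names encode_utf8 and append_encoded are hypothetical, this is not Xapian's implementation, and the 4-byte buffer requirement and the handling of values above U+10FFFF are assumptions of the sketch.

```cpp
#include <string>

// Hypothetical stand-in, not Xapian code: write the UTF-8 form of code
// point `ch` into `buf` and return the number of bytes written (1 to 4).
// `buf` must have room for 4 bytes. Values above 0x10FFFF write nothing
// and return 0 (an assumption of this sketch).
unsigned encode_utf8(unsigned ch, char* buf) {
    if (ch < 0x80) {
        // ASCII: a single byte, unchanged.
        buf[0] = static_cast<char>(ch);
        return 1;
    }
    if (ch < 0x800) {
        buf[0] = static_cast<char>(0xC0 | (ch >> 6));
        buf[1] = static_cast<char>(0x80 | (ch & 0x3F));
        return 2;
    }
    if (ch < 0x10000) {
        buf[0] = static_cast<char>(0xE0 | (ch >> 12));
        buf[1] = static_cast<char>(0x80 | ((ch >> 6) & 0x3F));
        buf[2] = static_cast<char>(0x80 | (ch & 0x3F));
        return 3;
    }
    if (ch < 0x110000) {
        buf[0] = static_cast<char>(0xF0 | (ch >> 18));
        buf[1] = static_cast<char>(0x80 | ((ch >> 12) & 0x3F));
        buf[2] = static_cast<char>(0x80 | ((ch >> 6) & 0x3F));
        buf[3] = static_cast<char>(0x80 | (ch & 0x3F));
        return 4;
    }
    return 0;
}

// Hypothetical helper: append the encoded form of `ch` to `s`.
void append_encoded(std::string& s, unsigned ch) {
    char buf[4];
    unsigned len = encode_utf8(ch, buf);
    s.append(buf, len);
}
```

 Splitting the work this way keeps the encoder free of allocation, so callers that already own a buffer can avoid constructing a std::string.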
 

Detailed Description

Unicode and UTF-8 related classes and functions.


Documentation for Xapian (version 1.4.0).
Generated on Wed Oct 12 2016 by Doxygen 1.8.11.