CkString C++ Reference Documentation

CkString

Current Version: 10.1.0

The Chilkat string class.

Object Creation

// Local variable on the stack
CkString obj;

// Dynamically allocate/delete
CkString *pObj = new CkString();
// ...
delete pObj;

Properties

NumArabic
int get_NumArabic(void);
Introduced in version 9.5.0.25

The number of Arabic characters contained in this string.

top
NumAscii
int get_NumAscii(void);
Introduced in version 9.5.0.25

The number of us-ascii characters contained in this string.

top
NumCentralEuro
int get_NumCentralEuro(void);
Introduced in version 9.5.0.25

The number of Central European and Eastern European characters found in this string. These are characters specific to Polish, Czech, Slovak, Hungarian, Slovene, Croatian, Serbian (Latin script), Romanian and Albanian.

top
NumChinese
int get_NumChinese(void);
Introduced in version 9.5.0.25

The number of Chinese characters contained in this string.

top
NumCyrillic
int get_NumCyrillic(void);
Introduced in version 9.5.0.25

The number of Cyrillic characters contained in this string. The Cyrillic alphabet also called azbuka, from the old name of the first two letters) is actually a family of alphabets, subsets of which are used by certain East and South Slavic languages "” Belarusian, Bulgarian, Macedonian, Russian, Rusyn, Serbian and Ukrainian"”as well as many other languages of the former Soviet Union, Asia and Eastern Europe.

top
NumGreek
int get_NumGreek(void);
Introduced in version 9.5.0.25

The number of Greek characters contained in this string.

top
NumHebrew
int get_NumHebrew(void);
Introduced in version 9.5.0.25

The number of Hebrew characters contained in this string.

top
NumJapanese
int get_NumJapanese(void);
Introduced in version 9.5.0.25

The number of Japanese characters contained in this string.

top
NumKorean
int get_NumKorean(void);
Introduced in version 9.5.0.25

The number of Korean characters contained in this string.

top
NumLatin
int get_NumLatin(void);
Introduced in version 9.5.0.25

The number of Latin characters contained in this string. Latin characters include all major Western European languages, such as German, Spanish, French, Italian, Nordic languages, etc.

Note: This is the number of chars that are Latin characters that do not fall in the us-ascii range. In other words, the 8bit Latin chars.

top
NumThai
int get_NumThai(void);
Introduced in version 9.5.0.25

The number of Thai characters contained in this string.

top

Methods

append
void append(const char *str);

The str is appended to end of this instance.

top
appendAnsi
void appendAnsi(const char *str);

Appends an ANSI string to the end of this instance. str should always be a null terminated ANSI string regardless of the Utf8 property setting.

top
appendChar
void appendChar(char c);

Appends a single ANSI character to the end of this instance.

top
appendCurrentDateRfc822
void appendCurrentDateRfc822(void);

Appends the current date/time to the end of this instance. The date/time is formatted according to the RFC822 standard, which is the typical format used in the "Date" header field of email. For example: "Fri, 27 Jul 2012 17:41:41 -0500"

top
appendDateRfc822
void appendDateRfc822(SYSTEMTIME &dateTime);

The dateTime is appended in RFC 822 format to the end of this instance.

top
appendDateRfc822Gmt
void appendDateRfc822Gmt(SYSTEMTIME &dateTime);

The dateTime is appended in RFC 822 format using GMT to the end of this instance.

top
appendEnc
void appendEnc(const char *str, const char *charsetEncoding);

Appends a string of any character encoding to the end of this instance. Examples of charsetEncoding are: Shift_JIS, windows-1255, iso-8859-2, gb2312, etc. The str should point to a null-terminated string that uses the charset specified by charsetEncoding.

More Information and Examples
top
appendHexData
void appendHexData(const void *byteData, unsigned long numBytes);

Converts the binary data to a hexidecimal string representation and appends to the end of this instance. The hex data is appended in 2-byte chunks with 16 bytes per line. For example:

6861 6C6C 6861 6C6C 6861 6C6C 6861 6C6C
6861 6C6C 6861 6C6C 6861 6C6C 6861 6C6C
6861 6C6C 6861 6C6C 6861 6C6C 6861 6C6C

top
appendInt
void appendInt(int n);

Appends the decimal string representation of an integer to the end of this instance.

top
appendLastWindowsError
void appendLastWindowsError(void);

For many Win32 Platform SDK functions, such as CreateFile, error information must be retrieved by using the Win32 functions GetLastError and FormatMessage. This method calls these Win32 functions to format the error and appends it to the string.

This method would only be used by Win32 applications that call Platform SDK functions.

top
appendN
void appendN(const char *str, unsigned long numBytes);

Appends N bytes of character data to the end of this instance. If the Utf8 property is set to true, then str should point to characters in the utf-8 encoding, otherwise it should point to characters using the ANSI encoding. Note: numBytes is not necessarily the number of characters. It is the length, in bytes, of the string to be appended. This method exists to allow for non-null terminated strings to be appended.

top
appendNU
void appendNU(const wchar_t *wideStr, int numChars);

Append N Unicode characters to the end of this instance. The wideStr points to the 2-byte per char Unicode string. The numChars is the number of Unicode characters to be appended (not the number of bytes).

top
appendRandom
void appendRandom(int numBytes, const char *encoding);

Appends numBytes random bytes to the end of this instance. Because arbitrary byte values in the range 0 to 255 do not necessarily represent valid characters, the bytes must be encoded to a string friendly representation such as hex, base64, etc. The encoding specifies the encoding to be used. Possible values are "hex", "base64", "quoted-printable", "asc", or "url".

top
appendStr
void appendStr(CkString &strObj);

Appends the contents of strObj to the end of this instance.

top
appendU
void appendU(const wchar_t *unicode);

Append a Unicode string to the CkString object.

top
appendUtf8
void appendUtf8(const char *str);

Appends a utf-8 string to the existing contents of this instance. str should always be a null terminated utf-8 string regardless of the Utf8 property setting.

top
base64Decode
void base64Decode(const char *charsetEncoding);

In-place base64 decodes the string and inteprets the results according to the character encoding specified.

More Information and Examples
top
base64DecodeW
void base64DecodeW(const wchar_t *charsetEncoding);

The utf-16 version of base64Decode.

top
base64Encode
void base64Encode(const char *charsetEncoding);

In-place base64 encodes the string. Internally, the string is first converted to the character encoding specified and then base-64 encoded. Typical charsetEncoding values are "utf-8", "ANSI", "iso-8859-1", etc.

More Information and Examples
top
base64EncodeW
void base64EncodeW(const wchar_t *charsetEncoding);

The utf-16 version of base64Encode.

top
beginsWith
bool beginsWith(const char *substr);

Return true if this string begins with substr (case sensitive), otherwise returns false.

top
beginsWithStr
bool beginsWithStr(CkString &strObj);

Returns true if the string begins with the contents of strObj. Otherwise returns false. This method is case sensitive.

top
beginsWithW
bool beginsWithW(const wchar_t *str);

The utf-16 version of beginsWith.

top
charAt
char charAt(int idx);

Returns the ANSI character at a specified index.The first character is at index 0.

top
charAtU
wchar_t charAtU(int idx);

Return the Nth character as a Unicode character.

top
chopAtFirstChar
void chopAtFirstChar(char ch);

Finds the first occurrence of ch and discards the characters at and following ch.

top
chopAtStr
void chopAtStr(CkString &subStrObj);

Finds the first occurrence of a substring and chops it at that point. The result is that the substring and all subsequent characters are removed from the string.

top
clear
void clear(void);

Clears the string. The string contains 0 characters after calling this method.

top
clone
CkString *clone(void);

Creates a copy of the string. As with any newly created Chilkat object instance returned by a Chilkat method, the returned CkString object must be deleted by the calling application.

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
compareStr
int compareStr(CkString &str);

Compare two strings. A return value = 0 means they are equal. Return value = 1 indicates that calling object is lexicographically less than argument. Return value = -1 indicates that calling object is lexicographically greater than argument.

top
containsSubstring
bool containsSubstring(const char *substr);

Returns true if the string contains the specified substring, otherwise returns false. The string comparison is case-sensitive.

top
containsSubstringNoCase
bool containsSubstringNoCase(const char *substr);

Same as containsSubstring except the matching is case insensitive.

top
containsSubstringNoCaseW
bool containsSubstringNoCaseW(const wchar_t *substr);

The utf-16 version of containsSubstringNoCase.

top
containsSubstringW
bool containsSubstringW(const wchar_t *substr);

The utf-16 version of containsSubstring.

top
countCharOccurances
int countCharOccurances(char ch);

Returns the number of occurrences of the specified ANSI char.

top
decodeXMLSpecial
void decodeXMLSpecial(void);

Decodes XML special characters. For example, < is converted to '<'

top
doubleValue
double doubleValue(void);

Converts the string to a double and returns the value.

top
eliminateChar
void eliminateChar(char ansiChar, int startIndex);

Eliminate all occurrences of a particular ANSI character.

top
encodeXMLSpecial
void encodeXMLSpecial(void);

Encodes XML special characters. For example, '<' is converted to <

top
endsWith
bool endsWith(const char *substr);

Returns true if the string ends with substr (case-sensitive). Otherwise returns false.

top
endsWithStr
bool endsWithStr(CkString &substrObj);

Returns true if the string ends with the specified substring, otherwise returns false.

top
endsWithW
bool endsWithW(const wchar_t *s);

The utf-16 version of endsWith.

top
entityDecode
void entityDecode(void);

Decodes any HTML entities found within the string, replacing them with the characters represented.

top
entityEncode
void entityEncode(void);

HTML encodes any characters that are special to HTML or cannot be represented by 7-bit us-ascii.

top
equals
bool equals(const char *str);

Returns true if the strings are equal, otherwise returns false. (case-sensitive)

top
equalsIgnoreCase
bool equalsIgnoreCase(const char *str);

Returns true if the strings are equal, otherwise returns false. (case-insensitive)

top
equalsIgnoreCaseStr
bool equalsIgnoreCaseStr(CkString &strObj);

Returns true if the strings are equal, otherwise returns false (case-insensitive)

top
equalsIgnoreCaseW
bool equalsIgnoreCaseW(const wchar_t *s);

The utf-16 version of equalsIgnoreCase.

top
equalsStr
bool equalsStr(CkString &strObj);

Returns true if the strings are equal, otherwise returns false. (case-sensitive)

top
equalsW
bool equalsW(const wchar_t *s);

The utf-16 version of the "equals" method.

top
getChar
CkString *getChar(int idx);

Returns a new CkString object containing the Nth character. (Note, it does not contain the Nth byte, but the Nth character.) For languages such as Chinese, Japanese, etc. individual characters are represented by multiple or varying number of bytes.

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
getEnc
const char *getEnc(const char *encoding);

Returns the string as a null-terminated byte sequence in the character encoding specified by encoding. The encoding can be any encoding specified at the link below, except for charsets that contain 0 (null) bytes in individual character byte representations, such as utf-16, utf-32, etc.

top
getNumChars
int getNumChars(void);

Returns the number of characters in the string.

top
getSizeAnsi
int getSizeAnsi(void);

Returns the size, in bytes, of the ANSI encoding of the string.

top
getSizeUnicode
int getSizeUnicode(void);

Returns the size, in bytes, of the Unicode encoding of the string.

top
getSizeUtf8
int getSizeUtf8(void);

Returns the size, in bytes, of the utf-8 encoding of the string.

top
getString
const char *getString(void);

Returns the contents of this instance.

top
getStringAnsi
const char *getStringAnsi(void);

Returns the string as null-terminated ANSI.

top
getStringUtf8
const char *getStringUtf8(void);

Returns the string as null-terminated utf-8.

top
getUnicode
const wchar_t *getUnicode(void);

Return a pointer to memory containing the string in Unicode.

top
hexDecode
void hexDecode(const char *charsetEncoding);

Hex decodes a string and inteprets the bytes according to the character encoding specified.

More Information and Examples
top
hexDecodeW
void hexDecodeW(const wchar_t *charsetEncoding);

The utf-16 version of hexDecode.

top
hexEncode
void hexEncode(const char *charsetEncoding);

Converts the string to the character encoding specified and replaces the string contents with the hex encoding of the character data.

More Information and Examples
top
hexEncodeW
void hexEncodeW(const wchar_t *charsetEncoding);

The utf-16 version of hexEncode.

top
indexOf
int indexOf(const char *substr);

Returns the index of the first occurrence of a substring. Returns -1 if not found.

top
indexOfStr
int indexOfStr(CkString &substrObj);

Returns the index of the first occurrence of a substring. Returns -1 if not found.

top
indexOfW
int indexOfW(const wchar_t *s);

The utf-16 version of "indexOf".

top
intValue
int intValue(void);

Converts the string to an integer and returns the integer value.

top
isEmpty
bool isEmpty(void);

Returns true if the string object is empty, otherwise returns false.

top
lastChar
char lastChar(void);

Returns the last ANSI character in the string.

top
loadFile
bool loadFile(const char *path, const char *charsetEncoding);

Load the contents of a text file into the CkString object. The string is cleared before loading. The character encoding of the text file is specified by charsetEncoding. This method allows for text files in any charset to be loaded: utf-8, Unicode, Shift_JIS, iso-8859-1, etc.

Returns true for success, false for failure.

More Information and Examples
top
loadFileW
bool loadFileW(const wchar_t *path, const wchar_t *charsetEncoding);

The utf-16 version of loadFile.

Returns true for success, false for failure.

top
matches
bool matches(const char *strPattern);

Returns true if the string matches the strPattern, which may contain one or more asterisk wildcard characters. Returns false if the string does not match. This method is case-sensitive.

top
matchesNoCase
bool matchesNoCase(const char *strPattern);

Returns true if the string matches the strPattern, which may contain one or more asterisk wildcard characters. Returns false if the string does not match. This method is case-insensitive.

top
matchesNoCaseW
bool matchesNoCaseW(const wchar_t *s);

The utf-16 version of matchesNoCase.

top
matchesStr
bool matchesStr(CkString &strPatternObj);

Returns true if the string matches a pattern, otherwise returns false. The pattern may contain any number of wildcard '*' characters which represent 0 or more occurrences of any character. This method is case-sensitive.

top
matchesW
bool matchesW(const wchar_t *s);

The utf-16 version of the "matches" method.

top
minimizeMemory
void minimizeMemory(void);

Minimizes the amount of memory consumed by this object. For example, consider the following: A CkString object is loaded with the contents of a text file. The "replaceAllOccurances" method is called, replacing longer substrings with shorter replacements. The actual string length will become shorter than the internal buffer space that is allocated. The minimizeMemory method will, if necessary, allocate a new internal buffer that is exactly the size needed to hold the current contents of the string, copy the string to the new internal buffer, and deallocate the old buffer.

top
obfuscate
void obfuscate(void);

Obfuscates the string. (The unobfuscate method can be called to reverse the obfuscation to restore the original string.)

The Chilkat string obfuscation algorithm works by taking the utf-8 bytes of the string, base64 encoding it, and then scrambling the letters of the base64 encoded string. It is deterministic in that the same string will always obfuscate to the same result. It is not a secure way of encrypting a string. It is only meant to be a simple means of transforming a string into something unintelligible.

top
prepend
void prepend(const char *str);

Prepends str to this instance.

top
prependW
void prependW(const wchar_t *s);

The utf-16 version of the "prepend" method.

top
punyDecode
void punyDecode(void);
Introduced in version 9.5.0.52

In-place decodes the string from punycode.

top
punyEncode
void punyEncode(void);
Introduced in version 9.5.0.52

In-place encodes the string to punycode.

top
qpDecode
void qpDecode(const char *charsetEncoding);

Quoted-printable decodes the string and interprets the resulting character data according to the specified character encoding. The result is that the quoted-printable string is in-place decoded.

More Information and Examples
top
qpDecodeW
void qpDecodeW(const wchar_t *charset);

The utf-16 version of the qpDecode method.

top
qpEncode
void qpEncode(const char *charsetEncoding);

Quoted-printable encodes the string. The string is first converted to the charset specified, and those bytes are QP-encoded. The contents of the string are replaced with the QP-encoded result.

More Information and Examples
top
qpEncodeW
void qpEncodeW(const wchar_t *charset);

The utf-16 version of the qpEncode method.

top
removeAll
int removeAll(CkString &substr);

Removes all occurrences of substr.

top
removeCharOccurances
void removeCharOccurances(char ch);

Removes all occurrences of a specific ANSI character from the string.

top
removeChunk
void removeChunk(int charStartPos, int numChars);

Removes a chunk of characters specified by starting index and length.

top
removeDelimited
void removeDelimited(const char *beginDelim, const char *endDelim, bool caseSensitive);
Introduced in version 9.5.0.52

Remove all occurrences of strings delimited by beginDelim and endDelim. Also removes the delimiters.

top
removeFirst
bool removeFirst(CkString &substr);

Removes the first occurrence of a substring.

top
replaceAll
int replaceAll(CkString &findStrObj, CkString &replaceStrObj);

Replaces all occurrences of a substring with another. The replacement string is allowed to be empty or different in length.

top
replaceAllOccurances
int replaceAllOccurances(const char *findStr, const char *replaceStr);

Replaces all occurrences of a substring with another substring. The replacement string is allowed to be empty or different in length.

top
replaceAllOccurancesW
int replaceAllOccurancesW(const wchar_t *pattern, const wchar_t *replacement);

The utf-16 version of the replaceAllOccurances method.

top
replaceChar
void replaceChar(char findCh, char replaceCh);

Replaces all occurrences of a specified ANSI character with another.

top
replaceFirst
bool replaceFirst(CkString &findStrObj, CkString &replaceStrObj);

Replaces the first occurrence of a substring with another. The replacement string is allowed to be empty or different in length.

top
replaceFirstOccurance
bool replaceFirstOccurance(const char *findStr, const char *replaceStr);

Replaces the first occurrence of a substring with another. The replacement string is allowed to be empty or different in length. (Chilkat is aware of the misspelling of the word "occurrence", but unfortunately it is too late to change..)

top
replaceFirstOccuranceW
bool replaceFirstOccuranceW(const wchar_t *pattern, const wchar_t *replacement);

The utf-16 version of replaceFirstOccurrance. (Chilkat is aware of the misspelling of the word "occurrence", but unfortunately it is too late to change..)

top
saveToFile
bool saveToFile(const char *path, const char *charsetEncoding);

Saves the string to a file using the character encoding specified by charsetEncoding. If a file of the same name exists, it is overwritten. For charsets such as "utf-8", "utf-16", or others that have a possible BOM/preamble, the preamble is output by default. To exclude the BOM/preamble, prepend "no-bom-" to the charset name. For example "no-bom-utf-8".

Returns true for success, false for failure.

More Information and Examples
top
saveToFileW
bool saveToFileW(const wchar_t *path, const wchar_t *charset);

The utf-16 version of the saveToFile method.

Returns true for success, false for failure.

top
setStr
void setStr(CkString &s);

Replaces the contents of the string with another.

top
setString
void setString(const char *str);

Clears the contents of this instance and appends str.

top
setStringAnsi
void setStringAnsi(const char *s);

Set the CkString object from an ANSI string.

top
setStringU
void setStringU(const wchar_t *unicode);

Set the CkString object from a Unicode string.

top
setStringUtf8
void setStringUtf8(const char *s);

Set the string object from a utf-8 string.

top
shorten
void shorten(int n);

Discards the last N characters.

top
split
CkStringArray *split(char delimiterChar, bool exceptDoubleQuoted, bool exceptEscaped, bool keepEmpty);

Splits a string into a collection of strings using a delimiter character. If exceptEscaped is true, then delimiter chars escaped with a backslash are ignored. If exceptDoubleQuoted is true, then delimiter chars inside quotes are ignored. If keepEmpty is false, then empty strings are excluded from being added to the returned CkStringArray object.

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
split2
CkStringArray *split2(const char *delimiterChars, bool exceptDoubleQuoted, bool exceptEscaped, bool keepEmpty);

Same as "split", except a set of characters can be used for delimiters.

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
split2W
CkStringArray *split2W(const wchar_t *splitCharSet, bool exceptDoubleQuoted, bool exceptEscaped, bool keepEmpty);

The utf-16 version of the split2 method.

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
splitAtWS
CkStringArray *splitAtWS(void);

Equivalent to split2(" \t\r\n",true,true,false)

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
substring
CkString *substring(int startCharIndex, int numChars);

Returns a substring specified by starting character position and number of characters. (The 1st char is at index 0.)

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
toCRLF
void toCRLF(void);

Converts all line endings to CRLF.

top
tokenize
CkStringArray *tokenize(const char *punctuation);

Tokenizes a string. The string is split at whitespace characters, and any single punctuation character is returned as a separate token. For example, this string:
CkStringArray *CkString::tokenize(char *punctuation) const

is tokenized to

CkStringArray
*
CkString
:
:
tokenize
(
*
punctuation
)
const

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
tokenizeW
CkStringArray *tokenizeW(const wchar_t *punctuation);

The utf-16 version of the "tokenize" method.

Note: The application is responsible for deleting (via the C++ delete operator) the object returned by this method.

Returns NULL on failure

top
toLF
void toLF(void);

Converts all line endings to bare-LF (Unix/Linux style line endings).

top
toLowerCase
void toLowerCase(void);

Converts the string to lowercase.

top
toUpperCase
void toUpperCase(void);

Converts the string to uppercase.

top
trim
void trim(void);

Trim SPACE and Tab characters from both ends of the string.

top
trim2
void trim2(void);

Trim SPACE, Tab, CR, and LF characters from both ends of the string.

top
trimInsideSpaces
void trimInsideSpaces(void);

Replaces all tabs, CR's, and LF's, with SPACE chars, and removes extra SPACE's so there are no occurrences of more than one SPACE char in a row.

top
unobfuscate
void unobfuscate(void);

Unobfuscates the string.

The Chilkat string obfuscation algorithm works by taking the utf-8 bytes of the string, base64 encoding it, and then scrambling the letters of the base64 encoded string. It is deterministic in that the same string will always obfuscate to the same result. It is not a secure way of encrypting a string. It is only meant to be a simple means of transforming a string into something unintelligible.

top
urlDecode
void urlDecode(const char *charsetEncoding);

URL decodes the string and interprets the resulting byte data in the specified charset encoding.

More Information and Examples
top
urlDecodeW
void urlDecodeW(const wchar_t *charsetEncoding);

The utf-16 version of the urlDecode method.

top
urlEncode
void urlEncode(const char *charsetEncoding);

URL encodes the string. The string is first converted to the specified charset encoding, and those bytes are URL-encoded. The contents of the string are replaced with the URL-encoded result.

More Information and Examples
top
urlEncodeW
void urlEncodeW(const wchar_t *charsetEncoding);

The utf-16 version of the urlEncode method.

top