stdstringlib.rst 14 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324
  1. .. _stdlib_stdstringlib:
  2. ==================
  3. The String library
  4. ==================
  5. the string lib implements string formatting and regular expression matching routines.
  6. --------------
  7. Squirrel API
  8. --------------
  9. ++++++++++++++
  10. Global Symbols
  11. ++++++++++++++
  12. .. js:function:: endswith(str, cmp)
  13. returns `true` if the end of the string `str` matches a the string `cmp` otherwise returns `false`
  14. .. js:function:: escape(str)
  15. Returns a string with backslashes before characters that need to be escaped(`\",\a,\b,\t,\n,\v,\f,\r,\\,\",\',\0,\xnn`).
  16. .. js:function:: format(formatstr, ...)
  17. Returns a string formatted according `formatstr` and the optional parameters following it.
  18. The format string follows the same rules as the `printf` family of
  19. standard C functions( the "*" is not supported). ::
  20. e.g.
  21. sq> print(format("%s %d 0x%02X\n","this is a test :",123,10));
  22. this is a test : 123 0x0A
  23. .. js:function:: printf(formatstr, ...)
  24. Just like calling `print(format(formatstr` as in the example above, but is more convenient AND more efficient. ::
  25. e.g.
  26. sq> printf("%s %d 0x%02X\n","this is a test :",123,10);
  27. this is a test : 123 0x0A
  28. .. js:function:: lstrip(str)
  29. Strips white-space-only characters that might appear at the beginning of the given string
  30. and returns the new stripped string.
  31. .. js:function:: rstrip(str)
  32. Strips white-space-only characters that might appear at the end of the given string
  33. and returns the new stripped string.
  34. .. js:function:: split(str, separators [, skipempty])
  35. returns an array of strings split at each point where a separator character occurs in `str`.
  36. The separator is not returned as part of any array element.
  37. The parameter `separators` is a string that specifies the characters as to be used for the splitting.
  38. The parameter `skipempty` is a boolean (default false). If `skipempty` is true, empty strings are not added to array.
  39. ::
  40. eg.
  41. local a = split("1.2-3;;4/5",".-/;");
  42. // the result will be [1,2,3,,4,5]
  43. or
  44. local b = split("1.2-3;;4/5",".-/;",true);
  45. // the result will be [1,2,3,4,5]
  46. .. js:function:: startswith(str, cmp)
  47. returns `true` if the beginning of the string `str` matches the string `cmp`; otherwise returns `false`
  48. .. js:function:: strip(str)
  49. Strips white-space-only characters that might appear at the beginning or end of the given string and returns the new stripped string.
  50. ++++++++++++++++++
  51. The regexp class
  52. ++++++++++++++++++
  53. .. js:class:: regexp(pattern)
  54. The regexp object represents a precompiled regular expression pattern. The object is created
  55. through `regexp(pattern)`.
  56. +---------------------+--------------------------------------+
  57. | `\\` | Quote the next metacharacter |
  58. +---------------------+--------------------------------------+
  59. | `^` | Match the beginning of the string |
  60. +---------------------+--------------------------------------+
  61. | `.` | Match any character |
  62. +---------------------+--------------------------------------+
  63. | `$` | Match the end of the string |
  64. +---------------------+--------------------------------------+
  65. | `|` | Alternation |
  66. +---------------------+--------------------------------------+
  67. | `(subexp)` | Grouping (creates a capture) |
  68. +---------------------+--------------------------------------+
  69. | `(?:subexp)` | No Capture Grouping (no capture) |
  70. +---------------------+--------------------------------------+
  71. | `[]` | Character class |
  72. +---------------------+--------------------------------------+
  73. **GREEDY CLOSURES**
  74. +---------------------+---------------------------------------------+
  75. | `*` | Match 0 or more times |
  76. +---------------------+---------------------------------------------+
  77. | `+` | Match 1 or more times |
  78. +---------------------+---------------------------------------------+
  79. | `?` | Match 1 or 0 times |
  80. +---------------------+---------------------------------------------+
  81. | `{n}` | Match exactly n times |
  82. +---------------------+---------------------------------------------+
  83. | `{n,}` | Match at least n times |
  84. +---------------------+---------------------------------------------+
  85. | `{n,m}` | Match at least n but not more than m times |
  86. +---------------------+---------------------------------------------+
  87. **ESCAPE CHARACTERS**
  88. +---------------------+--------------------------------------+
  89. | `\\t` | tab (HT, TAB) |
  90. +---------------------+--------------------------------------+
  91. | `\\n` | newline (LF, NL) |
  92. +---------------------+--------------------------------------+
  93. | `\\r` | return (CR) |
  94. +---------------------+--------------------------------------+
  95. | `\\f` | form feed (FF) |
  96. +---------------------+--------------------------------------+
  97. **PREDEFINED CLASSES**
  98. +---------------------+--------------------------------------+
  99. | `\\l` | lowercase next char |
  100. +---------------------+--------------------------------------+
  101. | `\\u` | uppercase next char |
  102. +---------------------+--------------------------------------+
  103. | `\\a` | letters |
  104. +---------------------+--------------------------------------+
  105. | `\\A` | non letters |
  106. +---------------------+--------------------------------------+
  107. | `\\w` | alphanumeric `[_0-9a-zA-Z]` |
  108. +---------------------+--------------------------------------+
  109. | `\\W` | non alphanumeric `[^_0-9a-zA-Z]` |
  110. +---------------------+--------------------------------------+
  111. | `\\s` | space |
  112. +---------------------+--------------------------------------+
  113. | `\\S` | non space |
  114. +---------------------+--------------------------------------+
  115. | `\\d` | digits |
  116. +---------------------+--------------------------------------+
  117. | `\\D` | non digits |
  118. +---------------------+--------------------------------------+
  119. | `\\x` | hexadecimal digits |
  120. +---------------------+--------------------------------------+
  121. | `\\X` | non hexadecimal digits |
  122. +---------------------+--------------------------------------+
  123. | `\\c` | control characters |
  124. +---------------------+--------------------------------------+
  125. | `\\C` | non control characters |
  126. +---------------------+--------------------------------------+
  127. | `\\p` | punctuation |
  128. +---------------------+--------------------------------------+
  129. | `\\P` | non punctuation |
  130. +---------------------+--------------------------------------+
  131. | `\\b` | word boundary |
  132. +---------------------+--------------------------------------+
  133. | `\\B` | non word boundary |
  134. +---------------------+--------------------------------------+
  135. .. js:function:: regexp.capture(str [, start])
  136. returns an array of tables containing two indexes ("begin" and "end") of
  137. the first match of the regular expression in the string `str`.
  138. An array entry is created for each captured sub expressions. If no match occurs returns null.
  139. The search starts from the index `start`
  140. of the string; if `start` is omitted the search starts from the beginning of the string.
  141. The first element of the returned array(index 0) always contains the complete match.
  142. ::
  143. local ex = regexp(@"(\d+) ([a-zA-Z]+)(\p)");
  144. local string = "stuff 123 Test;";
  145. local res = ex.capture(string);
  146. foreach(i,val in res)
  147. {
  148. print(format("match number[%02d] %s\n",
  149. i,string.slice(val.begin,val.end))); //prints "Test"
  150. }
  151. ...
  152. will print
  153. match number[00] 123 Test;
  154. match number[01] 123
  155. match number[02] Test
  156. match number[03] ;
  157. .. js:function:: regexp.match(str)
  158. returns a true if the regular expression matches the string
  159. `str`, otherwise returns false.
  160. .. js:function:: regexp.search(str [, start])
  161. returns a table containing two indexes ("begin" and "end") of the first match of the regular expression in
  162. the string `str`, otherwise if no match occurs returns null. The search starts from the index `start`
  163. of the string; if `start` is omitted the search starts from the beginning of the string.
  164. ::
  165. local ex = regexp("[a-zA-Z]+");
  166. local string = "123 Test;";
  167. local res = ex.search(string);
  168. print(string.slice(res.begin,res.end)); //prints "Test"
  169. -------------
  170. C API
  171. -------------
  172. .. _sqstd_register_stringlib:
  173. .. c:function:: SQRESULT sqstd_register_stringlib(HSQUIRRELVM v)
  174. :param HSQUIRRELVM v: the target VM
  175. :returns: an SQRESULT
  176. :remarks: The function aspects a table on top of the stack where to register the global library functions.
  177. initialize and register the string library in the given VM.
  178. +++++++++++++
  179. Formatting
  180. +++++++++++++
  181. .. c:function:: SQRESULT sqstd_format(HSQUIRRELVM v, SQInteger nformatstringidx, SQInteger * outlen, SQChar ** output)
  182. :param HSQUIRRELVM v: the target VM
  183. :param SQInteger nformatstringidx: index in the stack of the format string
  184. :param SQInteger * outlen: a pointer to an integer that will be filled with the length of the newly created string
  185. :param SQChar ** output: a pointer to a string pointer that will receive the newly created string
  186. :returns: an SQRESULT
  187. :remarks: the newly created string is allocated in the scratchpad memory.
  188. creates a new string formatted according to the object at position `nformatstringidx` and the optional parameters following it.
  189. The format string follows the same rules as the `printf` family of
  190. standard C functions( the "*" is not supported).
  191. ++++++++++++++++++
  192. Regular Expessions
  193. ++++++++++++++++++
  194. .. c:function:: SQRex* sqstd_rex_compile(const SQChar * pattern, const SQChar ** error)
  195. :param SQChar * pattern: a pointer to a zero terminated string containing the pattern that has to be compiled.
  196. :param SQChar ** error: a pointer to a string pointer that will be set with an error string in case of failure.
  197. :returns: a pointer to the compiled pattern
  198. compiles an expression and returns a pointer to the compiled version.
  199. in case of failure returns NULL.The returned object has to be deleted
  200. through the function sqstd_rex_free().
  201. .. c:function:: void sqstd_rex_free(SQRex * exp)
  202. :param SQRex * exp: the expression structure that has to be deleted.
  203. deletes a expression structure created with sqstd_rex_compile()
  204. .. c:function:: SQBool sqstd_rex_match(SQRex * exp,const SQChar * text)
  205. :param SQRex * exp: a compiled expression
  206. :param SQChar * text: the string that has to be tested
  207. :returns: SQTrue if successful otherwise SQFalse
  208. returns SQTrue if the string specified in the parameter text is an
  209. exact match of the expression, otherwise returns SQFalse.
  210. .. c:function:: SQBool sqstd_rex_search(SQRex * exp, const SQChar * text, const SQChar ** out_begin, const SQChar ** out_end)
  211. :param SQRex * exp: a compiled expression
  212. :param SQChar * text: the string that has to be tested
  213. :param SQChar ** out_begin: a pointer to a string pointer that will be set with the beginning of the match
  214. :param SQChar ** out_end: a pointer to a string pointer that will be set with the end of the match
  215. :returns: SQTrue if successful otherwise SQFalse
  216. searches the first match of the expression in the string specified in the parameter text.
  217. if the match is found returns SQTrue and the sets out_begin to the beginning of the
  218. match and out_end at the end of the match; otherwise returns SQFalse.
  219. .. c:function:: SQBool sqstd_rex_searchrange(SQRex * exp, const SQChar * text_begin, const SQChar * text_end, const SQChar ** out_begin, const SQChar ** out_end)
  220. :param SQRex * exp: a compiled expression
  221. :param SQChar * text_begin: a pointer to the beginnning of the string that has to be tested
  222. :param SQChar * text_end: a pointer to the end of the string that has to be tested
  223. :param SQChar ** out_begin: a pointer to a string pointer that will be set with the beginning of the match
  224. :param SQChar ** out_end: a pointer to a string pointer that will be set with the end of the match
  225. :returns: SQTrue if successful otherwise SQFalse
  226. searches the first match of the expression in the string delimited
  227. by the parameter text_begin and text_end.
  228. if the match is found returns SQTrue and sets out_begin to the beginning of the
  229. match and out_end at the end of the match; otherwise returns SQFalse.
  230. .. c:function:: SQInteger sqstd_rex_getsubexpcount(SQRex * exp)
  231. :param SQRex * exp: a compiled expression
  232. :returns: the number of sub expressions matched by the expression
  233. returns the number of sub expressions matched by the expression
  234. .. c:function:: SQBool sqstd_rex_getsubexp(SQRex * exp, SQInteger n, SQRexMatch * subexp)
  235. :param SQRex * exp: a compiled expression
  236. :param SQInteger n: the index of the submatch(0 is the complete match)
  237. :param SQRexMatch * a: pointer to structure that will store the result
  238. :returns: the function returns SQTrue if n is a valid index; otherwise SQFalse.
  239. retrieve the begin and and pointer to the length of the sub expression indexed
  240. by n. The result is passed through the struct SQRexMatch.