title.txt 3.9 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768
  1. title.txt
  2. The MediaWiki software's "Title" class represents article titles, which are used
  3. for many purposes: as the human-readable text title of the article, in the URL
  4. used to access the article, the wikitext link to the article, the key into the
  5. article database, and so on. The class in instantiated from one of these forms
  6. and can be queried for the others, and for other attributes of the title. This
  7. is intended to be an immutable "value" class, so there are no mutator functions.
  8. To get a new instance, call Title::newFromText(). Once instantiated, the
  9. non-static accessor methods can be used, such as getText(), getDBkey(),
  10. getNamespace(), etc. Note that Title::newFromText() may return false if the text
  11. is illegal according to the rules below.
  12. The prefix rules: a title consists of an optional interwiki prefix (such as "m:"
  13. for meta or "de:" for German), followed by an optional namespace, followed by
  14. the remainder of the title. Both interwiki prefixes and namespace prefixes have
  15. the same rules: they contain only letters, digits, space, and underscore, must
  16. start with a letter, are case insensitive, and spaces and underscores are
  17. interchangeable. Prefixes end with a ":". A prefix is only recognized if it is
  18. one of those specifically allowed by the software. For example, "de:name" is a
  19. link to the article "name" in the German Wikipedia, because "de" is recognized
  20. as one of the allowable interwikis. The title "talk:name" is a link to the
  21. article "name" in the "talk" namespace of the current wiki, because "talk" is a
  22. recognized namespace. Both may be present, and if so, the interwiki must
  23. come first, for example, "m:talk:name". If a title begins with a colon as its
  24. first character, no prefixes are scanned for, and the colon is just removed.
  25. Note that because of these rules, it is possible to have articles with colons in
  26. their names. "E. Coli 0157:H7" is a valid title, as is "2001: A Space Odyssey",
  27. because "E. Coli 0157" and "2001" are not valid interwikis or namespaces.
  28. It is not possible to have an article whose bare name includes a namespace or
  29. interwiki prefix.
  30. An initial colon in a title listed in wiki text may however suppress special
  31. handling for interlanguage links, image links, and category links. It is also
  32. used to indicate the main namespace in template inclusions.
  33. Once prefixes have been stripped, the rest of the title processed this way:
  34. * Spaces and underscores are treated as equivalent and each is converted to the
  35. other in the appropriate context (underscore in URL and database keys, spaces
  36. in plain text).
  37. * Multiple consecutive spaces are converted to a single space.
  38. * Leading or trailing space is removed.
  39. * If $wgCapitalLinks is enabled (the default), the first letter is capitalised,
  40. using the capitalisation function of the content language object.
  41. * The unicode characters LRM (U+200E) and RLM (U+200F) are silently stripped.
  42. * Invalid UTF-8 sequences or instances of the replacement character (U+FFFD) are
  43. considered illegal.
  44. * A percent sign followed by two hexadecimal characters is illegal
  45. * Anything that looks like an XML/HTML character reference is illegal
  46. * Any character not matched by the $wgLegalTitleChars regex is illegal
  47. * Zero-length titles (after whitespace stripping) are illegal
  48. All titles except special pages must be less than 255 bytes when encoded with
  49. UTF-8, because that is the size of the database field. Special page titles may
  50. be up to 512 bytes.
  51. Note that Unicode Normal Form C (NFC) is enforced by MediaWiki's user interface
  52. input functions, and so titles will typically be in this form.
  53. getArticleID() needs some explanation: for "internal" articles, it should return
  54. the "page_id" field if the article exists, else it returns 0. For all external
  55. articles it returns 0. All of the IDs for all instances of Title created during
  56. a request are cached, so they can be looked up quickly while rendering wiki text
  57. with lots of internal links. See linkcache.txt.