The server codebase needs to support some of the standard Unicode algorithms:

  • Normalization (at least NFD and NFKD; UAX #15)
  • Collation (at least a 'generic' version, and preferably locale-selectable)
  • Case mapping (likewise: at least a 'generic' version, and preferably locale-selectable)
  • Segmentation (UAX #29)
  • Line breaking? (UAX #14)
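
As a rough illustration of the first and third items, here is a minimal Python sketch using only the standard library's unicodedata module. It is not the server's implementation. The helper names (nfd, nfkd, caseless_key) are hypothetical and exist only for this example. Case folding here is the locale-independent ("generic") kind; locale-selectable case mapping and collation would most likely need a library such as ICU, which this sketch does not cover.

```python
import unicodedata


def nfd(text: str) -> str:
    """Canonical decomposition (NFD). Hypothetical helper name."""
    return unicodedata.normalize("NFD", text)


def nfkd(text: str) -> str:
    """Compatibility decomposition (NFKD). Hypothetical helper name."""
    return unicodedata.normalize("NFKD", text)


def caseless_key(text: str) -> str:
    """Locale-independent key for caseless comparison.

    Follows the usual canonical caseless matching recipe:
    normalize, case fold, then normalize again, because folding
    can produce sequences that are no longer normalized.
    """
    return nfd(nfd(text).casefold())


if __name__ == "__main__":
    # NFD splits a precomposed letter into base + combining mark.
    print([hex(ord(c)) for c in nfd("\u00e9")])
    # NFKD additionally replaces compatibility characters such as ligatures.
    print(nfkd("\ufb01"))
    # Generic case folding makes these two spellings compare equal.
    print(caseless_key("Stra\u00dfe") == caseless_key("STRASSE"))
```

Note that NFD leaves the "fi" ligature (U+FB01) untouched while NFKD expands it to the two letters "fi", which is the practical difference between the two forms listed above.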