Unicode string library for Erlang
At a glance
- An OTP application, with a C pipe driver under the hood.
- Based on ICU, the best implementation of Unicode.
- Strings stored as binaries ⇒ big memory savings.
- Backwards-compatible (mostly) with the standard string module.
- Distributed under the new BSD license.
Documentation
Also, take a look at demo/demo1.erl and demo/demo2.erl for simple examples.
Development
- Google Code project: http://code.google.com/p/starling/
- Mailing list: http://groups.google.com/group/starling-discuss/topics
- Source: svn co http://starling.googlecode.com/svn/trunk/ starling
Please send patches to the mailing list or to hasan@12monkeys.co.uk
TODOs
- More string processing functions.
- Lift the length limit of 215 code units.
- Longer-term: Apart from Unicode string handling, ICU also provides APIs to work with Unicode regexps, localization, and locale-specific tasks like number and date formatting. Starling will wrap those APIs too eventually.
Q & A
Unicode Explained, a great book on Unicode, has an image of a long-tailed glossy starling on its cover.
I am aware of the Starling by Twitter, and I don't care.
