[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: ANN: Gorille 0.3
John Cowan 'scripted it' > > At least in UTF-8 you can just count bytes <0x80 to count characters. > Make that 0xC0. No, it's easy, but not quite that easy. Unicode code-points up to U+007F are represented as 8-bit bytes with the same value, so counting bytes <0x80 gives you the number of US-ASCII characters. Characters above U+007F are represented with multiple bytes, the first >= 0xC0, the trailing bytes are all >= 0x80. So to count characters, one way is to count all bytes less <= 0x7F or >= 0xC0. Regards ~Rob -- Rob Lugt ElCel Technology http://www.elcel.com
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|