Group items tagged unicode

Fabien Cadet

utf 8 nbsp - RE: nbsp is not that hard, folks ; reply by: Américo Albuquerque... - 0 views

  • "&nbsp;", "&#160;" and "\u00A0" have nothing, NOTHING to do with UTF-8.
  • There is a character -- an abstract unit in a "script" (a writing system; we are using Latin right now) -- called NO-BREAK SPACE by the Unicode Standard and ISO/IEC 10646. Unicode and ISO/IEC 10646 assign this character an integer number, 160, which is A0 in hex.
  • UTF-8 is an encoding scheme that provides a way of representing any of the approximately 1.1 million possible abstract characters in Unicode as a sequence of 1 to 4 bytes.
  • The UTF-8 representation of the Unicode character 160 (no-break space), is the pair of bytes C2 A0, in that order.
  • This thing: \u00A0
  • the no-break space character
  • This thing: &#160; or this thing: &#xA0;
  • is to SGML applications like HTML and XML what \u00A0 is to Java & Python;
  • is called a character reference (or "numeric character reference").
  • This thing: &nbsp;
  • is to SGML applications like HTML and XML an "entity reference";
  •  
    « [...] "&nbsp;", "&#160;" and "\u00A0" have nothing, NOTHING to do with UTF-8 [...] Unicode and ISO/IEC 10646 assign this character an integer number, 160, which is A0 in hex [...] UTF-8 is an encoding scheme [...] The UTF-8 representation of the Unicode character 160 (no-break space), is the pair of bytes C2 A0. »
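
The distinction drawn in the annotations above (abstract character, code point, byte encoding, markup reference) can be checked in a few lines of Python. This is only a minimal sketch: the variable names are illustrative, the reference strings "&#160;", "&#xA0;" and "&nbsp;" are assumed to be the decimal, hexadecimal and named forms the quoted post is talking about, and the Latin-1 line is an added comparison that is not in the post.

```python
import html
import unicodedata

# "\u00A0" is a Python string escape; it denotes the character, not any bytes.
NBSP = "\u00A0"

# The character is identified by its code point, independent of any encoding.
code_point = ord(NBSP)             # 160, i.e. 0xA0
name = unicodedata.name(NBSP)      # 'NO-BREAK SPACE'

# UTF-8 is one way of serializing that code point as bytes.
utf8_bytes = NBSP.encode("utf-8")  # two bytes: C2 A0

# A different encoding yields different bytes for the same character
# (Latin-1 is an added comparison, not mentioned in the quoted post).
latin1_bytes = NBSP.encode("latin-1")  # one byte: A0

# Numeric character references and the named entity reference
# all resolve to the same single character when HTML is decoded.
references = ("&#160;", "&#xA0;", "&nbsp;")
decoded = [html.unescape(ref) for ref in references]

print(code_point, hex(code_point), name)
print(utf8_bytes.hex(), latin1_bytes.hex())
print(all(ch == NBSP for ch in decoded))
```

None of the three markup forms, nor the Python escape, says anything about UTF-8; the bytes C2 A0 only appear once the string is explicitly encoded.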
Joel Bennett

OMG Ponies!!! (Aka Humanity: Epic Fail) - Jon Skeet: Coding Blog - 7 views

  •  
    How the simple things (like time zones, daylight saving time, culture and Unicode) manage to make a software developer's job painful.