Unicode madness
Taking into account the unexpected temporal shift in the early hours of the morning (the clocks going back), I woke up in the afternoon today.
I was looking at Unicode.org and I discovered how to achieve matched “double” and ‘single quotes’.
Buncey will be happy.
Here's a cute little asterism for Buncey: ⁂
There is a set of code charts, including one for General Punctuation (PDF). Take the hexadecimal number under the character you would like and insert it between
For example,
I advised Buncey to use ` (backquote) and ' (single quote) for opening and closing quotes, respectively. This annoyed Buncey as the ` (backquote) is an opening quote, yet the ' (single quote) is not a closing quote - simply a single quote. I found the solution - so he'll be happy. I believe the complain originated with his dislike of Microsoft Word's problematic `smart-quotes'.
For those who aren't sure what Microsoft's smart quotes are:
For some unknown reason (well, one can presume it was part of its Embrace and extend strategy), Microsoft decided to use its own set of opening and closing quotes (i.e. real quotes as opposed to ` (backquote) and ' (single quote)). The problem was that there were already quote symbols in Unicode, which is a character set. As a result most browsers and operating systems, other than Microsoft's, display Microsoft's smart quotes very poorly (see example from PHP Princess - see related links).
Related Links:
This is probably a crappy article as I moved all over the place while writing. And now that I discover why I should use deciamal notation for Unicode (see Curling Quotes in Related Links) after I've gone on to explain how to do it in hexadecimal. Bah! I'm not editing this anymore as I just want to write down the usage of the quotes and the dashes before I go to bed. Web standards and related issues is a very interesting field. But doing big edits to already written blog posts isn't my thing.
I'll just mention:
“ doubles ” (
‘ singles ’ (
I was looking at Unicode.org and I discovered how to achieve matched “double” and ‘single quotes’.
Buncey will be happy.
Here's a cute little asterism for Buncey: ⁂
There is a set of code charts, including one for General Punctuation (PDF). Take the hexadecimal number under the character you would like and insert it between
&#x
(the `#' (hash or `pound sign') denotes a numerical rather than a worded escape code, and the `x' denotes hexadecimal as opposed to decimal notaion) and ;
.
For example,
†
gives the exotic † (dagger). However, †
also produces a †.
I advised Buncey to use ` (backquote) and ' (single quote) for opening and closing quotes, respectively. This annoyed Buncey as the ` (backquote) is an opening quote, yet the ' (single quote) is not a closing quote - simply a single quote. I found the solution - so he'll be happy. I believe the complain originated with his dislike of Microsoft Word's problematic `smart-quotes'.
For those who aren't sure what Microsoft's smart quotes are:
For some unknown reason (well, one can presume it was part of its Embrace and extend strategy), Microsoft decided to use its own set of opening and closing quotes (i.e. real quotes as opposed to ` (backquote) and ' (single quote)). The problem was that there were already quote symbols in Unicode, which is a character set. As a result most browsers and operating systems, other than Microsoft's, display Microsoft's smart quotes very poorly (see example from PHP Princess - see related links).
Related Links:
- Demoroniser manual page (a Perl application to remove Microsoft's smart quotes)
- A short discussion on writing a PHP application to remove smart quotes
- Fix Microsoft Smart Quotes (I couldn't immediately find a page to test it on)
- Curling Quotes in HTML, SGML, and XML (interesting discussion on best practices on representing quotes in various documents) - read this for interesting bits I missed out
- The Trouble With EM n EN (and Other Shady Characters) (excellent article regarding the correct use of em and en dashes and other things - however, it fails to take into account the fact that not everybody uses a standard US keyboard: ``grave accents-that character on the tilde key'')
This is probably a crappy article as I moved all over the place while writing. And now that I discover why I should use deciamal notation for Unicode (see Curling Quotes in Related Links) after I've gone on to explain how to do it in hexadecimal. Bah! I'm not editing this anymore as I just want to write down the usage of the quotes and the dashes before I go to bed. Web standards and related issues is a very interesting field. But doing big edits to already written blog posts isn't my thing.
I'll just mention:
“ doubles ” (
“
and &8221;
)
‘ singles ’ (
‘
and &8217;
)
0 Comments:
Post a Comment
<< Home