<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ourlil.com &#187; utf8</title>
	<atom:link href="http://OurLil.com/blog/tag/utf8/feed/" rel="self" type="application/rss+xml" />
	<link>http://OurLil.com</link>
	<description>A web site for tutti noi (all of us!)</description>
	<lastBuildDate>Mon, 28 Nov 2011 15:23:22 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Unicode and Character Sets plus MySQL Latin1 to UTF-8 Conversion</title>
		<link>http://OurLil.com/blog/unicode-and-character-sets-plus-mysql-latin1-to-utf8-conversion/456/</link>
		<comments>http://OurLil.com/blog/unicode-and-character-sets-plus-mysql-latin1-to-utf8-conversion/456/#comments</comments>
		<pubDate>Tue, 03 Feb 2009 17:42:18 +0000</pubDate>
		<dc:creator>webmaestro</dc:creator>
				<category><![CDATA[coding]]></category>
		<category><![CDATA[ANSI]]></category>
		<category><![CDATA[ASCII]]></category>
		<category><![CDATA[character set]]></category>
		<category><![CDATA[encoding]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[PHP]]></category>
		<category><![CDATA[UTF-8]]></category>
		<category><![CDATA[utf8]]></category>

		<guid isPermaLink="false">http://clay.freedomblogging.com/2009/02/03/the-absolute-minimum-every-software-developer-absolutely-positively-must-know-about-unicode-and-character-sets-no-excuses-joel-on-software/456/</guid>
		<description><![CDATA[I knew most of this&#8230; but alas, not all of it&#8230; BTW, here&#8217;s a relevant ThinkGeek.com present a friend gave me: I found this interesting article on How To Change An Early WPMU Database from latin1 to utf8 Encoding, which has a bunch of useful links related to character encoding problems, WordPress (WPMU), and MySQL [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright" src="http://www.thinkgeek.com/images/products/zoom/dad.jpg" alt="Binary DAD" width="121" height="145" /></p>
<p>I knew most of this&#8230; but alas, not all of it&#8230; BTW, here&#8217;s a relevant <a title="Where a geek can be a geek..." href="http://ThinkGeek.com" target="_blank">ThinkGeek.com</a> present a friend gave me:</p>
<p>I found this interesting article on <a href="http://edubuzz.org/blogs/david/wordpress/changing-early-wpmu-db-to-utf8/">How To Change An Early WPMU Database from latin1 to utf8 Encoding</a>, which has a bunch of useful links related to character encoding problems, WordPress (WPMU), and MySQL &amp; PHP.</p>
<p>From the article in question:</p>
<blockquote><p>So I have an announcement to make: if you are a programmer working in 2003 and you don&#8217;t know the basics of characters, character sets, encodings, and Unicode, and I <em>catch</em> you, I&#8217;m going to punish you by making you peel onions for 6 months in a submarine. I swear I will.</p>
<p>And one more thing:<br />
<a href="http://www.joelonsoftware.com/pictures/unicode/ascii.png"><br />
</a></p>
<p align="center"><strong>IT&#8217;S NOT THAT HARD.</strong></p>
<p align="left"><a href="http://www.joelonsoftware.com/pictures/unicode/ascii.png"><img class="alignright" src="http://www.joelonsoftware.com/pictures/unicode/ascii.png" alt="Binary DAD" width="132" height="83" /></a>In this article I&#8217;ll fill you in on exactly what <em>every working programmer</em> should know. All that stuff about &#8220;plain text = ascii = characters are 8 bits&#8221; is not only wrong, it&#8217;s hopelessly wrong, and if you&#8217;re still programming that way, you&#8217;re not much better than a medical doctor who doesn&#8217;t believe in germs. Please do not write another line of code until you finish reading this article.</p>
</blockquote>
<p align="left">And then there&#8217;s this juicy tidbit:</p>
<blockquote>
<p align="center"><img src="http://www.joelonsoftware.com/pictures/unicode/hummers.jpg" border="0" alt="" width="390" height="61" /></p>
<p>For a while it seemed like that might be good enough, but programmers were complaining. &#8220;Look at all those zeros!&#8221; they said, since they were Americans and they were looking at English text which rarely used code points above U+00FF. Also they were liberal hippies in California who wanted to <em>conserve (sneer)</em>. If they were Texans they wouldn&#8217;t have minded guzzling twice the number of bytes. But those Californian wimps couldn&#8217;t bear the idea of <em>doubling</em> the amount of storage it took for strings, and anyway, there were already all these doggone documents out there using various ANSI and DBCS character sets and who&#8217;s going to convert them all? <em>Moi?</em> For this reason alone most people decided to ignore Unicode for several years and in the meantime things got worse.</p></blockquote>
<p><a href="http://www.joelonsoftware.com/articles/Unicode.html">The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) &#8211; Joel on Software</a></p>
<p>And some more:</p>
<p><a href="http://www.oreillynet.com/onlamp/blog/2006/01/turning_mysql_data_in_latin1_t.html" target="_blank">Turning MySQL data in latin1 to utf-8 utf8</a></p>
<blockquote><p>Iâ€™ve just finished one of the most difficult and tedious problems Iâ€™ve ever solved, so I have to share the solution here in a little tutorial of how I fixed this, even though Iâ€™m sure there are better ways, this is what worked for me.</p>
<p>THE PROBLEM &#8211; PART 1:<br />
My old CD Baby MySQL database from 1998 was filled with foreign characters and was in MySQLâ€™s default (<strong>latin1</strong>) encoding.<br />
For years, customers and clients had been using our web interface to give us their names, addresses, song titles, bio, and many things in all kinds of alphabets.<br />
I wanted everything to be in <strong>UTF-8</strong>.  (The database, the website, the MySQL client, everything.)</p>
<p>QUICK DEFINITION : &#8220;FOREIGN CHARACTERS&#8221;<br />
When I say &#8220;foreign characters&#8221; I mean not just Greek, Icelandic, Japanese, Chinese, Korean, and others shown at <a href="http://www.omniglot.com/writing/alphabets.htm">Omniglot</a>, but also the curly-quotes, ellipsis, em-dash, and things described at <a href="http://www.alistapart.com/articles/emen/">alistapart</a>.</p></blockquote>
<p>And from <a href="http://alexking.org" target="_blank">AlexKing.org</a> comes <a title="Permanent Link to Fixing a MySQL Character Encoding Mismatch" rel="bookmark" href="http://alexking.org/blog/2008/03/06/mysql-latin1-utf8-conversion">Fixing a MySQL Character Encoding Mismatch</a></p>
<blockquote><p>We ran into an interesting MySQL character encoding issue at <a href="http://crowdfavorite.com/">Crowd Favorite</a> today while working to upgrade and launch a new client site.</p>
<p>Here is what we were trying to do: copy the production database to the staging database so we could properly configure and test everything before pushing the new site live. Pretty simple right? It was, until we noticed a bunch of weird character encoding issues on the staging site.</p>
<p class="center"><a title="Character Encoding Issue by alexkingorg, on Flickr" href="http://www.flickr.com/photos/alexkingorg/2312995897/"><img src="http://farm3.static.flickr.com/2012/2312995897_db067fa495_m.jpg" alt="Character Encoding Issue" width="240" height="50" /></a></p>
<p>It turned out that while the database tables were set to a Latin-1 (latin1), the content that populated those tables was encoded as UTF-8 (utf8).</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://OurLil.com/blog/unicode-and-character-sets-plus-mysql-latin1-to-utf8-conversion/456/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

