<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Switching to UTF-8</title>
	<atom:link href="http://blog.punbb.org/2007/02/12/switching-to-utf-8/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/</link>
	<description>Straight from the horse's mouth</description>
	<pubDate>Thu, 28 Aug 2008 16:55:11 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6-bleeding</generator>
		<item>
		<title>By: Rickard</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-25476</link>
		<dc:creator>Rickard</dc:creator>
		<pubDate>Wed, 05 Mar 2008 12:34:31 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-25476</guid>
		<description>Reality: Please inform us how we can commit more to UTF-8 and internationalization. I'm dying to know.</description>
		<content:encoded><![CDATA[<p>Reality: Please inform us how we can commit more to UTF-8 and internationalization. I&#8217;m dying to know.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Reality</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-25447</link>
		<dc:creator>Reality</dc:creator>
		<pubDate>Wed, 05 Mar 2008 03:53:42 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-25447</guid>
		<description>The truth is, you aren't as committed to UTF-8 and internationalization like you make it seem in this blog post. Especially you Jérémie.</description>
		<content:encoded><![CDATA[<p>The truth is, you aren&#8217;t as committed to UTF-8 and internationalization like you make it seem in this blog post. Especially you Jérémie.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex&#8217;s Blog &#187; Encoding! eek</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-6959</link>
		<dc:creator>Alex&#8217;s Blog &#187; Encoding! eek</dc:creator>
		<pubDate>Thu, 06 Sep 2007 03:09:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-6959</guid>
		<description>[...] Switching to UTF-8 [...]</description>
		<content:encoded><![CDATA[<p>[...] Switching to UTF-8 [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Asbjørn Ulsberg</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-75</link>
		<dc:creator>Asbjørn Ulsberg</dc:creator>
		<pubDate>Fri, 23 Feb 2007 07:45:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-75</guid>
		<description>I've heard good things about it. Worth a shot, perhaps?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve heard good things about it. Worth a shot, perhaps?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rickard</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-62</link>
		<dc:creator>Rickard</dc:creator>
		<pubDate>Thu, 22 Feb 2007 10:47:24 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-62</guid>
		<description>Yeah, maybe. I have never heard of this particular library though. Not sure how well it works.</description>
		<content:encoded><![CDATA[<p>Yeah, maybe. I have never heard of this particular library though. Not sure how well it works.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Asbjørn Ulsberg</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-58</link>
		<dc:creator>Asbjørn Ulsberg</dc:creator>
		<pubDate>Thu, 22 Feb 2007 10:10:22 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-58</guid>
		<description>Another option is to use the following pure PHP conversion library if iconv isn't available:

http://mikolajj.republika.pl/</description>
		<content:encoded><![CDATA[<p>Another option is to use the following pure PHP conversion library if iconv isn&#8217;t available:</p>
<p><a href="http://mikolajj.republika.pl/" rel="nofollow">http://mikolajj.republika.pl/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rickard</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-55</link>
		<dc:creator>Rickard</dc:creator>
		<pubDate>Thu, 22 Feb 2007 08:31:38 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-55</guid>
		<description>I'm not sure that's such a good idea. If anything, we'll link to iconv binaries for various platforms with very clear instructions on how to use it.</description>
		<content:encoded><![CDATA[<p>I&#8217;m not sure that&#8217;s such a good idea. If anything, we&#8217;ll link to iconv binaries for various platforms with very clear instructions on how to use it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Asbjørn Ulsberg</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-49</link>
		<dc:creator>Asbjørn Ulsberg</dc:creator>
		<pubDate>Wed, 21 Feb 2007 23:20:28 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-49</guid>
		<description>How about providing the iconv conversion through punbb.org somehow? You could even do it automatically and remotely by calling a PHP script on punbb.org from the db_update script. If you do it in a similar way to how the conversion script works, by splitting up the whole task in several smaller tasks, you don't have to think about timeouts either. To protect against DoS attacks on this punbb.org-hosted iconv script, you should probably do some kind of authentication plus a implement a repeat-attack protection. Nothing fancy, but it's better to be on the safe side. :)</description>
		<content:encoded><![CDATA[<p>How about providing the iconv conversion through punbb.org somehow? You could even do it automatically and remotely by calling a PHP script on punbb.org from the db_update script. If you do it in a similar way to how the conversion script works, by splitting up the whole task in several smaller tasks, you don&#8217;t have to think about timeouts either. To protect against DoS attacks on this punbb.org-hosted iconv script, you should probably do some kind of authentication plus a implement a repeat-attack protection. Nothing fancy, but it&#8217;s better to be on the safe side. :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rickard</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-36</link>
		<dc:creator>Rickard</dc:creator>
		<pubDate>Wed, 21 Feb 2007 12:01:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-36</guid>
		<description>Jérémie: You're probably right regarding usernames. There will be a hook somewhere in the registering process and we'll let the community come up with a plan for dealing with that.

Asbjørn: I'm currently working on an extension of the db_update script that will process the database and do the conversion. It will work if the current encoding is iso-8859-1 or if the PHP extension iconv() is available (PHP5 and later) or if the PHP extension mb_string is available. If neither of these are true, we'll instruct the user to dump the database and run iconv on it manually. The conversion script will work like the search re-indexing script (e.g. process X amounts of entries, redirect, process X more etc).</description>
		<content:encoded><![CDATA[<p>Jérémie: You&#8217;re probably right regarding usernames. There will be a hook somewhere in the registering process and we&#8217;ll let the community come up with a plan for dealing with that.</p>
<p>Asbjørn: I&#8217;m currently working on an extension of the db_update script that will process the database and do the conversion. It will work if the current encoding is iso-8859-1 or if the PHP extension iconv() is available (PHP5 and later) or if the PHP extension mb_string is available. If neither of these are true, we&#8217;ll instruct the user to dump the database and run iconv on it manually. The conversion script will work like the search re-indexing script (e.g. process X amounts of entries, redirect, process X more etc).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Asbjørn Ulsberg</title>
		<link>http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-33</link>
		<dc:creator>Asbjørn Ulsberg</dc:creator>
		<pubDate>Wed, 21 Feb 2007 11:19:33 +0000</pubDate>
		<guid isPermaLink="false">http://blog.punbb.org/2007/02/12/switching-to-utf-8/#comment-33</guid>
		<description>Excellent! I find it rather rediculous that PHP and MySQL have taken so extremely long time getting on the Unicode bandwagon. It was inevitable, but still they hesitated for years and years and even more years. And now when the need for UTF-8 is so screaming and gleaming it almost hurts, the problem with migrating to it lies in all developers using PHP and MySQL, like you PunBB developers. I don't envy you! But I think you're doing what's right and a terrific job at it too.

I don't have any genious idea on how you should handle this, but what you could do is create the SQL conversion script from the database (should go pretty quick) so that the user can execute this script in his MySQL admin interface (phpMyAdmin, command line or whatever). Executing this script might take a while, but at least you'd leave the option to the user on how he wants to execute it. You may of course offer to execute it for him, but you should probably warn that it is going to take a while. Remember to set a long timeout on the PHP script that's going to do this! :-)

The actual conversion process can be done by exporting all of the data to an SQL script, dropping all of the database tables, running iconv on the export to get it all to UTF-8, then importing the converted file to create all tables and columns (in UTF-8 if possible) and inserting all the data.</description>
		<content:encoded><![CDATA[<p>Excellent! I find it rather rediculous that PHP and MySQL have taken so extremely long time getting on the Unicode bandwagon. It was inevitable, but still they hesitated for years and years and even more years. And now when the need for UTF-8 is so screaming and gleaming it almost hurts, the problem with migrating to it lies in all developers using PHP and MySQL, like you PunBB developers. I don&#8217;t envy you! But I think you&#8217;re doing what&#8217;s right and a terrific job at it too.</p>
<p>I don&#8217;t have any genious idea on how you should handle this, but what you could do is create the SQL conversion script from the database (should go pretty quick) so that the user can execute this script in his MySQL admin interface (phpMyAdmin, command line or whatever). Executing this script might take a while, but at least you&#8217;d leave the option to the user on how he wants to execute it. You may of course offer to execute it for him, but you should probably warn that it is going to take a while. Remember to set a long timeout on the PHP script that&#8217;s going to do this! :-)</p>
<p>The actual conversion process can be done by exporting all of the data to an SQL script, dropping all of the database tables, running iconv on the export to get it all to UTF-8, then importing the converted file to create all tables and columns (in UTF-8 if possible) and inserting all the data.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
