<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: A Unicode editor for Michael</title>
	<atom:link href="http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/feed/" rel="self" type="application/rss+xml" />
	<link>http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/</link>
	<description>... but that doesn&#039;t matter, because I turn it into a sexy dance</description>
	<lastBuildDate>Tue, 25 Oct 2011 15:04:31 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Michael Norrish</title>
		<link>http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/comment-page-1/#comment-62</link>
		<dc:creator>Michael Norrish</dc:creator>
		<pubDate>Thu, 04 Nov 2004 23:28:57 +0000</pubDate>
		<guid isPermaLink="false">http://ken.friislarsen.net/blog/?p=20#comment-62</guid>
		<description>Well, I want to write my programs using the String signature, so that I can ignore the encoding
issues entirely.  Whether or not the underlying representation uses a vector of 32 bit words is
irrelevant.  The question is what the return type of
   String.sub
should be.  The WideString signature says it has to be the WideChar type.  The demands of Unicode
basically require this to be a type capable of representing 2**32 values. (Or maybe it&#039;s 2**20,
but whatever :-)  So, WideChar needs to be four bytes.  If you use a nice underlying UTF-8
representation underneath the String signature, that&#039;s a good implementation, but I don&#039;t want
to see that as a user of the API.</description>
		<content:encoded><![CDATA[<p>Well, I want to write my programs using the String signature, so that I can ignore the encoding<br />
issues entirely.  Whether or not the underlying representation uses a vector of 32 bit words is<br />
irrelevant.  The question is what the return type of<br />
   String.sub<br />
should be.  The WideString signature says it has to be the WideChar type.  The demands of Unicode<br />
basically require this to be a type capable of representing 2**32 values. (Or maybe it&#8217;s 2**20,<br />
but whatever <img src='http://ken.friislarsen.net/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />   So, WideChar needs to be four bytes.  If you use a nice underlying UTF-8<br />
representation underneath the String signature, that&#8217;s a good implementation, but I don&#8217;t want<br />
to see that as a user of the API.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ken</title>
		<link>http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/comment-page-1/#comment-61</link>
		<dc:creator>Ken</dc:creator>
		<pubDate>Thu, 04 Nov 2004 20:50:20 +0000</pubDate>
		<guid isPermaLink="false">http://ken.friislarsen.net/blog/?p=20#comment-61</guid>
		<description>Regarding non-ASCII input, I&#039;d use &lt;a href=&quot;http://gucharmap.sourceforge.net&quot;&gt;gucharmap&lt;/a&gt; (unless it is &lt;code&gt;latin1&lt;/code&gt;-letters, in which case I&#039;d just use the keyboard) or the &quot;Character Palette&quot; applet (assuming Gnome).  But I know that there are other &lt;a href=&quot;http://www.jw-stumpel.nl/stestu.html&quot;&gt;input methods&lt;/a&gt; available.

As for programming in SML, I don&#039;t know what the &quot;right&quot; way to handle Unicode is. &lt;code&gt;WideChar&lt;/code&gt; seems like an adoption of C&#039;s &lt;code&gt;wchar_t&lt;/code&gt; (BTW where did you get the &#039;4 bytes&#039; from? I don&#039;t think that WideChar specifies how many bytes should be used) which is inadequate IMNSHO for handling Unicode, because you really need to deal with the encoding directly.  And for a great many purposes a 4-byte encoding will be terribly wasteful. Thankfully, SML&#039;s normal (and required) &lt;code&gt;CharVector&lt;/code&gt;s (i.e., &lt;code&gt;String&lt;/code&gt;) would be perfect for representing UTF-8 strings.  Thus, you could implement a nice library in pure SML.</description>
		<content:encoded><![CDATA[<p>Regarding non-ASCII input, I&#8217;d use <a href="http://gucharmap.sourceforge.net">gucharmap</a> (unless it is <code>latin1</code>-letters, in which case I&#8217;d just use the keyboard) or the &#8220;Character Palette&#8221; applet (assuming Gnome).  But I know that there are other <a href="http://www.jw-stumpel.nl/stestu.html">input methods</a> available.</p>
<p>As for programming in SML, I don&#8217;t know what the &#8220;right&#8221; way to handle Unicode is. <code>WideChar</code> seems like an adoption of C&#8217;s <code>wchar_t</code> (BTW where did you get the &#8217;4 bytes&#8217; from? I don&#8217;t think that WideChar specifies how many bytes should be used) which is inadequate IMNSHO for handling Unicode, because you really need to deal with the encoding directly.  And for a great many purposes a 4-byte encoding will be terribly wasteful. Thankfully, SML&#8217;s normal (and required) <code>CharVector</code>s (i.e., <code>String</code>) would be perfect for representing UTF-8 strings.  Thus, you could implement a nice library in pure SML.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Norrish</title>
		<link>http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/comment-page-1/#comment-58</link>
		<dc:creator>Michael Norrish</dc:creator>
		<pubDate>Wed, 03 Nov 2004 23:50:11 +0000</pubDate>
		<guid isPermaLink="false">http://ken.friislarsen.net/blog/?p=20#comment-58</guid>
		<description>I didn&#039;t know about gedit.  Thanks.  I&#039;m a bit confused about how I&#039;m supposed to input non-ASCII
characters into it though.  Perhaps what I should have said in my &#039;log entry is that &quot;Emacs doesn&#039;t
support Unicode properly&quot;, and this makes life painful.  Apparently Emacs-22 will be better.

As for programming in SML, presumably the &quot;right&quot; way to do this is to use a WideChar type, and
to hope that the 4 byte characters in memory get written out to disk in nice compact UTF-8 form.
(Or is there some other model implicit in the revised Basis?)</description>
		<content:encoded><![CDATA[<p>I didn&#8217;t know about gedit.  Thanks.  I&#8217;m a bit confused about how I&#8217;m supposed to input non-ASCII<br />
characters into it though.  Perhaps what I should have said in my &#8216;log entry is that &#8220;Emacs doesn&#8217;t<br />
support Unicode properly&#8221;, and this makes life painful.  Apparently Emacs-22 will be better.</p>
<p>As for programming in SML, presumably the &#8220;right&#8221; way to do this is to use a WideChar type, and<br />
to hope that the 4 byte characters in memory get written out to disk in nice compact UTF-8 form.<br />
(Or is there some other model implicit in the revised Basis?)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ken</title>
		<link>http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/comment-page-1/#comment-60</link>
		<dc:creator>Ken</dc:creator>
		<pubDate>Wed, 03 Nov 2004 07:23:45 +0000</pubDate>
		<guid isPermaLink="false">http://ken.friislarsen.net/blog/?p=20#comment-60</guid>
		<description>If you just want to play with it, I think you should be able to just check out mgtk from the &lt;a href=&quot;http://sourceforge.net/cvs/?group_id=23226&quot;&gt;SF CVS&lt;/a&gt;.  Then, go to &lt;code&gt;mgtk/src/defs2sml/release/mgtk&lt;/code&gt; and do a &lt;code&gt;make&lt;/code&gt;; after that, go back to &lt;code&gt;mgtk/examples&lt;/code&gt; and do a &lt;code&gt;make editor&lt;/code&gt;.  Some mucking about with makefiles might be needed.

However, there will be an official release shortly.  Meanwhile just use &lt;a href=&quot;http://gedit.sf.net&quot;&gt;gedit&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>If you just want to play with it, I think you should be able to just check out mgtk from the <a href="http://sourceforge.net/cvs/?group_id=23226">SF CVS</a>.  Then, go to <code>mgtk/src/defs2sml/release/mgtk</code> and do a <code>make</code>; after that, go back to <code>mgtk/examples</code> and do a <code>make editor</code>.  Some mucking about with makefiles might be needed.</p>
<p>However, there will be an official release shortly.  Meanwhile just use <a href="http://gedit.sf.net">gedit</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Norrish</title>
		<link>http://ken.friislarsen.net/blog/2004/11/02/a-unicode-editor-for-michael/comment-page-1/#comment-59</link>
		<dc:creator>Michael Norrish</dc:creator>
		<pubDate>Tue, 02 Nov 2004 22:31:58 +0000</pubDate>
		<guid isPermaLink="false">http://ken.friislarsen.net/blog/?p=20#comment-59</guid>
		<description>Cool!  What do I do to download it and the library?  Or should I wait for the official mGTK release?</description>
		<content:encoded><![CDATA[<p>Cool!  What do I do to download it and the library?  Or should I wait for the official mGTK release?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

