<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: some things</title>
	<atom:link href="http://blogs.gnome.org/tthurman/2008/12/03/some-things/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.gnome.org/tthurman/2008/12/03/some-things/</link>
	<description>Thomas Thurman does not like cold meals because of broken applications.</description>
	<lastBuildDate>Fri, 30 Jul 2010 12:08:40 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: michael_nz</title>
		<link>http://blogs.gnome.org/tthurman/2008/12/03/some-things/comment-page-1/#comment-394</link>
		<dc:creator>michael_nz</dc:creator>
		<pubDate>Wed, 10 Dec 2008 01:49:57 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.gnome.org/tthurman/?p=172#comment-394</guid>
		<description>Well, I&#039;ve been working on two scripts:

1. A horrible mess of sed and grep, etc., which finds the most common words in an HTML document that are not yet in the pronunciation file.  I go through the results to add more words, and plan to rewrite the script in Python.

2. A short Python program that parses an HTML document with Beautiful Soup, then runs through the text substituting words, ignoring words in capitals (they&#039;re usually acronyms) and optionally italicised ones.  It adds naming dots to capitalised words that don&#039;t begin a paragraph or follow one of &quot;?&quot;, &quot;!&quot;, &quot;.&quot;.

The naming dot thing is a hack, certainly, but it does make things easier to read.  I think leaving heteronyms in Roman, while a little ugly, is also essential for readability.  E.g. translating back from your Firefox extension&#039;s version of the Shavian Wikipedia page gives &quot;kingsley red&quot; for &quot;Kingsley Read&quot;.  I&#039;ve found this sort of thing out along the way, as I&#039;ve now read two and a half books through my system, starting with just a few words in the database and building up to the point where I now get many entire paragraphs in Shavian.

Thanks for starting this whole project and catching my attention.  I didn&#039;t know about Shavian before I saw your first Planet GNOME post, and believe it or not it makes a useful test case for a commercial project I&#039;m working on.</description>
		<content:encoded><![CDATA[<p>Well, I&#8217;ve been working on two scripts:</p>
<p>1. A horrible mess of sed and grep, etc., which finds the most common words in an HTML document that are not yet in the pronunciation file.  I go through the results to add more words, and plan to rewrite the script in Python.</p>
<p>2. A short Python program that parses an HTML document with Beautiful Soup, then runs through the text substituting words, ignoring words in capitals (they&#8217;re usually acronyms) and optionally italicised ones.  It adds naming dots to capitalised words that don&#8217;t begin a paragraph or follow one of &#8220;?&#8221;, &#8220;!&#8221;, &#8220;.&#8221;.</p>
<p>The naming dot thing is a hack, certainly, but it does make things easier to read.  I think leaving heteronyms in Roman, while a little ugly, is also essential for readability.  E.g. translating back from your Firefox extension&#8217;s version of the Shavian Wikipedia page gives &#8220;kingsley red&#8221; for &#8220;Kingsley Read&#8221;.  I&#8217;ve found this sort of thing out along the way, as I&#8217;ve now read two and a half books through my system, starting with just a few words in the database and building up to the point where I now get many entire paragraphs in Shavian.</p>
<p>Thanks for starting this whole project and catching my attention.  I didn&#8217;t know about Shavian before I saw your first Planet GNOME post, and believe it or not it makes a useful test case for a commercial project I&#8217;m working on.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas Thurman</title>
		<link>http://blogs.gnome.org/tthurman/2008/12/03/some-things/comment-page-1/#comment-391</link>
		<dc:creator>Thomas Thurman</dc:creator>
		<pubDate>Mon, 08 Dec 2008 00:22:02 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.gnome.org/tthurman/?p=172#comment-391</guid>
		<description>Yes-- I just put it up here:
http://www.gnome.org/~tthurman/shavian-mangling.txt

I don&#039;t know why WordPress has such a stupid bug.

Thanks for the email: I&#039;d be interested to see what you&#039;re working on.  Some thoughts:

1) Spelling of &quot;Shavian&quot;: yes, you&#039;re right, sorry
2) Naming dot: Not sure how to implement this from the Latin alphabet; we could just add it for any capitalised word not at the beginning of a sentence, but it&#039;s a bit of a hack.
3) cmudict uses a strange Pittsburgh-based accent.
4) I have no good solution for homophones :(</description>
		<content:encoded><![CDATA[<p>Yes&#8211; I just put it up here:<br />
<a href="http://www.gnome.org/~tthurman/shavian-mangling.txt" rel="nofollow">http://www.gnome.org/~tthurman/shavian-mangling.txt</a></p>
<p>I don&#8217;t know why WordPress has such a stupid bug.</p>
<p>Thanks for the email: I&#8217;d be interested to see what you&#8217;re working on.  Some thoughts:</p>
<p>1) Spelling of &#8220;Shavian&#8221;: yes, you&#8217;re right, sorry<br />
2) Naming dot: Not sure how to implement this from the Latin alphabet; we could just add it for any capitalised word not at the beginning of a sentence, but it&#8217;s a bit of a hack.<br />
3) cmudict uses a strange Pittsburgh-based accent.<br />
4) I have no good solution for homophones :(</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: michael_nz</title>
		<link>http://blogs.gnome.org/tthurman/2008/12/03/some-things/comment-page-1/#comment-390</link>
		<dc:creator>michael_nz</dc:creator>
		<pubDate>Mon, 08 Dec 2008 00:13:08 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.gnome.org/tthurman/?p=172#comment-390</guid>
		<description>Hm.  That got cut off at the first Shavian character.  Did you get my emailed version?</description>
		<content:encoded><![CDATA[<p>Hm.  That got cut off at the first Shavian character.  Did you get my emailed version?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: michael_nz</title>
		<link>http://blogs.gnome.org/tthurman/2008/12/03/some-things/comment-page-1/#comment-386</link>
		<dc:creator>michael_nz</dc:creator>
		<pubDate>Thu, 04 Dec 2008 00:53:54 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.gnome.org/tthurman/?p=172#comment-386</guid>
		<description>Inspired by your GNOME Shavian post I&#039;ve learned to read Shavian.  I&#039;ve been developing some scripts and building up a pronunciation database in my accent.

I just thought I&#039;d point out a few bugs in your system (I hope the characters come through here...).
- You&#039;ve got &quot;Shavian&quot; spelled wrong!  It should be &quot;·</description>
		<content:encoded><![CDATA[<p>Inspired by your GNOME Shavian post I&#8217;ve learned to read Shavian.  I&#8217;ve been developing some scripts and building up a pronunciation database in my accent.</p>
<p>I just thought I&#8217;d point out a few bugs in your system (I hope the characters come through here&#8230;).<br />
- You&#8217;ve got &#8220;Shavian&#8221; spelled wrong!  It should be &#8220;·</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Trochee</title>
		<link>http://blogs.gnome.org/tthurman/2008/12/03/some-things/comment-page-1/#comment-385</link>
		<dc:creator>Trochee</dc:creator>
		<pubDate>Wed, 03 Dec 2008 19:50:35 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.gnome.org/tthurman/?p=172#comment-385</guid>
		<description>heh. I just signed up for 5seven5 and added a little haiku about San Francisco&#039;s inverted seasons (rainy season seems spring-like, despite starting in October).

thanks for the tip!</description>
		<content:encoded><![CDATA[<p>heh. I just signed up for 5seven5 and added a little haiku about San Francisco&#8217;s inverted seasons (rainy season seems spring-like, despite starting in October).</p>
<p>thanks for the tip!</p>
]]></content:encoded>
	</item>
</channel>
</rss>
<!-- This Quick Cache file was built for (  blogs.gnome.org/tthurman/2008/12/03/some-things/feed/ ) in 1.17470 seconds, on Feb 10th, 2012 at 2:32 pm UTC. -->
<!-- This Quick Cache file will automatically expire ( and be re-built automatically ) on Feb 10th, 2012 at 3:32 pm UTC -->
