<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>Planet Semantico &#187; English</title>
	<atom:link href="http://www.semanticaweb.info/category/english/feed" rel="self" type="application/rss+xml" />
	<link>http://www.semanticaweb.info</link>
	<description>Planeta web semántica español, 3.0 semweb planet</description>
	<pubDate>Tue, 06 Jan 2009 17:45:49 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
	<language>en</language>
			<item>
		<title>Macbook Wheel pushes HCI envelope</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/06/macbook-wheel-pushes-hci-envelope/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/06/macbook-wheel-pushes-hci-envelope/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 17:45:49 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1728</guid>
		<description><![CDATA[The new Macbook Wheel is really pushing the Human Computer Interface envelope.



]]></description>
			<content:encoded><![CDATA[<p>The new Macbook Wheel is really pushing the Human Computer Interface envelope.</p>
<p><center><br />
<object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/s0Gzq-QEt0s&#038;color1=0xb1b1b1&#038;color2=0xcfcfcf&#038;feature=player_embedded&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><embed src="http://www.youtube.com/v/s0Gzq-QEt0s&#038;color1=0xb1b1b1&#038;color2=0xcfcfcf&#038;feature=player_embedded&#038;fs=1" type="application/x-shockwave-flash" allowfullscreen="true" width="425" height="344"></embed></object><br />
</center></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/06/macbook-wheel-pushes-hci-envelope/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Search Box: Keep Your Curious Visitors on Site</title>
		<link>http://blog.hakia.com/?p=562</link>
		<comments>http://blog.hakia.com/?p=562#comments</comments>
		<pubDate>Tue, 06 Jan 2009 08:22:08 +0000</pubDate>
		<dc:creator>hakia Team</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://blog.hakia.com/?p=562</guid>
		<description><![CDATA[With the start of 2009, we have just released a new and improved version of hakia Search Box. To see how it works, go to Search Box Page.
One immediate distinguishing feature of hakia Search Box is its flexibility to search in multiple domains as shown below.







The second distinguishing feature is its sentence highlighting and semantic [...]]]></description>
			<content:encoded><![CDATA[<p>With the start of 2009, we have just released a new and improved version of hakia Search Box. To see how it works, go to <a href="http://club.hakia.com/searchbox.aspx">Search Box Page</a>.</p>
<p>One immediate distinguishing feature of hakia Search Box is its flexibility to search in multiple domains as shown below.</p>
<table>
<tr>
<td>
<img src="http://blog.hakia.com/wp-content/uploads/searchbox.gif" alt="searchbox" title="searchbox"  class="alignnone size-full wp-image-563" />
</td>
</tr>
</table>
<p>The second distinguishing feature is its sentence highlighting and semantic precision (especially with complex, long-tail, and unusual queries) as shown below. Note the uninterrupted text snippets (no ellipses) for Pubmed and health searches.</p>
<table>
<tr>
<td>
<a href="http://blog.hakia.com/wp-content/uploads/searchbox3.gif"><img src="http://blog.hakia.com/wp-content/uploads/searchbox3.gif" alt="searchbox3" title="searchbox3" class="alignnone size-full wp-image-568" /></a>
</td>
</tr>
</table>
<p>There are several ASP and PHP examples on the page with design options as outlined below:</p>
<p>- Web Plus Search (multiple domains as shown above)<br />
- Site search (pick a site to search only its content)<br />
- Pubmed search (search results from 10 million pubmed articles)<br />
- Health search (search results from credible Web sources on health)</p>
<p>It is free up to 30,000 searches per day (which is the highest number offered to date). </p>
<p>Why do you need a good search box on your site? Well, you don&#8217;t want those curious visitors to leave your site and go to a search engine. With a good search box, you will keep them on your Web property.</p>
<p>If you already have a search box on your Web site and you are not sure what to do, you can add hakia&#8217;s search box as a semantic search option. </p>
<p>Give it a try and let us know. </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.hakia.com/?p=562/feed/</wfw:commentRss>
		</item>
		<item>
		<title>UMBC to host 2009 Global Game Jam site</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/06/umbc-to-host-2009-global-game-jam-site/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/06/umbc-to-host-2009-global-game-jam-site/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 07:30:02 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1725</guid>
		<description><![CDATA[UMBC is hosting a site for the 2009 Global Game Jam on January 30-February 1, 2009.  This is a two day game development contest sponsored by the International Game Developers Association and held simultaneously in 49 sites from 20 countries.
At 5pm local time on Friday, January 30, each site will be told the parameters [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/globe_for_web.png" alt="" title="Global Gam Jam 2009" width="198" height="120" class="alignnone size-medium wp-image-1727" align="right" />UMBC is hosting a site for the 2009 <a href="http://www.globalgamejam.com/">Global Game Jam</a> on January 30-February 1, 2009.  This is a two day game development contest sponsored by the <a href="http://www.idga.org/">International Game Developers Association</a> and held simultaneously in <a href="http://www.globalgamejam.com/locations.html">49 sites</a> from 20 countries.</p>
<p>At 5pm local time on Friday, January 30, each site will be told the parameters of the game they all must produce. Participants pitch ideas, form teams, and get to work producing the best game they can in 48 hours. The UMBC site will have a good mix of computers and development platforms including Windows (XP), Mac (Leopard), XBox 360 (with Creators Club), PlayStation 3 (running Linux) with a diverse software environment that inlcludes Visual Studio, Maya, XNA Game Studio, NVIDIA PhysX and Adobe Creative Suite.  For more information see the <a href="http://gaim.umbc.edu/jam/">UMBC Global Game Jam</a> page.</p>
<p>The Global Game Jam participants do not have to be UMBC students, and the Jam is open to participants of all levels of skill and experience.  There is no registration fee for the Baltimore Jam site at UMBC, but space is limited so <a href="http://gaim.umbc.edu/jam/">advance registration</a> is required.</p>
<p>This event is sponsored by the <a href="http://gaim.umbc.edu/">UMBC Games, Animation and Interactive Media</a> program, an innovative academic program with tracks available for students pursuing a degree in computer science or a degree in visual arts.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/06/umbc-to-host-2009-global-game-jam-site/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Pellet 2.0 RC4</title>
		<link>http://clarkparsia.com/weblog/2009/01/05/pellet-20-rc4/</link>
		<comments>http://clarkparsia.com/weblog/2009/01/05/pellet-20-rc4/#comments</comments>
		<pubDate>Mon, 05 Jan 2009 18:16:31 +0000</pubDate>
		<dc:creator>Mike Smith</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://clarkparsia.com/weblog/?p=473</guid>
		<description><![CDATA[	We&#8217;re welcoming 2009 by making a new release candidate for Pellet 2.0 available for download.  Pellet 2.0 RC4 resolves several issues present in previous release candidates, which are documented more fully in a Pellet Trac report.
	The resolved issue that was most likely to have frustrated users was broken import behavior when using the the [...]]]></description>
			<content:encoded><![CDATA[	<p>We&#8217;re welcoming 2009 by making a new release candidate for <a href="http://clarkparsia.com/pellet">Pellet</a> 2.0 available for <a href="http://pellet.owldl.com/download">download</a>.  Pellet 2.0 RC4 resolves several issues present in previous release candidates, which are documented more fully in a <a href="http://cvsdude.com/trac/clark-parsia/pellet-devel/report/11">Pellet Trac report</a>.</p>
	<p>The resolved issue that was most likely to have frustrated users was broken import behavior when using the the Pellet command line with the Jena loader. In addition, our effort to support the <a href="http://www.w3.org/Submission/SWRL/#8">built-ins listed in the SWRL submission</a> is closer to complete now that the team has added implementation of the <a href="http://www.w3.org/Submission/SWRL/#8.6">built-ins for URIs</a>.</p>
	<p>Special thanks to the users who reported issues since most of the changes in this release were made in response to user identified problems.  Keep up the good work by sending your bug reports to the <a href="http://lists.owldl.com/mailman/listinfo/pellet-users">Pellet users mailing list</a>.</p>

 ]]></content:encoded>
			<wfw:commentRss>http://clarkparsia.com/weblog/2009/01/05/pellet-20-rc4/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Ratsimor PhD: Bartering for goods and services in pervasive environments</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/05/ratsimor-phd-bartering-for-goods-and-services-in-pervasive-environments/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/05/ratsimor-phd-bartering-for-goods-and-services-in-pervasive-environments/#comments</comments>
		<pubDate>Mon, 05 Jan 2009 07:30:16 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1724</guid>
		<description><![CDATA[We&#8217;ve been working to get the dissertaions of our recent PhD graduates online.  The latest one is Olga Ratsimor&#8217;s 2007 dissertation on bartering for goods and services in a mobile or pervasive environments.  Here is the citation and abstract.  You can click through on the title to get a pdf copy of [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve been working to get the dissertaions of our recent PhD graduates online.  The latest one is <a href="http://ebiquity.umbc.edu/person/html/Olga/Vladi/Ratsimor/">Olga Ratsimor&#8217;s</a> 2007 dissertation on bartering for goods and services in a mobile or pervasive environments.  Here is the citation and abstract.  You can click through on the title to get a pdf copy of the dissertation.</p>
<hr />
<p>Olga Vladi Ratsimor, <a href="http://ebiquity.umbc.edu/paper/html/id/430/Opportunistic-Bartering-of-Digital-Goods-and-Services-in-Pervasive-Environments">Opportunistic Bartering of Digital Goods and Services in Pervasive Environments</a>, Ph.D. Dissertation in Computer Science, University of Maryland, Baltimore County, August 2007.</p>
<p>The vision of mobile personal devices querying peers in their environment for information such as local restaurant recommendations or directions to the closest gas station, or traffic and weather updates has long been a goal of the pervasive research community. However, considering the diversity and the personal nature of devices participating in pervasive environments it is not feasible to assume that these interactions and collaborations will take place with out economically-driven motivating incentives.</p>
<p>This dissertation presents a novel bartering communication model that provides an underlying framework for incentives for collaborations in mobile pervasive environments by supporting opportunistic serendipitous peer-to-peer bartering for digital goods such as ring tones, MP3’s and podcasts.</p>
<p>To demonstrate viability and advantages of this innovative bartering approach, we compare and contrast the performances of two conventional, frequently employed, peer-to-peer interaction approaches namely Altruists and FreeRiders against two collaborative strategies that employ the Double Coincidence of Wants paradigm from the domain of barter exchanges. In particular, we present our communication framework that represents these collaborative strategies through a set of interaction policies that reflect these strategies.  Furthermore, we present a set of results from our in-depth simulation studies that compare these strategies.</p>
<p>We examine the operation of the nodes employing our framework and executing these four distinct strategies and specifically, we compare the performances of the nodes executing these strategies in homogeneous and heterogeneous networks of mobile devices. We also examine the effects of adding InfoStations to these networks. For each of the strategies, we observe levels of gains and losses that nodes experience as result of collaborative digital good exchanges. We also evaluate communication overhead that nodes incur while looking for possible collaborative exchange. Furthermore, this dissertation offers an in-depth study of the swarm-like inter-strategy dynamics in heterogeneous networks populated with diverse nodes displaying varying levels of collaborative interaction attitudes. Further, the bartering framework is extended by incorporating value-sensitive bartering models that incorporate digital goods and content valuations into the bartering exchange process. In addition, the bartering model is extended by integration of socially influenced collaborative interaction that exploit role based social relationships between mobile peers that populate dynamic mobile environments.</p>
<p>Taken as a whole, the novel research work presented in this dissertation offers the first comprehensive effort that employs and models opportunistic bartering-based collaborative methodology in the context of serendipitous encounters in dynamic mobile peer-to-peer pervasive environments where mobile entities negotiate and exchange digital goods and content.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/05/ratsimor-phd-bartering-for-goods-and-services-in-pervasive-environments/feed/</wfw:commentRss>
		</item>
		<item>
		<title>DHS wants to mine social media for terrorism relatated data</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/05/dhs-wants-to-mine-social-media-for-terrorism-relatated-data/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/05/dhs-wants-to-mine-social-media-for-terrorism-relatated-data/#comments</comments>
		<pubDate>Mon, 05 Jan 2009 05:25:37 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1726</guid>
		<description><![CDATA[USA Today reports (Feds may mine blogs for terrorism clues) that the US Department of Homeland Security wants to use data-mining technology to search blogs and Internet message boards to find those used by terrorists to plan attacks.

&#8220;Blogging and message boards have played a substantial role in allowing communication among those who would do the [...]]]></description>
			<content:encoded><![CDATA[<p>USA Today reports (<a href="http://www.usatoday.com/printedition/news/20081224/terrorblogs24_st.art.htm">Feds may mine blogs for terrorism clues</a>) that the US Department of Homeland Security wants to use data-mining technology to search blogs and Internet message boards to find those used by terrorists to plan attacks.</p>
<blockquote><p>
&#8220;Blogging and message boards have played a substantial role in allowing communication among those who would do the United States harm,&#8221; DHS said in a recent notice.
</p></blockquote>
<p>Julian Sanchez <a href="http://arstechnica.com/journals/law.ars/2008/12/24/what-two-months-wasnt-enough-to-get-the-story-right">notes</a> on Ars Technica that the story is not new.</p>
<blockquote><p>
&#8220;The story is actually pegged to a <a href="https://www.fbo.gov/index?s=opportunity&#038;mode=form&#038;id=dd832afcbd5e504ed8ad342cc4729add&#038;tab=core&#038;_cview=0&#038;cck=1&#038;au=&#038;ck=">Sources Sought Notice</a> posted by the  Department of Homeland Security back in October. Our colleagues at Wired <a href="http://blog.wired.com/defense/2008/10/dhs-scour-blogs.html">reported on it</a>  at the time.&#8221;
</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/05/dhs-wants-to-mine-social-media-for-terrorism-relatated-data/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Wenjia Li: Coping with Node Misbehaviors in MANETs, 4pm Tue 1/6/08, 325b ITE, UMBC</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/04/wenjia-li-coping-with-node-misbehaviors-in-manets-4pm-tue-1608-325b-ite-umbc/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/04/wenjia-li-coping-with-node-misbehaviors-in-manets-4pm-tue-1608-325b-ite-umbc/#comments</comments>
		<pubDate>Mon, 05 Jan 2009 01:41:25 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1722</guid>
		<description><![CDATA[Wenjia Li will present his dissertation proposal on &#8216;A Security Framework to Cope with Node Misbehaviors in Mobile Ad Hoc Networks&#8217; which will be done under the supervision of Professor Anupam Joshi.  The presentation will be at 4:00pm Tuesday, January 6, in ITE 325b.  Here&#8217;s the abstract.
A Mobile Ad-hoc NETwork (MANET), as its [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://ebiquity.umbc.edu/person/html/Wenjia/Li/">Wenjia Li</a> will present his dissertation proposal on &#8216;A Security Framework to Cope with Node Misbehaviors in Mobile Ad Hoc Networks&#8217; which will be done under the supervision of Professor <a href="http://ebiquity.umbc.edu/person/html/Anupam/Joshi/">Anupam Joshi</a>.  The presentation will be at 4:00pm Tuesday, January 6, in <a href="Wenjia Li: Coping with Node Misbehaviors in MANETs, 4pm Tue 1/6/2008, 325b ITE, UMBC">ITE 325b</a>.  Here&#8217;s the abstract.</p>
<p>A Mobile Ad-hoc NETwork (<a href="http://en.wikipedia.org/wiki/Mobile_ad-hoc_network">MANET</a>), as its name suggests, has no fixed infrastructure, and is generally composed of a dynamic set of cooperative peers, which are willing to share their wireless transmission power with other peers so that indirect communication can be possible between nodes that are not in the radio range of each other . The nature of MANETs, such as node mobility, unreliable transmission medium and restricted battery power, makes them extremely vulnerable to a variety of node misbehaviors. Wireless links, for instance, are generally prone to both passive eavesdropping and active intrusion. Another security concern in ad hoc networks is caused by the cooperative nature of the nodes. Attacks from external adversaries may disturb communications, but the external intruder generally cannot directly participate in the cooperative activities among the nodes, such as routing, because they do not possess the proper secure credentials, such as shared keys. However, compromised nodes, which are taken over by an adversary, are capable of presenting the proper secure credentials, and consequently can interfere with almost all of the network operations, such as route discovery, key management and distribution, and packet forwarding. Hence, it is essential to cope with node misbehaviors so as to secure mobile ad hoc networks.</p>
<p>In this dissertation, we address the question of how to ensure that a MANET will properly operate despite the presence of various node misbehaviors. We propose to build a framework that can cope with various node misbehaviors in a wise and adaptive manner. The main purpose of our proposed framework is to provide a platform so that the components that identify and respond to misbehaviors can better cooperate with each other and quickly adapt to the changes of network context. Therefore, policies are planned to be utilized in our framework in order to make those components correctly function in different network contexts. Besides the policy component, there are three other components, which fulfill the tasks of misbehavior detection, trust and reputation management, and route management, respectively. To validate and evaluate our proposed framework, we plan to implement our framework based on simulator.</p>
<p>In particular, the contributions of this dissertation are (i) Develop a framework to combine the functionalities of surveillance and detection of misbehavior, trust and reputation management, route management, and policy management so as to provide a high-level solution to cope with various misbehaviors in MANETs in an intelligent and adaptive manner (ii) Propose and implement a misbehavior detector based on the gossip-based outlier detection method, which relies on neither any pre-defined threshold nor any training data (iii) Take into account both first-hand information (direct observation) and second-hand information (indirect observation) during both misbehavior detection and trust management processes, in which first-hand information and second-hand information are merged by some intelligent methods (iv) Specify and enforce policies in the proposed framework, which makes the framework promptly adapt to the rapidly changing network context.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/04/wenjia-li-coping-with-node-misbehaviors-in-manets-4pm-tue-1608-325b-ite-umbc/feed/</wfw:commentRss>
		</item>
		<item>
		<title>octo.py: quick and easy MapReduce for Python</title>
		<link>http://ebiquity.umbc.edu/blogger/2009/01/02/octopy-quick-and-easy-mapreduce-for-python/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2009/01/02/octopy-quick-and-easy-mapreduce-for-python/#comments</comments>
		<pubDate>Fri, 02 Jan 2009 17:34:37 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1718</guid>
		<description><![CDATA[The amount of free, interesting, and useful data is growing explosively. Luckily, computer are getting cheaper as we speak, they are all connected with a robust communication infrastructure, and software for analyzing data is better than ever.  That&#8217;s why everyone is interested in easy to use frameworks like MapReduce for every-day programmers to run [...]]]></description>
			<content:encoded><![CDATA[<p>The amount of free, interesting, and useful data is growing explosively. Luckily, computer are getting cheaper as we speak, they are all connected with a robust communication infrastructure, and software for analyzing data is better than ever.  That&#8217;s why everyone is interested in easy to use frameworks like <a href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> for every-day programmers to run their data crunching in parallel.</p>
<p><a href="http://code.google.com/p/octopy/">octo.py</a> is a very simple MapReduce like system inspired by Ruby&#8217;s <a href="http://tech.rufy.com/2006/08/mapreduce-for-ruby-ridiculously-easy.html"> Starfish</a>.</p>
<blockquote><p>
&#8220;<a href="http://code.google.com/p/octopy/">Octo.py</a> doesn&#8217;t aim to meet all your distributed computing needs, but its simple approach is amendable to a large proportion of parallelizable tasks. If your code has a for-loop, there&#8217;s a good chance that you can make it distributed with just a few small changes. If you&#8217;re already using Python&#8217;s map() and reduce() functions, the changes needed are trivial!&#8221;
</p></blockquote>
<p>triangular.py is the simple example given in the documentation that is used with octo.py to compute the first 100 <a href="http://wikipedia.org/wiki/Triangular_number">triangular numbers</a>.</p>
<blockquote>
<pre>
# triangular.py compute first 100 triangular numbers. Do
# 'octo.py server triangular.py' on server with address IP
# and 'octo.py client IP' on each client. Server uses source
# &#038; final, sends tasks to clients, integrates results. Clients
# get tasks from server, use mapfn &#038; reducefn, return results.

source = dict(zip(range(100), range(100)))

def final(key, value):
    print key, value

def mapfn(key, value):
    for i in range(value + 1):
        yield key, i

def reducefn(key, value):
    return sum(value)
</pre>
</blockquote>
<p>Put <a href="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/octo.py">octo.py</a> on all of the machines you want to use. On the machine you will use as a server (with ip address &lt;ip&gt;), also install <a href="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2009/01/triangular.py"> triangular.py</a>, and then execute:</p>
<pre>
     python octo.py server triangular.py &amp;
</pre>
<p>On each of your clients, run </p>
<pre>
     python octo.py client &lt;ip&gt; &amp;
</pre>
<p>You can try this out using the same machine to run the server process and one or more client processes, of course.</p>
<p>When the clients register with the server, they will get a copy of <em>triangular.py</em> and wait for tasks from the server.  The server access the data from <em>source</em> and distributed tasks to the clients. These in turn use <em>mapfn</em> and <em>reducefn</em> to complete the tasks, returning the results.  The server integrates these and, when all have completed, invokes <em>final</em>, which in this case just prints the answers, and halts.  The clients continue to run, waiting for more tasks to do. </p>
<p>Octo.py is not a replacement for more sophisticated frameworks like Hadoop or Disco but if you are working in Python, its <a href="http://en.wikipedia.org/wiki/KISS_principle">KISS</a> approach is a good way to get started with the MapReduce paradigm and might be all you need for a small projects.</p>
<p>(Note: The package has not been updated since April 2008, so it&#8217;s status is not clear.  But further development would run the risk of making it more complex and would be self-defeating.)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2009/01/02/octopy-quick-and-easy-mapreduce-for-python/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Help Us Build the 2009 Calais Roadmap</title>
		<link>http://www.opencalais.com/node/12376</link>
		<comments>http://www.opencalais.com/node/12376#comments</comments>
		<pubDate>Wed, 31 Dec 2008 23:19:30 +0000</pubDate>
		<dc:creator>FranSan</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">12376 at http://www.opencalais.com</guid>
		<description><![CDATA[<p>We'd like to thank each and every one of you for contributing to the success of Calais over the last year. From an idea in mid-2007 to our launch in January 2008, it's been an amazing ride.</p>
<p>We're wrapping up the year with 9,000 registered users of the Calais service who are submitting one to two million transactions each day. You've built dozens, if not hundreds, of innovative applications and you've provided regular feedback to help us make the service more useful. Thank you. We truly could not have done it without you.</p>
<p>Now we'd like to ask for a little more help. We're pulling together our roadmap for 2009 and want to <em>hear directly from you about what you'd like to see</em>. As a user-driven project, Calais needs your feedback to make sure we're on the right path.</p>
<p>Please drop us a note at <a href="mailto:2009@opencalais.com" target="_blank" title="2009@opencalais.com">2009@opencalais.com</a> and let us know two things:</p>
<blockquote><p>First: What would you like to see delivered in 2009? New extractions? Languages? New tools or API capabilities? Integration with your favorite content management system? Let us know - nothing is off the table.</p></blockquote>
<blockquote><p>Second: How are you using Calais? Even if it's experimental and not for public consumption, we'd love to hear what you're up to - or even what you're thinking about.</p></blockquote>
<p>Again, thanks for your support. There are some exciting announcements coming with Release 4 of Calais in mid-January and we're looking forward to a great 2009. Your input will help make it so.</p>
<p>Our best wishes for happy and healthy new year,</p>
<p>The Calais Team</p>
]]></description>
			<content:encoded><![CDATA[<p>We'd like to thank each and every one of you for contributing to the success of Calais over the last year. From an idea in mid-2007 to our launch in January 2008, it's been an amazing ride.</p>
<p>We're wrapping up the year with 9,000 registered users of the Calais service who are submitting one to two million transactions each day. You've built dozens, if not hundreds, of innovative applications and you've provided regular feedback to help us make the service more useful. Thank you. We truly could not have done it without you.</p>
<p>Now we'd like to ask for a little more help. We're pulling together our roadmap for 2009 and want to <em>hear directly from you about what you'd like to see</em>. As a user-driven project, Calais needs your feedback to make sure we're on the right path.</p>
<p>Please drop us a note at <a href="mailto:2009@opencalais.com"  title="2009@opencalais.com">2009@opencalais.com</a> and let us know two things:</p>
<blockquote><p>First: What would you like to see delivered in 2009? New extractions? Languages? New tools or API capabilities? Integration with your favorite content management system? Let us know - nothing is off the table.</p></blockquote>
<blockquote><p>Second: How are you using Calais? Even if it's experimental and not for public consumption, we'd love to hear what you're up to - or even what you're thinking about.</p></blockquote>
<p>Again, thanks for your support. There are some exciting announcements coming with Release 4 of Calais in mid-January and we're looking forward to a great 2009. Your input will help make it so.</p>
<p>Our best wishes for happy and healthy new year,</p>
<p>The Calais Team</p>
]]></content:encoded>
			<wfw:commentRss>http://www.opencalais.com/node/12376/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Videos of 2008 ICWSM presentations</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/31/videos-of-2008-icwsm-presentations/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/31/videos-of-2008-icwsm-presentations/#comments</comments>
		<pubDate>Wed, 31 Dec 2008 16:35:41 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1717</guid>
		<description><![CDATA[The submission deadline for the Third International Conference on Weblogs and Social Media is just three weeks away.  To inspire yourself to work on a submission, you can check out videos from the 2008 ICWSM which are online at Videolectures.net.  Here are highlights.

Best paper: Travis Kriplean, Ivan Beschastnikh, David McDonald, Wikipedian Self-Governance in [...]]]></description>
			<content:encoded><![CDATA[<p>The submission deadline for the Third <a href="http://www.icwsm.org/2009/">International Conference on Weblogs and Social Media</a> is just three weeks away.  To inspire yourself to work on a submission, you can check out videos from the <a href="http://www.icwsm.org/2008/">2008 ICWSM</a> which are online at <a href="http://videolectures.net/icwsm08_seattle/">Videolectures.net</a>.  Here are highlights.</p>
<ul>
<li>Best paper: Travis Kriplean, Ivan Beschastnikh, David McDonald, <a href="http://videolectures.net/icwsm08_kriplean_wsga/">Wikipedian Self-Governance in Action: Motivating the Policy Lens</a></li>
<li>Invited talk: Brad Fitzpatrick, <a href="http://videolectures.net/icwsm08_fitzpatrick_dippf/">Decentralization and Interop: past, present, future</a></li>
<li>Invited talk: David Sifry, <a href="http://videolectures.net/icwsm08_sifry_sinew/">Scaling Innovation in a Network Effect World</a></li>
<li>Invited talk: Marc Smith, <a href="http://videolectures.net/icwsm08_smith_sdsm/">Social dimension of social media</a></li>
<li>Tutorial: Mary McGlohon, <a href="http://videolectures.net/icwsm08_mcglohon_gmtsma/">Graph Mining Techniques for Social Media Analysis</a></li>
<li>Tutorial: Jan Wiebe, <a href="http://videolectures.net/icwsm08_wiebe_ssa/">Subjectivity and Sentiment Analysis</a></li>
<li>James Caverlee, <a href="http://videolectures.net/icwsm08_caverlee_lssm/">A Large-Scale Study of MySpace: Observations and Implications for Online Social Networks</a></li>
<li>Aaditeshwar Seth, Jie Zhang, <a href="http://videolectures.net/icwsm08_seth_snba/">A Social Network Based Approach to Personalized Recommendation of Participatory Media Content</a></li>
<li>Michael Gamon, Danyel Fisher, Sumit Basu, Dmitriy Belenko, Matthew Hurst, Arnd Christian König, <a href="http://videolectures.net/icwsm08_gamon_blews/">BLEWS: Using Blogs to Provide Context for News Articles</a></li>
<li>Jiang Yang, Lada Adamic, Mark Ackerman : <a href="http://videolectures.net/icwsm08_yang_cse/">Competing to Share Expertise: the Taskcn Knowledge Sharing Community</a></li>
<li>Jaime Arguello, Jonathan Elsas, Jamie Callan, Jaime Carbonel : <a href="http://videolectures.net/icwsm08_arguello_drqe/">Document Representation and Query Expansion Models for Blog Recommendation</a></li>
<li>Noor Ali-Hasan, <a href="http://videolectures.net/icwsm08_ali_hasan_esms/">Exploring Social Media Scenarios for the Television</a></li>
<li>Hikaru Yamamoto, Naohiro Matsumura, Daisuke Tomozawa, <a href="http://videolectures.net/icwsm08_yamamoto_ficib/">Finding Influencers and Consumer Insights in the Blogosphere</a></li>
<li>Mikhail Bautin, Lohit Vijayarenu, Steven Skiena, <a href="http://videolectures.net/icwsm08_bautin_isanb/">International Sentiment Analysis for News and Blogs</a></li>
<li>Ramesh Nallapati, William Cohen, <a href="http://videolectures.net/icwsm08_nallapati_plsa/">Link-PLSA-LDA: A new unsupervised model for topics and influence of blogs</a></li>
<li>Ian Soboroff, <a href="http://videolectures.net/icwsm08_soboroff_trec/">On TREC Blog Track</a></li>
<li>Jason Kessler, <a href="http://videolectures.net/icwsm08_kessler_ptb/">Polling the Blogosphere: a Rule-Based Approach to Belief Classification</a></li>
<li>Yi-Chia Wang, William Cohen, Mahesh Joshi, Carolyn Rosé : <a href="http://videolectures.net/icwsm08_wang_rits/">Recovering Implicit Thread Structure in Newsgroup Style Conversations</a></li>
<li>Danyel Fisher, Marc Smith, Tammara Combs Turner, <a href="http://videolectures.net/icwsm08_fisher_apoc/">Space Planning for Online Community</a></li>
<li>Kristin Stecher, Scott Counts, <a href="http://videolectures.net/icwsm08_stecher_sipt/">Spontaneous Inference of Personality Traits from Online Profiles</a></li>
<li>Gina Walejko, Thomas Ksiazek, <a href="http://videolectures.net/icwsm08_walejko_ksiazek_pos/">The Politics of Sourcing: A Study of Journalistic Practices in the Blogosphere</a></li>
<li>Nairan Ramírez-Esparza, Cindy Chung, Ewa Kacewicz, James W. Pennebaker, <a href="http://videolectures.net/icwsm08_ramirez_esparza_pwudf/">The Psychology of Word Use in Depression Forums in English and in Spanish</a></li>
<li>Kristin Stecher, Scott Counts, <a href="http://videolectures.net/icwsm08_stecher_tsopa/">Thin Slices of Online Profile Attributes</a></li>
<li>David Evans, Sam Gosling, Anthony Carroll, <a href="http://videolectures.net/icwsm08_evans_weosn/">What Elements of an Online Social Networking Profile Predict Target-Rater Agreement in Personality? </a></li>
<li>Zareen Syed, <a href="http://videolectures.net/icwsm08_syed_wodd/">Wikipedia as an Ontology for Describing Documents</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/31/videos-of-2008-icwsm-presentations/feed/</wfw:commentRss>
		</item>
		<item>
		<title>UMBC ties for first in 2008 Pan-American Intercollegiate Team Chess Championship</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/30/umbc-ties-for-first-in-2008-pan-american-intercollegiate-team-chess-championship/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/30/umbc-ties-for-first-in-2008-pan-american-intercollegiate-team-chess-championship/#comments</comments>
		<pubDate>Tue, 30 Dec 2008 22:03:07 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1715</guid>
		<description><![CDATA[Congratulations to the UMBC Chess team and their advisor and our colleague, UMBC CSEE Professor Alan Sherman, for a first place tie in the 54th Pan-American Intercollegiate Team Chess Championship. 
UMBC tied for first place with University of Texas at Dallas (B Team) in the sixth and final round of the three-day 2008 Pan-Am Championship [...]]]></description>
			<content:encoded><![CDATA[<p>Congratulations to the <a href="http://www.umbc.edu/studentlife/orgs/chess/">UMBC Chess team</a> and their advisor and our colleague, UMBC CSEE Professor <a href="http://www.cs.umbc.edu/~sherman/">Alan Sherman</a>, for a first place tie in the 54th <a href="http://en.wikipedia.org/wiki/Pan_American_Intercollegiate_Team_Chess_Championship">Pan-American Intercollegiate Team Chess Championship</a>. </p>
<p>UMBC tied for first place with University of Texas at Dallas (B Team) in the sixth and final round of the three-day 2008 Pan-Am Championship which was held in Dallas.  This year 29 four-person college teams competed in the annual event which is known as the  &#8220;<em>World Series of College Chess</em>&#8220;.  UMBC has now won the Pan-Am tournament a record eight times.  The final standings are available at <a href="http://www.swchess.com/sce/tourney/PanAms08/PanAmTm.htm">swchess</a>.</p>
<p>The two first place winners will meet again with the third and fourth place teams,  the University of Texas Brownville and Stanford, in the special <em>Final Four of Chess</em> tournament, which is held in spring 2009.</p>
<p><center></p>
<div >
<img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/12/leadumbc.jpg" title="The UMBC chess team: front row, L to R: WGM Sabina Foisor, GM Timur Gareev, GM Sergey Erenburg, and GM Leonid Kritz, board one, Back row: UMBC coaches GM Sam Palantnik and NM Igor Epshteyn. Photo Alexey Root."   /><br />
The UMBC chess team: front row, L to R: WGM Sabina Foisor, GM Timur Gareev, GM Sergey Erenburg, and GM Leonid Kritz, board one, Back row: UMBC coaches GM Sam Palantnik and NM Igor Epshteyn. Photo Alexey Root.
</div>
<p></center><center></center></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/30/umbc-ties-for-first-in-2008-pan-american-intercollegiate-team-chess-championship/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Wikirage tracks whats hot on Wikipedia</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/30/wikirage-tracks-whats-hot-on-wikipedia/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/30/wikirage-tracks-whats-hot-on-wikipedia/#comments</comments>
		<pubDate>Tue, 30 Dec 2008 19:16:08 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1713</guid>
		<description><![CDATA[
 Wikirage is yet another way to track what&#8217;s happening in the world via changes in social media, in this case, Wikipedia.  As the site suggests, &#8220;popular people in the news, the latest fads, and the hottest video games can be quickly identified by monitoring this social phenomenon.&#8221;
Wikirage lists the 100 Wikipedia pages that [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.wikirage.com/rage-image/" alt="Another sense of Wikirage"><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/12/rage.gif" title="Another sense of Wikirage" align="right"/></a><br />
 <a href="http://www.wikirage.com/">Wikirage</a> is yet another way to track what&#8217;s happening in the world via changes in social media, in this case, Wikipedia.  As the site suggests, &#8220;popular people in the news, the latest fads, and the hottest video games can be quickly identified by monitoring this social phenomenon.&#8221;</p>
<p>Wikirage lists the 100 Wikipedia pages that are being heavily edited over any of six time periods from the last hour to the last month.  You can see the top 100 by your choice of six metrics: number of quality edits, unique editors, total edits, vandalism, reversions, or undos.  Clicking on a result shows a monthly summary for the article, for example, <a href="http://www.wikirage.com/wiki/December_2008_Gaza_Strip_airstrikes/">December 2008 Gaza Strip airstrikes</a>, which is at the top of today&#8217;s list for number of edits as I write. I understand the Gaza article, but what&#8217;s up with the <a href="http://www.wikirage.com/wiki/Thylacine/">Tasmanian tiger</a>?</p>
<p>The interface has some other nice features, such as marking pages in red that have high revision, vandalism or undo rates and showing associated Wikipedia flags that indicating articles that need attention or don&#8217;t live up to standards.  Wikirage is also available for the English, Japanese, Spanish, German and French language Wikipedias.  </p>
<p>Wikirage was developed by <a href="http://www.craigsblog.com/">Craig Wood</a> and is a nicely done system.</p>
<p>(via the <a href="http://porn-sex-viagra-casino-spam.com/">Porn Sex Viagra Casino Spam</a> site)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/30/wikirage-tracks-whats-hot-on-wikipedia/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Akshay Java Ph.D.: Mining Social Media Communities and Content</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/30/akshay-java-phd-mining-social-media-communities-and-content/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/30/akshay-java-phd-mining-social-media-communities-and-content/#comments</comments>
		<pubDate>Tue, 30 Dec 2008 18:20:13 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1712</guid>
		<description><![CDATA[Akshay Java defended his PhD dissertation this fall on discovering communities in social media systems and the submitted version is now available online. Akshay is now a scientist at Microsoft&#8217;s Live Labs.   The citation, link and abstract are below.

Akshay Java, Mining Social Media Communities and Content, Ph.D. Dissertation, Department of Computer Science and [...]]]></description>
			<content:encoded><![CDATA[<p>Akshay Java defended his PhD dissertation this fall on discovering communities in social media systems and the submitted version is now available online. Akshay is now a scientist at Microsoft&#8217;s Live Labs.   The citation, link and abstract are below.</p>
<hr />
<p>Akshay Java, Mining Social Media Communities and Content, Ph.D. Dissertation, Department of Computer Science and Electrical Engineering, University of Maryland, Baltimore County, December 1, 2008. Available at <a href="http://ebiquity.umbc.edu/paper/html/id/429/Mining-Social-Media-Communities-and-Content">http://ebiquity.umbc.edu/paper/html/id/429/Mining-Social-Media-Communities-and-Content</a>.</p>
<p>
Social Media is changing the way people find information, share knowledge and communicate with each other. The important factor contributing to the growth of these technologies is the ability to easily produce “user-generated content”. Blogs, Twitter, Wikipedia, Flickr and YouTube are just a few examples of Web 2.0 tools that are drastically changing the Internet landscape today. These platforms allow users to produce and annotate content and more importantly, empower them to share information with their social network. Friends can in turn, comment and interact with the producer of the original content and also with each other. Such social interactions foster communities in online, social media systems. User-generated content and the social graph are thus the two essential elements of any social media system.
</p>
<p>
Given the vast amount of user-generated content being produced each day and the easy access to the social graph, how can we analyze the structure and content of social media data to understand the nature of online communication and collaboration in social applications? This thesis presents a systematic study of the social media landscape through the combined analysis of its special properties, structure and content.
</p>
<p>
First, we have developed a framework for analyzing social media content effectively. The BlogVox opinion retrieval system is a large scale blog indexing and content analysis engine. For a given query term, the system retrieves and ranks blog posts expressing sentiments (either positive or negative) towards the query terms. Further, we have developed a framework to index and semantically analyze syndicated1 feeds from news websites. We use a sophisticated natural language processing system, OntoSem, to semantically analyze news stories and build a rich fact repository of knowledge extracted from real-time feeds. It enables other applications to benefit from such deep semantic analysis by exporting the text meaning representations in Semantic Web language, OWL.
</p>
<p>
Secondly, we describe novel algorithms that utilize the special structure and properties of social graphs to detect communities in social media. Communities are an essential element of social media systems and detecting their structure and membership is critical in several real-world applications. Many algorithms for community detection are computationally expensive and generally, do not scale well for large networks. In this work we present an approach that benefits from the scale-free distribution of node degrees to extract communities efficiently. Social media sites frequently allow users to provide additional meta-data about the shared resources, usually in the form of tags or folksonomies. We have developed a new community detection algorithm that can combine information from tags and the structural information obtained from the graphs to effectively detect communities. We demonstrate how structure and content analysis in social media can benefit from the availability of rich meta-data and special properties.
</p>
<p>
Finally, we study social media systems from the user perspective. In the first study we present an analysis of how a large population of users subscribes and organizes the blog feeds that they read. This study has revealed interesting properties and characteristics of the way we consume information. We are the first to present an approach to what is now known as the “feed distillation” task, which involves finding relevant feeds for a given query term. Based on our understanding of feed subscription patterns we have built a prototype system that provides recommendations for new feeds to subscribe and measures the readership based influence of blogs in different topics.
</p>
<p>
We are also the first to measure the usage and nature of communities in a relatively new phenomena called Microblogging. Microblogging is a new form of communication in which users can describe their current status in short posts distributed by instant messages, mobile phones, email or the Web. In this study, we present our observations of the microblogging phenomena and user intentions by studying the content, topological and geographical properties of such communities. We find that microblogging provides users with a more immediate form of communication to talk about their daily activities and to seek or share information.
</p>
<p>
The course of this research has highlighted several challenges that processing social media data presents. This class of problems requires us to re-think our approach to text mining, community and graph analysis. Comprehensive understanding of social media systems allows us to validate theories from social sciences and psychology, but on a scale much larger than ever imagined. Ultimately this leads to a better understanding of how we communicate and interact with each other today and in future.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/30/akshay-java-phd-mining-social-media-communities-and-content/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The true cost of sending SMS messages</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/28/the-true-cost-of-sending-sms-messages/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/28/the-true-cost-of-sending-sms-messages/#comments</comments>
		<pubDate>Sun, 28 Dec 2008 14:24:57 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1711</guid>
		<description><![CDATA[The NYT has an article, What Carriers Aren’t Eager to Tell You About Texting , about new interest in understanding why charges for SMS service has been increasing even while volume is up and communication costs are down.
I learned one interesting thing from the article about the length of SMS messages.  I&#8217;d never thought [...]]]></description>
			<content:encoded><![CDATA[<p>The NYT has an article, <a href="http://www.nytimes.com/2008/12/28/business/28digi.html">What Carriers Aren’t Eager to Tell You About Texting </a>, about new interest in understanding why charges for SMS service has been increasing even while volume is up and communication costs are down.</p>
<p>I learned one interesting thing from the article about the length of SMS messages.  I&#8217;d never thought much about where the limit on the number of characters came from.  According to the article, the limit is 160 (7 bit) characters because that&#8217;s what will fit into the control channel messages that mobile phones exchange with cell towers.</p>
<blockquote><p> &#8220;The lucrative nature of that revenue increase cannot be appreciated without doing something that T-Mobile chose not to do, which is to talk about whether its costs rose as the industry’s messaging volume grew tenfold. Mr. Kohl’s letter of inquiry noted that “text messaging files are very small, as the size of text messages are generally limited to 160 characters per message, and therefore cost carriers very little to transmit.” A better description might be “cost carriers very, very, very little to transmit.”</p>
<p>A text message initially travels wirelessly from a handset to the closest base-station tower and is then transferred through wired links to the digital pipes of the telephone network, and then, near its destination, converted back into a wireless signal to traverse the final leg, from tower to handset. In the wired portion of its journey, a file of such infinitesimal size is inconsequential. Srinivasan Keshav, a professor of computer science at the University of Waterloo, in Ontario, said: “Messages are small. Even though a trillion seems like a lot to carry, it isn’t.”</p>
<p>Perhaps the costs for the wireless portion at either end are high — spectrum is finite, after all, and carriers pay dearly for the rights to use it. But text messages are not just tiny; they are also free riders, tucked into what’s called a control channel, space reserved for operation of the wireless network. That’s why a message is so limited in length: it must not exceed the length of the message used for internal communication between tower and handset to set up a call. The channel uses space whether or not a text message is inserted.&#8221;  </p></blockquote>
<p>There&#8217;s a lot more to the protocols, of course.  The <a href="http://en.wikipedia.org/wiki/Short_message_service">Wikipedia SMS article</a> looks like a good place to start.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/28/the-true-cost-of-sending-sms-messages/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The true cost of sending SMS messages</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/28/the-true-cost-of-sending-sms-messages/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/28/the-true-cost-of-sending-sms-messages/#comments</comments>
		<pubDate>Sun, 28 Dec 2008 14:24:57 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1711</guid>
		<description><![CDATA[The NYT has an article, What Carriers Aren’t Eager to Tell You About Texting , about new interest in understanding why charges for SMS service has been increasing even while volume is up and communication costs are down.
I learned one interesting thing from the article about the length of SMS messages.  I&#8217;d never thought [...]]]></description>
			<content:encoded><![CDATA[<p>The NYT has an article, <a href="http://www.nytimes.com/2008/12/28/business/28digi.html">What Carriers Aren’t Eager to Tell You About Texting </a>, about new interest in understanding why charges for SMS service has been increasing even while volume is up and communication costs are down.</p>
<p>I learned one interesting thing from the article about the length of SMS messages.  I&#8217;d never thought much about where the limit on the number of characters came from.  According to the article, the limit is 160 (7 bit) characters because that&#8217;s what will fit into the control channel messages that mobile phones exchange with cell towers.</p>
<blockquote><p> &#8220;The lucrative nature of that revenue increase cannot be appreciated without doing something that T-Mobile chose not to do, which is to talk about whether its costs rose as the industry’s messaging volume grew tenfold. Mr. Kohl’s letter of inquiry noted that “text messaging files are very small, as the size of text messages are generally limited to 160 characters per message, and therefore cost carriers very little to transmit.” A better description might be “cost carriers very, very, very little to transmit.”</p>
<p>A text message initially travels wirelessly from a handset to the closest base-station tower and is then transferred through wired links to the digital pipes of the telephone network, and then, near its destination, converted back into a wireless signal to traverse the final leg, from tower to handset. In the wired portion of its journey, a file of such infinitesimal size is inconsequential. Srinivasan Keshav, a professor of computer science at the University of Waterloo, in Ontario, said: “Messages are small. Even though a trillion seems like a lot to carry, it isn’t.”</p>
<p>Perhaps the costs for the wireless portion at either end are high — spectrum is finite, after all, and carriers pay dearly for the rights to use it. But text messages are not just tiny; they are also free riders, tucked into what’s called a control channel, space reserved for operation of the wireless network. That’s why a message is so limited in length: it must not exceed the length of the message used for internal communication between tower and handset to set up a call. The channel uses space whether or not a text message is inserted.&#8221;  </p></blockquote>
<p>There&#8217;s a lot more to the protocols, of course.  The <a href="http://en.wikipedia.org/wiki/Short_message_service">Wikipedia SMS article</a> looks like a good place to start.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/28/the-true-cost-of-sending-sms-messages/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Yongmei Shi PhD: Linguistic Information for Speech Recognition Error Detection</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/26/yongmei-shi-phd-linguistic-information-for-speech-recognition-error-detection/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/26/yongmei-shi-phd-linguistic-information-for-speech-recognition-error-detection/#comments</comments>
		<pubDate>Sat, 27 Dec 2008 02:21:02 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1710</guid>
		<description><![CDATA[Yongmei Shi defended her PhD dissertation earlier this fall on using syntactic and semantic information to detect errors in spoken language systems under the direction of Dr. R. Scott Cost (JHU/APL) and Professor Lina Zhou (UMBC).  Her dissertation has been submitted an is now available online.

Yongmei Shi, An Investigation of Linguistic Information for Speech [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://ebiquity.umbc.edu/person/html/Yongmei/Shi/">Yongmei Shi</a> defended her PhD dissertation earlier this fall on using syntactic and semantic information to detect errors in spoken language systems under the direction of Dr. <a href="http://ebiquity.umbc.edu/person/html/R./Scott/Cost/">R. Scott Cost</a> (JHU/APL) and Professor <a href="http://userpages.umbc.edu/~zhoul/">Lina Zhou</a> (UMBC).  Her dissertation has been submitted an is now available online.</p>
<p></p>
<p>Yongmei Shi, <a href="http://ebiquity.umbc.edu/paper/html/id/428/An-Investigation-of-Linguistic-Information-for-Speech-Recognition-Error-Detection">An Investigation of Linguistic Information for Speech Recognition Error Detection</a>, Ph.D. Dissertation, Department of Computer Science and Electrical Engineering, University of Maryland, Baltimore County, October 2008.</p>
<p>After several decades of effort, signi?cant progress has been made in the area of speech recognition technologies, and various speech-based applications have been developed. However, current speech recognition systems still generate erroneous output, which hinders the wide adoption of speech applications. Given that the goal of error-free output can not be realized in near future, mechanisms for automatically detecting and even correcting speech recognition errors may prove useful for amending imperfect speech recognition systems. This dissertation research focuses on the automatic detection of speech recognition errors for monologue applications, and in particular, dictation applications.</p>
<p>Due to computational complexity and ef?ciency concerns, limited linguistic information is embedded in speech recognition systems. Furthermore, when identifying speech recognition errors, humans always apply linguistic knowledge to complete the task. This dissertation therefore investigates the effect of linguistic information on automatic error detection by applying two levels of linguistic analysis, speci?cally syntactic analysis and semantic analysis, to the post processing of speech recognition output. Experiments are conducted on two dictation corpora which differ in both topic and style (daily of?ce communication by students and Wall Street Journal news by journalists).</p>
<p>To catch grammatical abnormalities possibly caused by speech recognition errors, two sets of syntactic features, linkage information and word associations based on syntactic dependency, are extracted for each word from the output of two lexicalized robust syntactic parsers respectively. Con?dence measures, which combine features using Support Vector Machines, are used to detect speech recognition errors. A con?dence measure that combines syntactic features with non-linguistic features yields consistent performance improvement in one or more aspects over those obtained by using non-linguistic features alone.</p>
<p>Semantic abnormalities possibly caused by speech recognition errors are caught by the analysis of semantic relatedness of a word to its context. Two different methods are used to integrate semantic analysis with syntactic analysis. One approach addresses the problem by extracting features for each word from its relations to other words. To this end, various WordNet-based measures and different context lengths are examined. The addition of semantic features in con?dence measures can further yield small but consistent improvement in error detection performance. The other approach applies lexical cohesion analysis by taking both reiteration and collocation relationships into consideration and by augmenting words with probability predicted from syntactic analysis. Two WordNet-based measures and one measure based on Latent Semantic Analysis are used to instantiate lexical cohesion relationships. Additionally, various word probability thresholds and cosine similarity thresholds are examined. The incorporation of lexical cohesion analysis is superior to the use of syntactic analysis alone. In summary, the use of linguistic information as described, including syntactic and semantic information, can provide positive impact on automatic detection of speech recognition errors.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/26/yongmei-shi-phd-linguistic-information-for-speech-recognition-error-detection/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Social media conferences, sympoisa, workshops and events</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/25/social-media-conferences-sympoisa-workshops-and-events/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/25/social-media-conferences-sympoisa-workshops-and-events/#comments</comments>
		<pubDate>Thu, 25 Dec 2008 16:47:11 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1709</guid>
		<description><![CDATA[JD Lasica&#8217;s SocialMedia.biz blog has a post, 2009 conferences: Social media, tech, marketing, that lists &#8220;some of the best social media, technology, media and marketing conferences for the upcoming year&#8221; in the US.  The list doesn&#8217;t include any technology research-oriented conferences, but does have quite a range of others.  The post invites everyone [...]]]></description>
			<content:encoded><![CDATA[<p>JD Lasica&#8217;s SocialMedia.biz blog has a post, <a href="http://www.socialmedia.biz/2008/12/2009-conference.html">2009 conferences: Social media, tech, marketing</a>, that lists &#8220;some of the best social media, technology, media and marketing conferences for the upcoming year&#8221; in the US.  The list doesn&#8217;t include any technology research-oriented conferences, but does have quite a range of others.  The post invites everyone to suggest additional entries by adding comments about them.  (I suggested <a href="http://www.icwsm.org/2009/index.shtml">ICWSM</a> and the AAAI Spring Symposium on the <a href="http://tw.rpi.edu/portal/index.php/Social_Web_Semantics">Social Semantic Web</a>.)</p>
<p>This list complements Akshay Java&#8217;s <a href="http://socialmedia.typepad.com/blog/social-media-events.html">Social Media Events</a> calendar which is focused mostly on research conferences.  He also invites suggestions which you can submit by email or through comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/25/social-media-conferences-sympoisa-workshops-and-events/feed/</wfw:commentRss>
		</item>
		<item>
		<title>WWGD: Understanding Google’s Technology Stack</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/24/wwgd-understanding-googles-technology-stack/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/24/wwgd-understanding-googles-technology-stack/#comments</comments>
		<pubDate>Wed, 24 Dec 2008 16:15:38 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1708</guid>
		<description><![CDATA[It&#8217;s popular to ask &#8220;What Would Google Do&#8221; these days &#8212; The Google reports over 7,000 results for the phrase.  Of course, it&#8217;s not just about Google, which we all use as the archetype for a new Web way of building and thinking about information systems.  Asking WWGD can be productive, but only [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s popular to ask &#8220;What Would Google Do&#8221; these days &#8212; The Google reports over <a href="http://www.google.com/search?num=20&#038;hl=en&#038;safe=off&#038;q=%22what+would+google+do%22&#038;btnG=Search">7,000 results</a> for the phrase.  Of course, it&#8217;s not just about Google, which we all use as the archetype for a new Web way of building and thinking about information systems.  Asking WWGD can be productive, but only if we know how to implement and exploit the insights the answer gives us.  This in turn requires us (well, some of us, anyway) to understand the algorithms, techniques, and software technology that Google and other large scale Web-oriented companies use.  We need to ask &#8220;How Would Google Do It&#8221;.</p>
<p><a href="http://michaelnielsen.org/blog/?page_id=181">Michael Nielsen</a> has a nice post on <a href="http://michaelnielsen.org/blog/?p=523">using your laptop to compute PageRank for millions of webpages</a>. His posts reviews PageRank and how to compute it and shows a short, but reasonably efficient, Python program that can easily do a graph with a few million nodes.  While not sufficient for many applications, like the Web, there are lots of interesting and significant graphs this small Python program can handle &#8212; Wikipedia pages, DBLP publications, RDF namespaces, BGP routers, Twitter followers, etc.</p>
<p>The post is part of a series Nielsen is making on the <a href="http://michaelnielsen.org/blog/?page_id=503">Google Technology Stack</a> including PageRank, MapReduce, BigTable, and GFS.  The posts are a byproduct of a series of weekly lectures he&#8217;s giving starting earlier this month in Waterloo.  Here&#8217;s the way that Nielsen describes the series.</p>
<blockquote><p> &#8220;Part of what makes Google such an amazing engine of innovation is their internal technology stack: a set of powerful proprietary technologies that makes it easy for Google developers to generate and process enormous quantities of data. According to a senior Microsoft developer who moved to Google, Googlers work and think at a higher level of abstraction than do developers at many other companies, including Microsoft: “Google uses Bayesian filtering the way Microsoft uses the if statement” (<a href="http://www.joelonsoftware.com/items/2005/10/17.html">Credit: Joel Spolsky</a>). This series of posts describes some of the technologies that make this high level of abstraction possible.&#8221;  </p></blockquote>
<p>Videos of the first two lectures, <a href="http://michaelnielsen.org/blog/?p=509">Introducion to PageRank</a> and <a href="http://michaelnielsen.org/blog/?p=520">Building our PageRank Intuition</a>) are available online.  Nielsen illustrates the concepts and algorithms with well-written Python code and provides exercises to help readers master the material as well as &#8220;more challenging and often open-ended problems&#8221; which he has worked on but not completely solved. </p>
<p><a href="http://en.wikipedia.org/wiki/Michael_Nielsen">Nielsen</a> was trained as a as a theoretical Physicist but has shifted his attention to “the development of new tools for scientific collaboration and publication”.  As far as I can see, he is offering these as free public lectures out of a desire to share his knowledge and also to help (or maybe force) him to deepen his own understanding of the topics and develop better ways of explaining them.  In both cases, it an admirable and inspiring example for us all and appropriate for the holiday season.  Merry Christmas!</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/24/wwgd-understanding-googles-technology-stack/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Make Your Own Digital Newspaper</title>
		<link>http://blog.hakia.com/?p=541</link>
		<comments>http://blog.hakia.com/?p=541#comments</comments>
		<pubDate>Tue, 23 Dec 2008 07:33:39 +0000</pubDate>
		<dc:creator>hakia Team</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://blog.hakia.com/?p=541</guid>
		<description><![CDATA[Before entering the holidays, one may wish that the news we get everyday were somehow customized to our interests. For example, &#8220;I am not really interested in Baseball, or I like Jazz news to appear in my first glance view, or I need to monitor emerging progress about synthetic insulin, or&#8230;&#8221; People can have variety [...]]]></description>
			<content:encoded><![CDATA[<p>Before entering the holidays, one may wish that the news we get everyday were somehow customized to our interests. For example, &#8220;I am not really interested in Baseball, or I like Jazz news to appear in my first glance view, or I need to monitor emerging progress about synthetic insulin, or&#8230;&#8221; People can have variety of first-grade interests, but they have to collect these information from different places everyday, or through clicking bunch of links. Why not have my own newspaper where every column is about my selected interest, laid out in the way I want?</p>
<p>We built <a href="http://my.hakia.com">my.hakia.com</a>, which does exactly what is described above. A screenshot is shown below.</p>
<table>
<tr>
<td>
<img src="http://blog.hakia.com/wp-content/uploads/myhakia1.gif" alt="myhakia1" title="myhakia1" class="alignnone size-full wp-image-542" /></a>
</td>
</tr>
</table>
<p>The screenshot above tells the whole story except one important differentiator. </p>
<p>Semantic technology of hakia allows high-level of precision compared to any other similar platform. This enables the user to park highly specific questions against the emerging news. Therefore, <a href="http://my.hakia.com ">my.hakia.com </a>can be considered as &#8220;intelligence gathering dashboard&#8221;. Let us tell you how.</p>
<p>If you search Google news for <a href="http://news.google.com/news?hl=en&#038;ned=us&#038;nolr=1&#038;q=Obama%27s+strategy+for+the+new+team">Obama&#8217;s strategy for the new team</a>, you will see that the results are mostly irrelevant. Try to create a Google alert for this query and see the results for yourself on a continuous basis.</p>
<p>The same search at hakia for <a href="http://news.hakia.com/search.aspx?q=Obama's%20strategy%20for%20the%20new%20team&#038;lang=en&#038;sort=byrel">Obama&#8217;s strategy for the new team</a> produces dead-on results. This is because semantic technology does not need &#8220;link referrals&#8221; to pull relevant results unlike Google-esque search engines. For dynamic content like news, there is no time to collect &#8220;Link-referral&#8221; statistics and that is why Google-esque search engines fail beyond simple triggers. Try the same query at <a href="http://search.yahoo.com/search?p=Obama%27s+strategy+for+the+new+team&#038;ygmasrchbtn=web+search&#038;fr=ush-news">Yahoo</a>, it displays the same confusion.</p>
<p>This fundamental differentiation is a valuable asset for <a href="http://my.hakia.com ">my.hakia.com</a> users because they will be getting precise results for specific interests that they cannot get it anywhere else. Some ideal cases are outlined below;</p>
<p>- Monitor your business competitors<br />
- Get information on latest progress in the treatment of diseases<br />
- Keep an eye on your favorite artists by activity (like album releases)<br />
- Stay in touch with particular economic developments (such as in real estate in your city)</p>
<p>Try <a href="http://my.hakia.com">my.hakia.com</a>, and tell us if we have met your expectations.</p>
<p>Happy holidays</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.hakia.com/?p=541/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Videos of Semantic Web talks and tutorials from ISWC 2008 now online</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/22/videos-of-semantic-web-talks-and-tutorials-from-iswc-2008-now-online/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/22/videos-of-semantic-web-talks-and-tutorials-from-iswc-2008-now-online/#comments</comments>
		<pubDate>Mon, 22 Dec 2008 14:45:51 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1707</guid>
		<description><![CDATA[High quality videos of tutorials and talks from the Seventh International Semantic Web Conference are now available on the excellent VideoLectures.net site.  It&#8217;s a great opportunity to benefit from the conference if you were not able to attend or, even if you were, to see presentations you were not able to attend.
Videolectures captured the [...]]]></description>
			<content:encoded><![CDATA[<p>High quality videos of tutorials and talks from the Seventh International Semantic Web Conference are now available on the excellent <a href="http://videolectures.net/iswc08_karlsruhe/">VideoLectures.net</a> site.  It&#8217;s a great opportunity to benefit from the conference if you were not able to attend or, even if you were, to see presentations you were not able to attend.</p>
<p>Videolectures captured the slides for most of the presentations (which are available for downloading) and their site shows both the the speaker&#8217;s video and slides in synchronization.  Videolectures used three camera crews in parallel so were able to capture almost all of the presentations.  Here are some highlights from the ~90 videos to whet your appetite.</p>
<ul>
<li><a href="http://videolectures.net/iswc08_hendler_ittsw/">Introduction to the Semantic Web</a>: An all-day tutorial featuring ten top researchers covering all aspects of the Semantic Web.</li>
<li><a href="http://videolectures.net/iswc08_heath_hpldw/">How to Publish Linked Data on the Web</a>: A three-hour tutorial by five researchers explaining how LOD works and how you can exploit it.</li>
<li> <a href="http://videolectures.net/iswc08_giannandrea_fowdw/">Freebase: An Open, Writable Database of the World’s Information</a>: A keynote by MetaWeb CTO John Giannandrea on Freebase.</li>
<li><a href="http://videolectures.net/iswc08_swcbtc/">Semantic Web Challenge &#038; Billion Triple Challenge</a>: The session on the Semantic Web Challenge finalists.</li>
<li><a href="http://videolectures.net/iswc08_greaves_swfttsotsw/">Semantic Wikis: Fusing the two strands of the Semantic Web</a>: Mark Greaves on Semantic Wikis.</li>
<li><a href="http://videolectures.net/iswc08_verma_rattfaasrd/">Requirements Critic: A Tool for Automatically Analyzing Software Requirements Documents</a>: The winner of the best paper award for the In Use track.</li>
<li><a href="http://videolectures.net/iswc08_horridge_lpjowl/">Laconic and Precise Justifications in OWL</a>: Winner of the best paper award for the Research Track.</li>
<li><a href="http://videolectures.net/iswc08_panel_schneider_owl/">An OWL 2 Far?</a>: A panel that takes up the question of whether having standard languages based on formal methods with steadily increasing power is the right way to support the Semantic Web.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/22/videos-of-semantic-web-talks-and-tutorials-from-iswc-2008-now-online/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Tom Briggs Ph.D.: Constraint Generation and Reasoning in OWL</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/22/tom-briggs-phd-constraint-generation-and-reasoning-in-owl/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/22/tom-briggs-phd-constraint-generation-and-reasoning-in-owl/#comments</comments>
		<pubDate>Mon, 22 Dec 2008 08:00:25 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1706</guid>
		<description><![CDATA[Tom Briggs defended his PhD dissertation last month on discovering domain and range constraints in OWL and the final copy is now available.
Thomas H. Briggs, Constraint Generation and Reasoning in OWL, 2008.
The majority of OWL ontologies in the emerging SemanticWeb are constructed from properties that lack domain and range constraints. Constraints in OWL are different [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://ebiquity.umbc.edu/person/html/Tom/Briggs/">Tom Briggs</a> defended his PhD dissertation last month on discovering domain and range constraints in OWL and the final copy is now available.</p>
<p>Thomas H. Briggs, <a href="http://ebiquity.umbc.edu/paper/html/id/427/Constraint-Generation-and-Reasoning-in-OWL">Constraint Generation and Reasoning in OWL</a>, 2008.</p>
<p>The majority of OWL ontologies in the emerging SemanticWeb are constructed from properties that lack domain and range constraints. Constraints in OWL are different from the familiar uses in programming languages and databases. They are actually type assertions that are made about the individualswhich are connected by the property. Because they are type assertions these assertions can add vital information to the individuals involved and give information on how the defining property may be used. Three different automated generation techniques are explored in this research: disjunction, least-common named subsumer, and vivification. Each algorithm is compared for the ability to generalize, and the performance impacts with respect to the reasoner. A large sample of ontologies from the Swoogle repository are used to compare real-world performance of these techniques. Using generated facts is a type of default reasoning. This may conflict with future assertions to the knowledge base. While general default reasoning is non-monotonic and undecidable a novel approach is introduced to support efficient contraction of the default knowledge.  Constraint generation and default reasoning, together, enable a robust and efficient generation of domain and range constraints which will result in the inference of additional facts and improved performance for a number of Semantic Web applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/22/tom-briggs-phd-constraint-generation-and-reasoning-in-owl/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Disco: a Map reduce framework in Python and Erlang</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/21/disco-a-map-reduce-framework-in-python-and-erlang/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/21/disco-a-map-reduce-framework-in-python-and-erlang/#comments</comments>
		<pubDate>Sun, 21 Dec 2008 17:45:43 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1705</guid>
		<description><![CDATA[Disco is a Python-friendly, open-source Map-Reduce framework for distributed computing with the slogan &#8220;massive data - minimal code&#8221;.  Disco&#8217;s core is written in Erlang, a functional language designed for concurrent programming, and users typically write Disco map and reduce jobs in Python. So what&#8217;s wrong with using Hadoop?  Nothing, according to the Disco [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://discoproject.org/">Disco</a> is a Python-friendly, open-source Map-Reduce framework for distributed computing with the slogan <i>&#8220;massive data - minimal code&#8221;</i>.  Disco&#8217;s core is written in <a href="http://en.wikipedia.org/wiki/Erlang_(programming_language)">Erlang</a>, a functional language designed for concurrent programming, and users typically write Disco map and reduce jobs in Python. So what&#8217;s wrong with using <a href="http://en.wikipedia.org/wiki/Hadoop">Hadoop</a>?  Nothing, according to the Disco site, but&#8230;<br />
<blockquote> &#8220;We see that platforms for distributed computing will be of such high importance in the future that it is crucial to have a wide variety of different approaches which produces healthy competition and co-evolution between the projects. In this respect, Hadoop and Disco can be seen as complementary projects, similar to Apache, Lighttpd and Nginx.</p>
<p>It is a matter of taste whether Erlang and Python are more suitable for the task than Java. We feel much more productive with Python than with Java. We also feel that Erlang is a perfect match for the Disco core that needs to handle tens of thousands of tasks in parallel.</p>
<p>Thanks to Erlang, the Disco core remarkably compact, currently less than 2000 lines of code. It is relatively easy to understand how the core works, and start experimenting with it or adapt it to new environments. Thanks to Python, it is easy to add new features around the core which ensures that Disco can respond quickly to real-world needs.&#8221;
</p></blockquote>
<p>The <a href="http://discoproject.org/doc/start/tutorial.html">Disco tutorial</a> uses the standard word counting task to show how to set up and use Disco on both a local cluster and Amazon EC2. There is also <a href="http://discoproject.org/doc/py/homedisco.html">homedisco</a>, which lets programmers develop, debug, profile and test Disco functions on one local machine before running on a cluster. The word counting example from the tutorial is certainly nicely compact:</p>
<blockquote>
<div >
<pre>
from disco.core import Disco, result_iterator

def fun_map(e, params):
    return [(w, 1) for w in e.split()]

def fun_reduce(iter, out, params):
    s = {}
    for w, f in iter:
        s[w] = s.get(w, 0) + int(f)
    for w, f in s.iteritems():
        out.add(w, f)

results = Disco(&#8221;disco://localhost&#8221;).new_job(
		name = &#8220;wordcount&#8221;,
                input = ["http://discoproject.org/chekhov.txt"],
                map = fun_map,
		reduce = fun_reduce).wait()

for word, frequency in result_iterator(results):
	print word, frequency
</pre>
</div>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/21/disco-a-map-reduce-framework-in-python-and-erlang/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Library of Congress forces LCSH Linked Data site to shut down</title>
		<link>http://feeds.feedburner.com/~r/talis/panlibus/~3/491442346/library-of-congress-forces-lcsh-linked-data-site-shut-down.php</link>
		<comments>http://feeds.feedburner.com/~r/talis/panlibus/~3/491442346/library-of-congress-forces-lcsh-linked-data-site-shut-down.php#comments</comments>
		<pubDate>Sun, 21 Dec 2008 15:21:04 +0000</pubDate>
		<dc:creator>Richard Wallis</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php</guid>
		<description><![CDATA[ Back in May I was among others who welcomed the initiative by, Talking with Talis interviewee, Ed Summers in setting up lcsh.info.&#160; This site was set up by Ed to demonstrate how the Library of Congress Subject Headings could be represented as a Semantic Web application using SKOS.
In the intervening months many including myself [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blogs.talis.com/panlibus/archives/2008/05/lcsh-as-linked-data.php" ><img  src="http://farm3.static.flickr.com/2228/2162932731_bf4c9773f3_m_d.jpg" align="right" /> Back in May</a> I was among others who welcomed the initiative by, Talking with Talis interviewee, <a href="http://blogs.talis.com/panlibus/archives/2008/06/ed-summers-talks-with-talis.php" >Ed Summers</a> in setting up <a href="http://lcsh.info/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://lcsh.info/');">lcsh.info</a>.&#160; This site was set up by Ed to demonstrate how the <a href="http://en.wikipedia.org/wiki/Library_of_Congress_Subject_Headings" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://en.wikipedia.org/wiki/Library_of_Congress_Subject_Headings');">Library of Congress Subject Headings</a> could be represented as a Semantic Web application using <a href="http://www.w3.org/TR/skos-reference/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://www.w3.org/TR/skos-reference/');">SKOS</a>.</p>
<p>In the intervening months many including myself used Ed’s work as a pointer to how useful publicly available data could, with the use of open <a href="http://linkeddata.org/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://linkeddata.org/');">Linked Data</a> principles, become a valuable part of sites and services across the globe.&#160;&#160; For instance another Talking with Talis interviewee <a href="http://blogs.talis.com/panlibus/archives/2008/08/semantic-future-for-libraries-martin-marlmsten-talks-with-talis.php" >Martin Malmsten</a>, from the Royal Library of Sweden, almost immediately made use of the links to the LCSH data.&#160; Ed went on to get lots of feedback, and wrote a <a href="http://arxiv.org/abs/0805.2855" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://arxiv.org/abs/0805.2855');">paper</a> which he then presented at <a href="http://dc2008.de/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://dc2008.de/');">DC2008</a>.</p>
<p>It is therefore with great disappointment that I read <a href="http://lcsh.info/2008/12/19/uncool-uris/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://lcsh.info/2008/12/19/uncool-uris/');">this</a> on the lcsh.info site the other day:</p>
<blockquote><p>On December 18th I was asked to shut off lcsh.info by the Library of Congress. As an LC employee I really did not have much choice other than to comply.</p>
</blockquote>
<p>As a LC employee he was put in an untenable position when they obviously decided that they didn’t like this useful service based on publicly available data being delivered from a domain that doesn’t end in loc.gov.&#160; I wonder if there are any other Linked Data enthusiasts, not held back by who their employer is, who would pick up from where he left off?</p>
<p>Ed goes on to say:</p>
<blockquote><p>It was always my intention for concept URIs at lcsh.info to be cool. I advertised the service as ‘experimental’ and indicated it was going to hopefully inform the development of a similar continually updated service at LC where I work. …&#160; My thought was I could leave the service running until there was something similar at LC that I could redirect the concept URIs to. After a year or two when people had rewritten there data to point at loc.gov I could retire lcsh.info. I never imagined I would be asked by LC to take it down.</p>
</blockquote>
<p>LOC should have listened to Ed in the first place and taken the high ground in leading the work in to creating a semantic web of data with their valuable publicly available data.&#160; At the end of his post Ed hints that <em>LC is still considering running a service like lcsh.info at loc.gov, but it’s not there yet.&#160; </em>Why-o-why did they not learn from his work and ride the wave of introducing their own service based on his great initiative.&#160; Instead they present to the world a short-termist not-invented-here attitude, that reminds me of other well established leviathans of the world of library metadata.</p>
<p>Let’s hope that Ed’s hint is correct and we will soon be able to welcome the release of Open Linked LCSH and other Data from the electronic portals of the LofC.</p>
<div class="wlWriterEditableSmartContent" id="174b3e38-b5f0-4377-b062-27e61b9d7ec2" >Technorati Tags: <a href="http://technorati.com/tags/Library+of+Congress" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://technorati.com/tags/Library+of+Congress');" rel="tag">Library of Congress</a>,<a href="http://technorati.com/tags/Ed+Summers" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://technorati.com/tags/Ed+Summers');" rel="tag">Ed Summers</a>,<a href="http://technorati.com/tags/Linked+Data" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://technorati.com/tags/Linked+Data');" rel="tag">Linked Data</a></div>
<p><a href="http://www.flickr.com/photos/library_of_congress/2162932731/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://www.flickr.com/photos/library_of_congress/2162932731/');">Traffic Squad Police (LOC)</a> image published in the <a href="http://www.flickr.com/photos/library_of_congress/" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://www.flickr.com/photos/library_of_congress/');">The Library of Congress&#8217; photostream</a> on <a href="http://www.flickr.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/http://www.flickr.com');">Flickr</a></p>
<!-- Social Bookmarks BEGIN -->
<div class="social_bookmark">
<a><strong><em>Share This</em></strong></a>
<br />
<div class="d">
<br />
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.bloglines.com/sub/http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php" title="Add to&nbsp;Bloglines"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/bloglines.png" title="Add to&nbsp;Bloglines" alt="Add to&nbsp;Bloglines" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.connotea.org/addpopup?continue=confirm&amp;uri=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php&amp;title=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down" title="Add to&nbsp;Connotea"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/connotea.png" title="Add to&nbsp;Connotea" alt="Add to&nbsp;Connotea" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://del.icio.us/post?url=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php&amp;title=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down" title="Add to&nbsp;Del.icio.us"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/delicious.png" title="Add to&nbsp;Del.icio.us" alt="Add to&nbsp;Del.icio.us" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://digg.com/submit?phase=2&amp;url=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php&amp;title=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down" title="Add to&nbsp;digg"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/digg.png" title="Add to&nbsp;digg" alt="Add to&nbsp;digg" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.facebook.com/sharer.php?u=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php" title="Add to&nbsp;Facebook"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/facebook.png" title="Add to&nbsp;Facebook" alt="Add to&nbsp;Facebook" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php&amp;title=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down" title="Add to&nbsp;Google Bookmarks"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/google.png" title="Add to&nbsp;Google Bookmarks" alt="Add to&nbsp;Google Bookmarks" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://ma.gnolia.com/bookmarklet/add?url=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php&amp;title=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down&amp;description=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down" title="Add to&nbsp;Ma.gnolia"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/magnolia.png" title="Add to&nbsp;Ma.gnolia" alt="Add to&nbsp;Ma.gnolia" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.stumbleupon.com/submit.php?url=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php&amp;title=Library+of+Congress+forces+LCSH+Linked+Data+site+to+shut+down" title="Add to&nbsp;Stumble Upon"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/stumbleupon.png" title="Add to&nbsp;Stumble Upon" alt="Add to&nbsp;Stumble Upon" /></a>
<a onclick="window.open(this.href, '_blank', 'scrollbars=yes,menubar=no,height=600,width=750,resizable=yes,toolbar=no,location=no,status=no'); return false;" href="http://www.technorati.com/faves?add=http://blogs.talis.com/panlibus/archives/2008/12/library-of-congress-forces-lcsh-linked-data-site-shut-down.php" title="Add to&nbsp;Technorati"><img class="social_img" src="http://blogs.talis.com/panlibus/wp-content/plugins/social_bookmarks/images/technorati.png" title="Add to&nbsp;Technorati" alt="Add to&nbsp;Technorati" /></a>
<br />
</div>
</div>
<!-- Social Bookmarks END -->
<img src="http://feeds.feedburner.com/~r/talis/panlibus/~4/491442346" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://feeds.feedburner.com/~r/talis/panlibus/~3/491442346/library-of-congress-forces-lcsh-linked-data-site-shut-down.php/feed/</wfw:commentRss>
		</item>
		<item>
		<title>New RIF specification releases</title>
		<link>http://www.w3.org/blog/SW/2008/12/20/new_rif_specification_releases</link>
		<comments>http://www.w3.org/blog/SW/2008/12/20/new_rif_specification_releases#comments</comments>
		<pubDate>Sat, 20 Dec 2008 13:46:08 +0000</pubDate>
		<dc:creator>Ivan Herman</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">tag:www.w3.org://84bcdfe8028f89016715274d79b715e4</guid>
		<description><![CDATA[The W3C Rule Interchange Format (RIF) Working Group published five new Working Drafts today. Since the Last Call Working Draft of RIF Basic Logic Dialect (BLD), the group has been developing other key dialects, components, and test cases. The new publications are:


  RIF Use Cases and Requirements: minor changes
  RIF Core: new design to support both BLD and PRD
  RIF Datatypes and Built-Ins 1.0: various improvements
  RIF Production Rule Dialect (PRD): operational semantics are complete
  RIF Test Cases: early stages of test suite


The Working Group is nearing Last Call on these remaining elements of RIF, and welcomes feedback from rulesystem users and designers.]]></description>
			<content:encoded><![CDATA[The <a href="http://www.w3.org/2005/rules">W3C Rule Interchange Format (RIF) Working Group</a> published five new Working Drafts today. Since the Last Call Working Draft of <a href="http://www.w3.org/TR/2008/WD-rif-bld-20080730/">RIF Basic Logic Dialect (BLD)</a>, the group has been developing other key dialects, components, and test cases. The new publications are:

<ul>
  <li><a href="http://www.w3.org/TR/2008/WD-rif-ucr-20081218/">RIF Use Cases and Requirements</a>: minor changes</li>
  <li><a href="http://www.w3.org/TR/2008/WD-rif-core-20081218/">RIF Core</a>: new design to support both BLD and PRD</li>
  <li><a href="http://www.w3.org/TR/2008/WD-rif-dtb-20081218/">RIF Datatypes and Built-Ins 1.0</a>: various improvements</li>
  <li><a href="http://www.w3.org/TR/2008/WD-rif-prd-20081218/">RIF Production Rule Dialect (PRD</a>): operational semantics are complete</li>
  <li><a href="http://www.w3.org/TR/2008/WD-rif-test-20081218/">RIF Test Cases</a>: early stages of test suite</li>
</ul>

The Working Group is nearing Last Call on these remaining elements of RIF, and welcomes feedback from rulesystem users and designers.]]></content:encoded>
			<wfw:commentRss>http://www.w3.org/blog/SW/2008/12/20/new_rif_specification_releases/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Our Approach to Modeling, Fidelity, and KR</title>
		<link>http://clarkparsia.com/weblog/2008/12/19/our-approach-to-modeling-fidelity-and-kr/</link>
		<comments>http://clarkparsia.com/weblog/2008/12/19/our-approach-to-modeling-fidelity-and-kr/#comments</comments>
		<pubDate>Fri, 19 Dec 2008 16:41:25 +0000</pubDate>
		<dc:creator>Kendall Clark</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://clarkparsia.com/weblog/?p=231</guid>
		<description><![CDATA[Image via Wikipedia
	For some people, the point of the Semantic Web is distributed, web-friendly knowledge management and knowledge representation. Generally we&#8217;re in that camp. But that camp breaks down into several factions, and it&#8217;s useful to be clear about which faction we&#8217;re in.
	There is a spectrum that runs from Maximum Fidelity to Maximum Scalability. Given [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" ><div><dl class="wp-caption" ><dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/Image:Data_link_modules.jpg"><img src="http://upload.wikimedia.org/wikipedia/en/thumb/c/ce/Data_link_modules.jpg/202px-Data_link_modules.jpg" alt="SSI data link modules" title="SSI data link modules" height="269" width="202"></a></dt><dd class="wp-caption-dd zemanta-img-attribution" >Image via <a href="http://en.wikipedia.org/wiki/Image:Data_link_modules.jpg">Wikipedia</a></dd></dl></div></div>
	<p>For some people, the point of the <a href="http://en.wikipedia.org/wiki/Semantic_Web" title="Semantic Web" rel="wikipedia" class="zem_slink">Semantic Web</a> is distributed, web-friendly knowledge management and <a href="http://en.wikipedia.org/wiki/Knowledge_representation" title="Knowledge representation" rel="wikipedia" class="zem_slink">knowledge representation</a>. Generally we&#8217;re in that camp. But that camp breaks down into several factions, and it&#8217;s useful to be clear about which faction we&#8217;re in.</p>
	<p>There is a spectrum that runs from Maximum Fidelity to Maximum Scalability. Given our roots in Description Logic, we lie somewhere in-between these two poles. Notice that I have intentionally avoided calling these &#8220;extremes&#8221;; they are endpoints, and perfectly respectable, useful ones, depending on who you are and what you&#8217;re trying to achieve. </p>
	<p>The Max Fidelity folks want to model as closely as possible some world-chunk in as fine-grained and faithful manner as is possible. This often means that they are at least <a href="http://en.wikipedia.org/wiki/First-order_logic" title="First-order logic" rel="wikipedia" class="zem_slink">first order logic</a> fans, and sometimes <a href="http://en.wikipedia.org/wiki/Higher-order_logic" title="Higher-order logic" rel="wikipedia" class="zem_slink">higher-order logic</a> users. They debate edge cases, corner cases, alternate and competing semantics and logics in an attempt to ever more faithfully mirror reality. The price they pay is, generally, computability. For some use cases, that price is perfectly acceptable. For other use cases, that price is entirely too high, since the most perfect representation of the world is useless if you can&#8217;t practically compute with it&#8212;at least, that&#8217;s how Max Fidelity often looks to us.</p>
	<p>At the far end of the spectrum we have Max Scalability folks, for whom the point of the Semantic Web is rather more the &#8220;Web&#8221; than the &#8220;Semantic&#8221; part&#8212;we might playfully call them the &#8220;semantic WEB&#8221; crowd, in order to reflect their ideal ratio. Here the point isn&#8217;t to <strong>model</strong> perfectly; but, rather, to do something with lots and lots of data, ideally Webfuls of data. This means, in the argot of current tech choices, that they tend to be RDF and <a href="http://en.wikipedia.org/wiki/Linked_Data" title="Linked Data" rel="wikipedia" class="zem_slink">Linked Data</a> fans and users, since that&#8217;s just about the only approach to doing anything at all interesting with Webfuls of data. The price they pay, of course, is expressivity. For some use cases, that&#8217;s just fine, since you don&#8217;t always need a lot or even much semantic fidelity to get the job done. Sometimes we build applications for customer that take this approach. But, as above, for other use cases, this is simply a killer, because without enough or the right semantics, you don&#8217;t get the right kind of help from the machine in figuring out complex stuff.</p>
	<p>So what do we have so far? First, we have a notional (and idealized) spectrum that runs from Webfuls of data to, roughly, at least first order logic. Second, we have obviously tons of interesting use cases at (probably) every point along this spectrum. And, third, we have the suggestion that we aim for some kind of sweet spot in the middle&#8212;where &#8220;sweet spot&#8221; and &#8220;in the middle&#8221; are not absolute notions, but are interest-relative and goal-specific, and where the interests and goals we care about are, surprise-surprise, <strong>ours</strong>. </p>
	<p>(In other words, I&#8217;ve setup a little fantasy where we are the Heroes&#8212;where we naturally occupy the &#8220;sweet spot&#8221;&#8212;but then, since I&#8217;m not a complete jerk, I&#8217;ve ironized or called into question that very fantasy in an effort to suggest that we, just like everyone else, try to spin things to make ourselves look smart, cool, and useful.)</p>
	<p>And&#8212;will miracles never cease?&#8212;that&#8217;s just about where <a href="http://en.wikipedia.org/wiki/Description_logic" title="Description logic" rel="wikipedia" class="zem_slink">Description Logic</a> fits along such an idealized spectrum. Technically, it&#8217;s the decidable subset of first order logic, which means that we try to balance Fidelity and Scalability in a way where we can get some of both. </p>
	<p>The Max Fidelity folks are forever poking us with sticks to the effect that we can&#8217;t model world-chunks nearly as faithfully as they can. Well, no crap, of course we can&#8217;t! Then the Max Scalability folks poke us with different sticks to the effect that we can&#8217;t scale to Webfuls of data&#8212;again, no duh!</p>
	<p>And then we poke back at both camps&#8212;hey, they started it!&#8212;to the effect that we can model far better than Max Scalers and we can scale far further than Max Fideliters (yes, I just made that word up&#8230;Rock!)...</p>
	<p>Finally, a word about how this positioning issue plays out in our approach to modeling. In short, we model such that we get the right inferences, since getting the inferences is typically what our kind of applications (analysis, decision support kinds of apps, in short) are all about. So that means some edge or corner cases, even if they fit into DL, get ignored or dropped out or even distorted when there&#8217;s no point&#8212;given requirements analysis&#8212;to fidelity for its own sake. And it means, on the flip side, that we don&#8217;t worry too much that that inference over Webfuls of data is not realistically achievable anytime soon. Fast enough for the customer&#8217;s data is sufficient scalability in most cases for us.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related Articles</h6><ul class="zemanta-article-ul"><li class="zemanta-article-ul-li"><a href="http://www.mkbergman.com/?p=466">Thinking &#8216;Inside the Box&#8217; with Description Logics</a></li><li class="zemanta-article-ul-li"><a href="http://vanirsystems.com/danielsblog/2008/05/23/the-tribes-of-the-semantic-web/">The Tribes of the Semantic Web</a></li></ul>
<div  class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/8a92e2bf-001b-49e7-9592-37ddb1c55437/" title="Zemified by Zemanta"><img  class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=8a92e2bf-001b-49e7-9592-37ddb1c55437" alt="Reblog this post [with Zemanta]"></a></div>

 ]]></content:encoded>
			<wfw:commentRss>http://clarkparsia.com/weblog/2008/12/19/our-approach-to-modeling-fidelity-and-kr/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Why Reasoning Matters: Motivations</title>
		<link>http://clarkparsia.com/weblog/2008/12/19/why-reasoning-matters-motivations/</link>
		<comments>http://clarkparsia.com/weblog/2008/12/19/why-reasoning-matters-motivations/#comments</comments>
		<pubDate>Fri, 19 Dec 2008 16:21:17 +0000</pubDate>
		<dc:creator>Kendall Clark</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://clarkparsia.com/weblog/?p=206</guid>
		<description><![CDATA[Image via Wikipedia
	The  perceived utility of automated reasoning for a wide range of applications matters to us greatly, which makes sense, given that our biz proposition is &#8220;semantic infrastructure OEM&#8221;. In other words, we&#8217;re trying to make money by licensing reasoning infrastructure, and related pieces, for semantic applications to other developers to use in [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" ><div><dl class="wp-caption" ><dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/Image:Industrial_Robotics_in_car_production.jpg"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Industrial_Robotics_in_car_production.jpg/202px-Industrial_Robotics_in_car_production.jpg" alt="German KUKA Industrial robots doing vehicle un..." title="German KUKA Industrial robots doing vehicle un..." height="253" width="202"></a></dt><dd class="wp-caption-dd zemanta-img-attribution" >Image via <a href="http://commons.wikipedia.org/wiki/Image:Industrial_Robotics_in_car_production.jpg">Wikipedia</a></dd></dl></div></div>
	<p>The  perceived utility of <a href="http://en.wikipedia.org/wiki/Automated_reasoning" title="Automated reasoning" rel="wikipedia" class="zem_slink">automated reasoning</a> <strong>for a wide range of applications</strong> matters to us greatly, which makes sense, given that our biz proposition is &#8220;semantic infrastructure OEM&#8221;. In other words, we&#8217;re trying to make money by licensing reasoning infrastructure, and related pieces, for semantic applications to other developers to use in their apps. With the right APIs and tool maturity, as well as supporting materials, our customers should be able to treat automated reasoning as a black box&#8212;not a black art.</p>
	<p>A problem with demonstrating automated reasoning&#8217;s utility is that automated reasoning is complex, with non-trivial logical background and framework, including oodles of domain-specific vocabulary. Another problem is that automated reasoning is, in the end, just a kind of mechanical <a href="http://en.wikipedia.org/wiki/Rewriting" title="Rewriting" rel="wikipedia" class="zem_slink">term rewriting</a> often according to, considered individually, quite trivial rules. (Pellet isn&#8217;t really a rules engine, but we&#8217;ll talk about that another time.)</p>
	<p>That means that for toy cases, which is what most people new to the subject are ready for, it seems dull and unimpressive. And for the hard cases? Well, most people aren&#8217;t <strong>ready</strong> for hard cases, so they simply tune out. And who can blame them, really? It&#8217;s like my example about Emma and Jack. I mean, that example really sucked, but what&#8217;s the alternative?</p>
	<p>This is not an easy problem to solve.</p>
	<p>My approach, rather than showing more toy or real examples, is just to talk about the utility of automated reasoning in plain language, in an attempt to communicate not so much specific details as the general mindset or approach to solving particular sorts of problems using automated reasoning. This approach to marketing mirrors our approach to technology development: both are iterative and experimental, but not just for us. As the man said, even a blind pig occasionally finds an acorn.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related Articles</h6><ul class="zemanta-article-ul"><li class="zemanta-article-ul-li"><a href="http://clarkparsia.com/weblog/2008/06/18/its-not-what-you-know/">Why Reasoning Matters: It&#8217;s Not What You Know&#8230; (2)</a></li><li class="zemanta-article-ul-li"><a href="http://clarkparsia.com/weblog/2008/06/23/why-reasoning-matters-explanations/">Why Reasoning Matters: Explanations (3)</a></li><li class="zemanta-article-ul-li"><a href="http://clarkparsia.com/weblog/2008/09/14/upcoming-on-this-blog/">Upcoming on This Blog&#8230;</a></li></ul>
<div  class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/4f322906-0dae-46b9-aaf4-e2a615a6edb1/" title="Zemified by Zemanta"><img  class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=4f322906-0dae-46b9-aaf4-e2a615a6edb1" alt="Reblog this post [with Zemanta]"></a></div>

 ]]></content:encoded>
			<wfw:commentRss>http://clarkparsia.com/weblog/2008/12/19/why-reasoning-matters-motivations/feed/</wfw:commentRss>
		</item>
		<item>
		<title>APQC in Houston</title>
		<link>http://clarkparsia.com/weblog/2008/12/19/apqc-in-houston/</link>
		<comments>http://clarkparsia.com/weblog/2008/12/19/apqc-in-houston/#comments</comments>
		<pubDate>Fri, 19 Dec 2008 16:03:55 +0000</pubDate>
		<dc:creator>Michael Grove</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://clarkparsia.com/weblog/?p=437</guid>
		<description><![CDATA[Image via Wikipedia
	I don&#8217;t have slides for my time at the APQC in Houston, I was not slated to present, so no cool slide widget with my presentation in this post.&#160; I was merely there to observe and learn, and maybe answer some questions about POPS.
	As Kendall mentioned, POPS was nominated as a best practice [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" ><div><dl class="wp-caption" ><dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/Image:JPL_logo.svg"><img src="http://upload.wikimedia.org/wikipedia/en/thumb/e/e4/JPL_logo.svg/202px-JPL_logo.svg.png" alt="JPL logo" title="JPL logo" height="83" width="202"></a></dt><dd class="wp-caption-dd zemanta-img-attribution" >Image via <a href="http://en.wikipedia.org/wiki/Image:JPL_logo.svg">Wikipedia</a></dd></dl></div></div>
	<p>I don&#8217;t have slides for my time at the <a href="http://apqc.org" >APQC</a> in Houston, I was not slated to present, so no cool slide widget with my presentation in this post.&nbsp; I was merely there to observe and learn, and maybe answer some questions about POPS.</p>
	<p>As Kendall mentioned, <a href="http://www.w3.org/2001/sw/sweo/public/UseCases/Nasa/" >POPS</a> was nominated as a best practice as part of <a href="http://maps.google.com/maps?ll=34.199635,-118.174654&#038;spn=0.01,0.01&#038;q=34.199635,-118.174654%20%28Jet%20Propulsion%20Laboratory%29&#038;t=h" title="Jet Propulsion Laboratory" rel="geolocation" class="zem_slink">NASA JPL</a>&#8217;s overall efforts in <a href="http://en.wikipedia.org/wiki/Knowledge_Management" title="Knowledge Management" rel="wikipedia" class="zem_slink">Knowledge Management</a>. The meeting at APQC was for all the nominees to give a short talk and to hear the overall findings of the study conducted by APQC, which in this case was on Expertise Location and <a href="http://en.wikipedia.org/wiki/Social_network_service" title="Social network service" rel="wikipedia" class="zem_slink">Social Networking</a>.</p>
	<p>I got to see some great presentations by folks from <a href="http://www.ibm.com" title="IBM" rel="homepage" class="zem_slink">IBM</a>, Sun, <a href="http://www.pw.utc.com" title="Pratt &#038; Whitney" rel="homepage" class="zem_slink">Pratt-Whitney</a>, <a href="http://www.rockwellcollins.com/" title="Rockwell Collins" rel="homepage" class="zem_slink">Rockwell Collins</a>, and Mitre and get a lot of insight into what they&#8217;re doing with Expertise Location and Social Networking; challenges they faced in the past, lessons learned, and what they&#8217;re doing now, and in the future, to continue their efforts in these areas.</p>
	<p>It was a great experience, the people from APQC were fantastic, very friendly and put on a great event, and all the nominees and study partners, a group which included L3, Marathon Oil, <a href="http://www.exxonmobil.com/" title="ExxonMobil" rel="homepage" class="zem_slink">ExxonMobil</a>, and Wyeth, were all great and added a lot to the discussions.</p>
	<p>Hopefully I get the chance to participate or work with APQC in the future.</p>
<div  class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/29f5449b-9493-4393-a1d9-f34fb8ff5545/" title="Zemified by Zemanta"><img  class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=29f5449b-9493-4393-a1d9-f34fb8ff5545" alt="Reblog this post [with Zemanta]"></a></div>

 ]]></content:encoded>
			<wfw:commentRss>http://clarkparsia.com/weblog/2008/12/19/apqc-in-houston/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Eigenfactor.org measures and visualizes journal impact</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/19/eigenfactororg-measures-and-visualizes-scientific-journal-impact/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/19/eigenfactororg-measures-and-visualizes-scientific-journal-impact/#comments</comments>
		<pubDate>Fri, 19 Dec 2008 06:01:09 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1702</guid>
		<description><![CDATA[eigenfactor.org is a fascinating site that is exploring new ways to measure and visualize the importance or journals to scientific communities. The site is a result of work by the Bergstrom lab in the Department of Biology at the University of Washington.  The project defines two metrics for scientific journals based on a page-rank [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://eigenfactor.org/">eigenfactor.org</a> is a fascinating site that is exploring new ways to measure and visualize the importance or journals to scientific communities. The site is a result of work by the <a href="http://octavia.zoology.washington.edu/">Bergstrom lab</a> in the Department of Biology at the University of Washington.  The project defines two metrics for scientific journals based on a page-rank like algorithm applied to citation graphs.</p>
<blockquote><p> &#8220;A journal&#8217;s <b>Eigenfactor score</b> is our measure of the journal&#8217;s total importance to the scientific community. With all else equal, a journal&#8217;s Eigenfactor score doubles when it doubles in size. Thus a very large journal such as the Journal of Biological Chemistry which publishes more than 6,000 articles annually, will have extremely high Eigenfactor scores simply based upon its size. Eigenfactor scores are scaled so that the sum of the Eigenfactor scores of all journals listed in Thomson&#8217;s Journal Citation Reports (JCR) is 100.<br />
&#8230;<br />
A journal&#8217;s <b>Article Influence</b> score is a measure of the average influence of each of its articles over the first five years after publication.  Article Influence measures the average influence, per article, of the papers in a journal. As such, it is comparable to Thomson Scientific&#8217;s widely-used Impact Factor. Article Influence scores are normalized so that the mean article in the entire Thomson Journal Citation Reports (JCR) database has an article influence of 1.00.&#8221;
</p></blockquote>
<p>For example, here are the ISI-indexed journals in the <a href="http://eigenfactor.org/results.php?fulljournalname1=&#038;issnnumber=&#038;pub=&#038;finecat=EP&#038;rosvcat=%25&#038;year=2006&#038;nam=names&#038;grping=%25&#038;resultsperpage=100&#038;ordering=perarticle&#038;Submit=Search">AI subject category</a> ranked by the Article Influence score for 2006.</p>
<p>The site makes good use of GoogleDoc&#8217;s <a href="http://eigenfactor.biology.washington.edu/bubble/">motion charts</a> to visualize the changes of metrics for top journals in a subject area. You can also interactively explore <a href="http://eigenfactor.biology.washington.edu/map/maps.htm">maps</a> that show the influence of different subject categories on one another as estimated from journal citations.</p>
<p><center><a href='http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/12/foo.png'><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2008/12/foo.png" alt="Map of Science" title="Map of Science" width="300" height="268" class="aligncenter size-medium wp-image-1704" /></a></center></p>
<p>The details of the approach and algorithms are available in various <a href="http://octavia.zoology.washington.edu/publications/publications.html">papers</a> by Bergstrom and his colleagues, such as</p>
<blockquote><p> M. Rosvall and C. T. Bergstrom, Maps of random walks on complex networks reveal community structure, Proceedings of the National Academy of Sciences USA. 105:1118-1123. Also arXiv physics.soc-ph/0707.0609v3 [<a href="http://octavia.zoology.washington.edu/publications/forthcoming/BergstromEtAl07.pdf">PDF</a>] </p></blockquote>
<p>(spotted on <a href="http://infoproc.blogspot.com/2008/12/map-of-science.html">Steve Hsu&#8217;s blog</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/19/eigenfactororg-measures-and-visualizes-scientific-journal-impact/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Journal requires authors to include Wikipedia article with submissions</title>
		<link>http://ebiquity.umbc.edu/blogger/2008/12/18/journal-requires-authors-to-include-wikipedia-article-with-submissions/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2008/12/18/journal-requires-authors-to-include-wikipedia-article-with-submissions/#comments</comments>
		<pubDate>Fri, 19 Dec 2008 04:34:04 +0000</pubDate>
		<dc:creator>Tim Finin</dc:creator>
		
		<category><![CDATA[English]]></category>

		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1701</guid>
		<description><![CDATA[Scientific journals are undergoing rapid evolution as they adapt to the Web and various forms of social media.  As reported by Nature (Publish in Wikipedia or perish) and in ReadWriteWeb, the journal RNA Biology is experimenting with a connection to Wikipedia.  Articles submitted for publication about new RNA molecules must also include a [...]]]></description>
			<content:encoded><![CDATA[<p>Scientific journals are undergoing rapid evolution as they adapt to the Web and various forms of social media.  As reported by Nature (<a href="http://www.nature.com/news/2008/081216/full/news.2008.1312.html">Publish in Wikipedia or perish</a>) and in <a href="http://www.readwriteweb.com/archives/publish_in_wikipedia_or_perish.php">ReadWriteWeb</a>, the journal <a href="http://www.landesbioscience.com/journals/rnabiology/">RNA Biology</a> is experimenting with a connection to Wikipedia.  Articles submitted for publication about new RNA molecules must also include a draft Wikipedia page that summarizes the work. The journal will then peer review the page before publishing it in Wikipedia.</p>
<p>Here are the <a href="http://www.landesbioscience.com/journals/rnabiology/guidelines">guidelines</a> from the RNA Biology site:</p>
<blockquote><p>
&#8220;To be eligible for publication the Supplementary Material must contain: (1) a link to a Wikipedia article preferably in a User&#8217;s space. Upon acceptance this can easily be moved into Wikipedia itself together with a reference to the published article.<br />
&#8230;<br />
At least one stub article (essentially an extended abstract) for the paper should be added to either an author&#8217;s userspace at Wikipedia (preferred route) or added directly to the main Wikipedia space (be sure to add literature references to avoid speedy deletion). This article will be reviewed alongside the manuscript and may require revision before acceptance. Upon acceptance the former articles can easily be exported to the main Wikipedia space. See below for guidelines on how to do this. Existing articles can be updated in accordance with the latest published results.&#8221;
</p></blockquote>
<p>This is definitely an interesting and forward looking idea. Yet, I can not help having the cynical thought that it&#8217;s also a great way for the journal to boost it&#8217;s page rank.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2008/12/18/journal-requires-authors-to-include-wikipedia-article-with-submissions/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
