Archive

Archive for September, 2008

Publishing Linked Data With PHP

September 30th, 2008

For a while now I’ve been experimenting with writing my own little PHP applications that run against the Talis Platform. Most of these have never been seen in public because they’re mainly just for scratching an itch I have at the time. I’ve also used a lot of them to validate my own thinking around the types of services that the platform needs to provide to build interesting applications. The core of most of those applications became Moriarty my PHP library for accessing the platform. I use Moriarty extensively now to kick start any development I do. I’m even using it to write PHP scripts for running at the command line. I’m not sure that PHP is going to usurp Perl from my toolbox, but it’s certainly becoming my language of choice for working with RDF.

I’ve been looking carefully at the core patterns that my PHP applications have been following to see if there’s anything else I could pull out. This is generally how I prefer to build new libraries: extracting them from several different projects. Assuming you know how your library is going to work before you’ve written any applications is almost always wrong. I like using libraries that have distilled the essence of repeated attempts at solving the same problem. That’s why I never think about modularization of a codebase until I need to.

I’ve been gravitating towards Konstrukt because it appears to be the least intrusive of the PHP web application frameworks out there and it keeps fairly true to REST principles. I used it to build Kniblet as part of a platform tutorial. However, there are some quirks that it has that I don’t like. For example, to return anything other than HTML requires you to throw an exception. That mechanism works quite well for most applications but doesn’t really suit data-rich applications that have multiple output formats.

It’s with this in mind that I’ve started a new PHP web application framework called Paget. Calling it a framework is somewhat of an overstatement. It’s a few classes that make it easy to publish RDF as linked data. It’s very primitive at the moment, but it’s quite versatile.

It uses a simple configuration array that is passed to a dispatcher that handles the request. The application’s default behaviour is specified using this configuration. One part sets up a series of regular expressions that match URI paths handled by the application and map them to the resources it provides. The data about each resource is obtained by using one or more “generators”. These are simply classes that generate RDF for the given resource. Paget runs each generator to gather the RDF data describing the resource and then handles the serving up of that data according to linked data principles. Right now that’s just enough behaviour to function as a generic linked data publishing framework.

I have three different deployments of Paget that are publishing three RDF data sets using different generators. Each of these was quite trivial to set up, being a few lines of confiiguration. For my own site’s data space I wrote a generator that fetched RDF directly from one of my platform stores (this one) and served it up as HTML and various flavours of RDF. See, for example, http://iandavis.com/id/me which is URI that identifies me.

My second deployment was for PlaceTime, a URI space that I have operated since 2003. It provides RDF data for timelike entities like instants and intervals and spacelike points. However, it hasn’t been fully linked data compliant (mainly because it pre-dated the decision on httpRange-14). I wrote a generator for each type of entity that creates trivial RDF for each valid URI in the space. Some examples:

Finally, I created a generator that reads a local RDF file. I then used it to serve up the whisky vocabulary that Tom, I and several others created at the recent VoCamp Oxford

Admittedly, all these datasets and spaces look pretty similar but this is still early days for Paget. I have some ideas for future development that will flesh out Paget into a fully-fledged RDF driven application framework. For example: as well as generators I plan to add filters, augmenters and transformers that alter the generated data in arbitrary fashions. These could be used to trim the data down, or to convert it to a more usable structure. I can imagine that it would be very useful to be able to pull in more RDF from arbitrary locations on the Web to supplement data in the initial set, e.g. with schema information or additional details. In my opinion that’s one of the significant differences between the web of data and the web of documents: the web of data is going to enable more information to be brought automatically together for the user rather than forcing them to seek it out.

Paget’s HTML rendering of RDF is very primitive at the moment, making only basic attempts to make it human readable. It’s still extremely tabular which is hardly a great use of structured information. One area that I’ve been interested in exploring is that of dynamic user interfaces that adapt to the underlying data automatically. RDF is particularly amenable to building these kinds of interfaces because of its uniform data model. A lot of work on this was done by the Fresnel project and it would be interesting to apply some of the learnings from that project to building dynamic web applications. My goal here is to code as little specific behaviour into the application as possible, instead making the application detect patterns in the data and provide suitable user interface behaviours at runtime. This is really the only way we’re going to be able to build true open world applications, i.e. those that are tolerant of missing data and can adapt to new and unanticipated data.

What I’m still experimenting with is whether these user interface additions should be server-side or passed on to the client. Some of the augmentations could make more sense when actioned by the client based on user activity.

There’s lots to research here and hopefully some of these ideas will make it into Paget very soon.

English , , , , , , , , , ,

Europa apuesta fuerte por la web 3.0

September 30th, 2008
Son pocas las startups europeas que logran conseguir cuotas de mercado de importancia. Veremos otro día el tema con más profundidad, pero de momento, una mirada a las 10 finalistas del evento Startup 2.0 este mismo año (iFoods, UnLtdWorld, Bubok, Geospace, AllRise, Planetaki, Talicious, Learnit, Zilok, Wolpy) de las cuáles os sonarán, con suerte, 2 o 3, os dará una idea del estado de la cuestión. internet en el mundo Pero parece que la situación puede, por lo menos mejorar, si prospera la iniciativa propuesta durante los últimos días por la comisión europea: Se trata de favorecer la adaptación a la próxima ola que revolucionará la información debido a cosas como el networking social, el cambio definitivo de las empresas hacia el entorno online, sistemas nómadas basados en GPS, televisión online y las “etiquetas inteligentes” (una nueva forma de denominar la web semántica). El estudio señala que Europa está en un buen punto de partida para explotar estas tendencias, con políticas que favorecen las redes abiertas y pro-competitivas, así como la privacidad y la seguridad. El informe de la comisión revela también un nuevo índice de efectividad de nuestros Anchos de banda (BPI) que mide cosas como la velocidad, el precio, la competitividad y la cobertura de nuestras redes, más allá del tradicional índice de penetración que venían usando los reguladores de las telecomunicaciones. Según el nuevo valor, Suecia y Países bajos son los países mejor preparados, gracias a un entorno competitivo y a las múltiples empresas y usuarios avanzados en el manejo de las nuevas tecnologías. No es así, sin embargo, para todos: el 100% de Bélgica, Luxemburgo y Dinamarca contrastan con el  60% de Rumanos que tienen la oportunidad de acceder a la red. Incluso en países como Alemania e Italia, el 12% de la población no tiene acceso a la red. Aún así, en términos generales los usuarios acceden más rápido y mejor a la red. 1/4 de los usuarios trabajaron en entornos 2.0 en 2007, la mitad de los europeos acceden a la red a 2 megabits por segundo(MBps), el doble que el año pasado, permitiendo cosas como la TV basada en Internet. Se habla incluso de derecho fundamental a la conectividad, con la Comisión europea declarando esfuerzos para implementar Internet para todos sus ciudadanos, como máximo, en una fecha tan próxima como 2010. Os dejo, para terminar con la declaración de intenciones, un fragmento del excelente discurso de Viviane Redding, Comisaria para la Sociedad de la Información y los Medios:
La Web 3.0 significa ‘en cualquier momento, en cualquier lugar’ negocios, diversión y redes sociales sobre redes potentes y seguras. Significa el fin de la división entre líneas móviles y fijas. De cara a 2015, Europa tiene know-how y potencial de redes suficiente  para liderar esa transformación. Debemos asegurar que la web 3.0, cuyo factor fundamental sería una infraestructura que permita todo lo anterior,  se crea y utiliza en Europa. Una red de alta velocidad es nuestro pasaporte para la sociedad de la información y una condición esencial para nuestro crecimiento económico. La web del futuro cambiará nuestra sociedad”
Añade también algunas cuestiones que ya tratamos en Internet de los problemas, Internet de las cosas o en el primer análisis que publicamos sobre las principales tendencias para la web en 2009: Las nuevas aplicaciones requieren de una internet ubicua. La internet de las cosas, la que empieza a imponerse como hito fundamental de la web 3.0, significa interacción sin cables entre máquinas vehículos, aplicaciones, sensores y muchos otros dispositivos a través de internet. Cosas como el pasaporte electrónico o el uso de los móviles para pagos forman parte también de la idea. Y son cosas que estarán en más de un billón de teléfonos móviles en una fecha tan próxima como 2015. (Nota: Se lee hoy acerca de que la comisión europea resta valor a lo semántico en su definición de lo 3.0. Lo mencionan pero dan importancia, en mi opinión, acertadamente, a las infraestructuras que permiten otros desarrollos, a la brecha digital y a otras cosas tan importantes, como mínimo y tal vez previas, a la primera) Para más información:

Compártelo: bookmarkAñadir a Twine bookmarkAprendeme.org bookmarkTechnorati bookmarkAñadir a Diigo bookmarkVotar en Wikio.es bookmarkDelicious bookmarkStumble it bookmarkDigg bookmarkFacebook bookmarkFavoritos Yahoo bookmarkFavoritos Google bookmarkWindows Live bookmarkFavoritos Yahoo bookmarkLinkedin bookmarkTwittéalo bookmarkAñadir en MrWong bookmarkMenéalo bookmarkFresqui bookmarkBlogmemes

Spanish

La Unión Europea confunde la Web 3.0

September 30th, 2008

La Comisión Europea ha presentado un informe sobre cómo Europa debe responder a la nueva etapa de la revolución de la información y liderar la expansión de la Web 3.0.

Según el documento, Europa está “bien situada” para beneficiarse de esta revolución, porque sus políticas pretenden impulsar las redes de telecomunicaciones que favorecen la competencia, y que además protegen la vida privada y la seguridad de los ciudadanos.

Para la comisaria europea de Sociedad de la Información, Viviane Reding, la Web 3.0 es sinónimo de actividades comerciales, sociales y recreativas, que se pueden llevar a cabo “en cualquier lugar y en cualquier momento”, sobre la base de redes “rápidas, fiables y seguras”.

Como complemento al informe, la Comisión ha iniciado una consulta pública sobre políticas e iniciativas privadas para explotar las posibilidades de la Web 3.0.

Me sorprende esta noticia publicada en El Mundo, sobre todo por el concepto tan equivocado (a medias) que tienen en la Comisión Europea, que en vez de incidir en los aspectos de usabilidad de la Web 3.0, siguen viendo la red como un gran centro comercial, paradigma de los beneficios empresariales.

En vez de orientar la iniciativa hacia la “simplicidad“, “usabilidad“, “conceptualidad“, “semántica“, utilizan términos como “beneficiarse“, “competencia“, “seguridad“, “actividades comerciales“, “fiables“, “seguras“, “explotar“, todos ellos muy alejados de lo que en realidad la Web 3.0 o Web Semántica propone y plantea, que es una red orientada a los contenidos, hacia las búsquedas en las que el significado sustituye a la palabra clave, en la que el usuario es quien decide lo que la Web debe mostrar y ofrecerle.

Ellos siguen tratando a Internet como una “terra incógnita”, como algo a controlar o de lo que beneficiarse, y simplemente se apropian de la terminología para pervertirla, convertirla en algo que se atenga a sus conceptos, su visión de la red como un ecosistema de mercado, no como una base de conocimiento compartido, y semántica.


Spanish

EU Commission`s (short sighted) Definition of Web 3.0

September 30th, 2008

An interesting article for all those who are interested in technology discourse. In a recent VNU.net post the EU Commission made a statement about their understanding of Web 3.0:

While Web 2.0 described the trend towards online collaborative working, including the evolution of social networking sites, wikis and blogs, Web 3.0 will rely on high-performance broadband infrastructure.

According to Viviane Reding, Commissioner for Information Society and Media, Web 3.0

means seamless, anytime, anywhere business, entertainment and social networking over fast reliable and secure networks. [...] It means the end of the divide between mobile and fixed lines. We must make sure that Web 3.0 is made and used in Europe.

This sounds to me like selling old wine in new bottles, revitalising the Commission`s infrastructure policies. And altough broadband is a crucial factor in the evolution of the web, the Commission totally misses the point about the semantics-related innovation paths rolling out in a Web 3.0-scenario.

If anyone has the opportunity, please give Mrs. Reding a briefing!

Reblog this post [with Zemanta]

English , , , , , , , , , ,

ISWC 2008 social network

September 29th, 2008

Attending conferences is very much of a social activity. They provide opportunities to meet and interact with a large group of people that share an interest in a topic. More over, you probably know or at least know of, many of them. One reason to attend conferences is to extend and strengthen your professional social network through these interactions and shared experiences.

We’ve set up a ISWC 2008 social networking site for participants of the Seventh International Semantic Web Conference. Conference attendees can register for free and use the site to connect with other participants, coordinate meeting plans, share comments on events and talks, figure out who is who at the conference, etc.

We are using CrowdVine for this site. Our package allows us to define a calendar where we can list talks and other events, which we’ve only just begun populating. Users can rate, comment on or discuss these calendar items.

If you plan on attending ISWC 2008, please check out our ISWC social networking site and register.

While you are at it, if you tag any ISWC 2008 related pictures, web posts, URLs or resources, we suggest you use the tag ISWC2008.

English , , , ,

ISWC 2008 social network

September 29th, 2008

Attending conferences is very much of a social activity. They provide opportunities to meet and interact with a large group of people that share an interest in a topic. More over, you propbably know or at least know of, many of them. One reason to attend conferences is to extend and strenthen your professional social network through these interactions and shared experiences.

We’ve set up a ISWC 2008 social networking site for participants of the Seventh International Semantic Web Conference. Conference attendees can register for free and use the site to connect with other participants, coordinate meeting plans, share comments on events and talks, figure out who is who at the conference, etc.

We are using CrowdVine for this site. Our package allows us to define a calendar where we can list talks and other events, which we’ve only just begun populating. Users can rate, comment on or discuss these calendar items.

If you plan on attending ISWC 2008, please check it out and register.

English , , ,

Semantic Yellow Pages…

September 29th, 2008

I was sent a link late last week to a white paper over on ZDNet by some Finnish researchers looking into the possibilities of opening up the services offered by Yellow Pages to the Semantic Web. Basically, the Finns looked at the traditional Yellow Pages service (specifically in the Helsinki area, but also more broadly), and found it lacking in several key ways.

The basic idea behind Yellow Pages is to provide a directory service for the general public, and monetise the advertisements of businesses wanting more prominence in the directory. The way current services work (online, that is) leaves much to be desired when it comes to matching the requirements of both users and businesses. Businesses miss out whenever a potential customer fails to notice their ad (because they found a different listing on a different page), and the user loses whenever their search turns up wrong or incomplete results. The reasons behind this mis-match is essentially technical: search and directory tables don’t provide the flexibility required to match users and businesses together whenever linguistic (homonyms, synonyms or hyponymic…) problems occur or when a service simply fails to produce the best case for the request.

I suggest reading the paper for the more technological perspectives (you will need to join ZDNet.co.uk, though I believe it’s free of charge), but the possibilities unearthed could potentially benefit both users of Yellow Pages services and the Semantic Web community itself.

A person looking for a business or seeking an answer to a problem in his area could benefit from the fact that his Yellow Pages results will become much more focused. If he looks for a “camera shop” because his is broken, he’s much more likely to find a shop running a repair service, for example. Businesses behind the service would also benefit from their advertisements being more well-placed to attract people actually looking for them.

However, a significant benefit from this system, is that the data made available through such a service could be reused by the Semantic Web itself. Imagine developing an application and being able to tie directly into a massive, international business directory? I’m sure your imagination (if you happen to be an imaginative developer) could be enlivened by this idea.

I’d be very interested to hear what you have to say about this kind of service, and what you’d build using this kind of data. Let me know where you’d take it, and what you’d like to see from such a directory.

English , , ,

Late registration deadlines coming up

September 29th, 2008

Please note that early registration deadline for OWL-ED is 30.09 and that for all conferences (ISWC, OWL-ED and RR) the late registration deadline is 09.10.

For the late registration deadline it is really important that you do not exceed your 7 days grace period for bank transfers. If you plan to pay by bank transfer and have to run it through your administration / financial department, you should rather register now and ensure that your money reaches us before 16.10. For every payment that reaches us after 16.10, we will have to charge an extra (difference to on-site registration) onsite.

For those planning to pay by Paypal: If you do not have a validated account yet, it might take some time to create an account, so please also create a Paypal account now to avoid delays.

Looking forward to seeing you all in Karlsruhe!

English ,

Thoughts on the Thomson Reuters / Zotero case

September 29th, 2008

Reuters-Zotero My Thomson Reuters Sends Zotero a $10 Million EndNote post yesterday, attracted several comments and ping-backs.

The thoughts seem to drop in to a couple of themes.  Firstly there is the legal position – Have Thompson Reuters got a case, was it presented correctly, which bit of EndNote licensing does it depend upon, of what relevance is the GMU license to use EndNote, etc., etc.  My colleague Rob Styles, who has a far better understanding of these things, has published an excellent post over on our sister connecting  knowledge blog,  Xiphos, reviewing some of the legal issues.

At first glance it seems the case would be specious. Reverse engineering file formats in order to allow interoperability has been settled on several occasions.

In this case, however, GMU have a site license for EndNote. In Bower vs Baystate the courts upheld an anti reverse-engineering clause in the case where it had been knowingly and voluntarily entered into.

Rob also references James Grimmelmann’s post - Thomson Reuters: The Gang That Couldn’t Sue Straight - in which he questions the quality of the case that Thomson Reuters has presented.

The other theme that has emerged from the comments and other posts, is the corporate approach to things like this. As Bruce D’Arcus commented:

If there’s a problem here with corporate academia, it’s the fact that they mindlessly support companies like Thomson with expensive site licenses with ridiculous terms who are prone to litigate when things don’t go their way.

The flippant answer to Bruce’s point is that “they’ve always done it that way, so it’s hardly surprising”.  The corporate approach to the licensing, distribution, and protection of software intellectual property, has evolved over the last forty years or so.  It is only in the last few years that broad open source distribution of functionality, such as is at issue here, has even been possible.   It is therefore unsurprising that corporate monoliths, and especially their legal departments, appear to be way behind the curve of the Open Source and Open data movements.

As I said in my previous post - I predict that this will only one skirmish in a series of battles that will ensue as the information and knowledge publishing and distribution industry morphs into something new.  I stand by this.  We didn’t get from buying CDs in our local store to the current online, pay-as-you-go, take-it-wherever-you-go, iTunes world, without a few battles like this one.

English , , , , , ,

Sprint launches WiMax service in Baltimore

September 29th, 2008

USA Today has a story, Sprint takes wireless service to the max in Baltimore, on Sprint’s new WiMax system for the Baltimore area.

“Monday, Sprint will launch wireless WiMax services in Baltimore, marking the beginning of what could become a new era in mobile broadband. The mobile data network — which will be marketed under the Xohm brand name — is designed to cater to the needs of laptop and home broadband users, not cellphone users.

Prices will start at $10 for a day pass, good for 24 hours of unlimited usage. Monthly service starts at $35. There are no contracts. To use the service, Baltimore customers will have to buy a special WiMax “aircard” or modem, which cost about $45 apiece. There are also special launch discounts, including a $50-a-month plan that offers subscribers unlimited data usage for life.”

Xohm is pronounced “zome”, by the way.

The xohm site has pricing details, which seem to be $35/month for home, $45/month for mobile, and $50/month for both. For home use, they do have a modem that you can hook up to a home router. The $50 fee is good for as long as you are a member, which could be a great deal. I know someone who only pays $5/month for Sprint’s basic all-you-can-eat EDVO service because he was an early adopter.

Speed? They claim that it will be “Comparable to basic DSL and Cable internet service” with a footnote stating “Comparison based on basic DSL and cable plans offering download speeds of 786 kbps (DSL) and 1.5 Mbps (cable) as of September 2008.”

The site says that DC and Chicago are next in line for the service and Dallas, Fort Worth, Boston, Providence, Philadelphia are in the works.

Of course, there are lots of details to check into (e.g., if I buy service in Baltimore, can I use it when in Chicago), but this looks very interesting. Maybe Sprint can make WiMax work.

English , , , , , , , , , , , , , , ,

Last call for participation

September 29th, 2008

You are invited to ISWC 2008, the major international forum for the Semantic Web. The conference serves introductory tutorials, cutting-edge research presentation, exhibitions of the latest business products, and many social activities. It also offers great social networking opportunities for meeting academic leaders, industrial practitioners, researchers, developers, and students.

==Attending Conference==
ISWC 2008 will be held in Karlsruhe Germany, October 26-30, 2008.
conference website

Please register online by October 9 to save 100 eu and take advantage of special hotel offers which are good if made four weeks in advance.
online registration

==Highlights of the Conference Program==
* 11 tutorials on, e.g., Introduction to Semantic Web, Ontology, RDFa, Multimedia, Business Intelligence, Health Care, Applications, Linked Data.

* 13 workshops on, e.g., e-science, reasoning, software engineering, web service, information extraction, scalability, and social network.

* one doctor consortium

* 10 keynotes and invited industrial talks
- Multimedia Semantic Web, Ramesh Jain (UCI)
- Freebase: An Open, Writable, John Giannandre (Metaweb Technologies)
- How can the Semantic Web Community be more convincing, Stefan Decker, (DERI Galway, Ireland)
- Semantic Wikis: Fusing the two strands of the Semantic Web, Mark Greaves (Vulcan Inc.)
- Internet of Services, York Sure (SAP Research)
- Semantic Web @ BBN, Mike Dean (BBN)
- Data Intelligence, Evelyne Viegas (Microsoft Research)
- Semantic Web in Asia: Example Use Cases, Tony Lee (SaltLux)
- Making the Web searchable, Peter Mika (Yahoo Inc),
- Semantic Web from an industry perspective, Jürgen Angele (Ontoprise)

* 57 paper presentations, including 14 in Semantic Web in-use track, 87 posters and demonstrations, a panel on OWL2, and lightning talks

* social events , social networking, reception, banquet and dancing.

* co-located events: RR2008 and OWLED2008

The complete conference program is available at this link.

English , , , , , , , , , , , , , , , , , , , , , , , , , , ,

A Week in Den Haag

September 29th, 2008

I was in the Netherlands for a week on business: a few days at a NATO “semantic interoperability” workshop and then a day meeting with some of our new Dutch research partners—more about that (and them) in the near future.

Thoughts and observations, more personal than professional:

  1. Nearly everyone I spoke with for more than a few minutes referred to the financial shockwaves in the US, often with more than a little schadenfreude. But I can’t blame them.
  2. The Dutch, like the Danes, are courteous and helpful as a rule. I seem to meet more Dutch who speak English than Danes, though, so that biases my perceptions of the two a bit. I’ve also spent more time in NL than DK (6 weeks versus 2). Neither are as effusively friendly and, well, warm as the Spanish or Italians in my experience.
  3. Heineken in public, casually, on trains, walking down the street—that’s cool and very Dutch, but not my thing. I don’t think I could ever live anywhere, including the Netherlands, long enough to drink a beer on a train.
  4. My wife, a graphic designer, is quite well-traveled, speaks Spanish and Italian, and is worldly-wise; but she’s still never been to the Netherlands and is envious of my trips largely because of the Rijksmuseum and Dutch design generally.
  5. Related: I claim the Netherlands is the most typographically correct culture on earth. It’s a small thing, I suppose, but it makes me happy. I suppose the Swiss or perhaps the Danes would object.
  6. From Den Haag to Delft via taxi: 50 euros. Public transit: 5 euros.
  7. The fashion trends today in Europe for young women replay the preppy fashion of the late 80s, when I was in high school. This makes me feel incredibly old, in a way few things do.
  8. If an adjacent table of self-styled “liberal Republicans” talks so loudly as to make it impossible not to listen, my failure to ignore their conversations, by virtue of eating alone myself, is not eavesdropping, it’s self-defense.
  9. That I don’t repeat their names here—public figures, both of them—and some of their more shockingly tasteless dinner conversation (“oh, it’s one of those Jewish names” and other tidbits) is an undeserved mercy.
  10. If you ever need a good dinner in Den Haag, Spijs is lovely.
  11. My server—new: her third night—at Spijs insisted that a “gamboa” was not a “shrimp”, in English, since it wasn’t very small. I, a native English speaker, tried three times to assure her that “shrimp” was correct, but she knew better.
  12. She did accept my claim that “rack of deer” was better replaced with “rack of venison”, however.
  13. Monkfish—which she described as “the one so ugly they cut off its head”—grilled in a tandoor like kebab is surprisingly good.
  14. The beach in Den Haag—in Scheveningen, precisely—is stunning this time of year, if you’re lucky enough to get clear weather, as I was for three days.
  15. I’m now officially too old—a trend, that—for the ritual walk through Amsterdam’s red light district: boring and pointless. Give me a stroll through Leiden, a bami goreng (or even just a kroket), and an afternoon with van Rijn, and I’m a pig in mud.

English , , , , , , ,

Mobile texting now more popular than calling in US

September 29th, 2008

The NYT has a short note (Letting Our Fingers Do the Talking ) on a new Nielsen Mobile report on texting use in the US.

“In the fourth quarter of 2007, American cellphone subscribers for the first time sent text messages more than they phoned, according to Nielsen Mobile. Since then, the average subscriber’s volume of text messages has shot upward by 64 percent, while the average number of calls has dropped slightly.”

Average Number of Monthly Calls vs. Text Messages Among U.S. Wireless Subscribers

 

Calls

Texts

Qtr 1, 2006

198

65

Qtr 2, 2006

216

79

Qtr 3, 2006

221

85

Qtr 4, 2006

213

108

Qtr 1, 2007

208

129

Qtr 2, 2007

228

172

Qtr 3, 2007

226

193

Qtr 4, 2007

213

218

Qtr 1, 2008

207

288

Qtr 2, 2008

204

357

Source: Nielsen Mobile

The article also points out that “Teenagers ages 13 to 17 are by far the most prolific texters, sending or receiving 1,742 messages a month”. The Nielsen data shows that this age group sends two orders of magnitude more data than people over 65.

<td width="252" colspan="3" valign="bottom" class="header_nav_on
style34">

Source: Nielsen Mobile

Average Number of Monthly Calls vs. Text Messages Among U.S. Wireless Subscribers by Age (Q2 2008)

 

Calls

Texts

All Subs

204

357

12 & Under

137

428

Ages 13-17

231

1742

Ages 18-24

265

790

Ages 25-34

239

331

Ages 35-44

223

236

Ages 45-54

193

128

Ages 55-64

145

38

Ages 65+

99

14

English ,

Evri, el blogging relacional semantizado

September 28th, 2008

Thomson Reuters Sends Zotero a $10 Million EndNote

September 28th, 2008

Reuters-Zotero George Mason University is being sued by Thomson Reuters to prevent the distribution of the excellent Firefox plugin, Zotero.  As reported via the Courthouse News Service:

Thomson Reuters demands $10 million and an injunction to stop George Mason University from distributing its new Web browser application, Zotero software, an open-source format that allows users to convert Reuters’ EndNote Software. Reuters claims George Mason is violating its license agreement and destroying the EndNote customer base.

Subject of a Talking with Talis podcast last year with Trevor Owens, Zotero is an impressive free open source tool for capturing, organising and citing research resources, that has been building a successful community of users around it.

Thomson Reuters is complaining about the 1.5 preview release of  Zotero, announced on July 8th, which introduces several new features including:

Support for thousands of existing Endnote® export styles.

Following that link to Endnote export styles you end up on a page containing the following words:

EndNote output styles are provided solely for use by licensed owners of EndNote and with the EndNote product.

That seems to be the bit that is behind the legal action taken.  The question is can they, or should they, enforce such a restriction – not being a legal expert I’ll stop ruminating further in that direction.

The folks in the Center for History and New Media at George Mason, must be wondering what has hit them, but you can’t go rattling the current business model of a someone the size, history and market position of Thomson Reuters without expecting some form of backlash.

I can imagine the cries of outrage that will emanate from the Open Source and Open Data communities because of this.  They will no doubt be matched by indignation and litigious thoughts from the commercial sector as other publishers check to see how Zotero is helping to distribute their output but not necessarily in a way they would like.

It’s ironic then that somewhere else in the Thompson  Reuters organisation there is a site/service with the following ambition:

We want to make all the world’s content more accessible, interoperable and valuable. Some call it Web 2.0, Web 3.0, the Semantic Web or the Giant Global Graph - we call our piece of it Calais.

Calais (Powered by Thomson Reuters) is a semantic web technology based project which in simple terms provides an API to information about people, organisations, geographies, books, authors, events, facts about them, and links between them.  It is a free API service can be used openly, for commercial and non-commercial use, to enrich applications.  (For an insight in to Calais and how it fits with Reuters’ business, I can recommend the podcast Paul Miller recorded with Barak Pridor of ClearForest, the technology with which Calais has been built).

The action being taken against Zotero is symptomatic of the classic growing pains as technology and distribution mechanisms move on.  Just like the scribes complaining  about movable type in the 1400’s, or  the music industry complaining about the mp3 download culture that emerged some 600 years later.

I predict that this will only one skirmish in a series of battles that will ensue as the information and knowledge publishing and distribution industry morphs into something new.  Will actions like this prevent it happening? - of course not.  Will it slow it down? – possibly.   If I was part of the Zotero project would I be worried? – yes, I might be;  some of the early vanguards of the music download revolution were forced out of the race by such legal challenges.  Nevertheless, be it the opening of access to newly created knowledge or providing useful open access to traditionally controlled data, things are a changing.  We will look back on actions like the one against Zotero, viewing them as inevitable battles to try to preserve rapidly outdating business models – anybody read the Innovator’s Dilemma recently!

I hope  that the Zotero folks survive to reap the rewards of their pioneering efforts.

English , , , , , , , , , , , , , , ,

Keep your lab motivated

September 28th, 2008

Sometimes worse is not better, it’s just very bad. Here’s a good motivational poster for those of us who express ourselves in code or formal models.



According to Wikipedia, "On September 21, 1997, a divide by zero error in the USS Yorktown (CG-48) Remote Data Base Manager brought down all the machines on the network, causing the ship’s propulsion system to fail."

English

Evri helps you search less and understand more

September 27th, 2008

Evri is another entry into the ’semantic search’ space and has recently opened up a beta site with the slogan Search less, understand more. Evri is an startup launched by Vulcan Inc, a company founded by Paul Allen in 1986 as a private investment and R&D firm.

Here’s part of how Evri describes itself on their (FAQ).

What is Evri doing? Evri is creating a map of connections between people, places, and things on the web. You’ll use this map to find the things you’re interested in. Instead of searching by keywords and looking for relevant results, Evri will lead you to other relevant articles, images, and video based on what you’re reading.

Where does Evri get its information? We search the World Wide Web and gather content from as many highly regarded information sources as we can find, and we’re adding more sources all the time.”

Saying that Evri does ’semantic search’ is not quite right — their initial focus is on providing widgets for blogs and other web sites that use the text on the page to recommend links to other, related information.

Evri appears to have developed an underlying ontology that is used to organize their knowledge of “people, products and things”, capturing both a type taxonomy and relations. Some of this is revealed in the beta**2 part of their site, Evri’s Garden. There is a query system over their knowledge base complex search queries.

The current push, though, seems to be to get bloggers to add an Evri widget to their blogs that will pop up a window with links to related articles and information.

This is an interesting development that is worth watching.

English , , , , , ,

Looking back at a successful VoCamp Oxford

September 27th, 2008

Thinking... about new vocabularies for the Semantic Web

(by Matthias Samwald)

The first VoCamp ever was successfully completed this week at Oxford University. Tom Heath (Talis) and Jun Zhao (Oxford University) led us through two days devoted to creating new vocabularies, schemas and ontologies. The first day was mainly spent on finding common interests, getting to know each other, and identifying the vocabularies that needed to be created. The second day was spent on creating the vocabularies, first on paper, then on the computer.

Fabien Gandon introduced me to his interesting work around corporate ontologies, which I will explore in further detail for the KiWi project. We also made significant progress on developing a basic, common ontology for the representation of agreement, disagreement and discourse, based on SIOC, SCOT, FOAF and the bibliographic ontology. Such an ontology can be of great utility in many knowledge domains, such as biomedical research or the representation of bug/issue reports in software development knowledge management (something that needs to be adressed for adapting the KiWi system for a use-case at Sun Microsystems). I will elaborate on these developments in separate blog posts next week.

In a very short timespan, the participants of the VoCamp created several new vocabularies, such as:

IRC Vocabulary

Participation Ontologies

UDO (Unified Discourse Ontology)

VotePost

Evidence ontology

Whisky Ontology (yes, it’s an ontology about whiskey)

Journey Ontology

Data publishing, sharing, visualisation ontology

On the second day of the VoCamp I also held a short session called „Do OpenCyc and UMBEL know it?“, where I asked for classes and properties that others wanted to create in their developing vocabularies. It turned out that OpenCyc and UMBEL had a relatively good coverage of the terms that others were creating, including whiskey, evidence, or the relation that one person is the boss of another person (relevant for the corporate Semantic Web). I tried to emphasize that linking and re-using such existing entities was vital for the success of new vocabularies, and of the Semantic Web as a whole. Others objected that re-using such existing resources might not be possible given the often very specific requirements and short time-frames of some projects. Still, I think that linking to existing, large resources on the Semantic Web should have a very high priority when developing new vocabularies.

If you are interested in getting new vocabularies out, but did not get the chance to attend VoCamp Oxford: don’t worry, there are many VoCamps planned for the near future. The next VoCamp will already happen in November, and will be located at DERI Galway. The Semantic Web Company will host a VoCamp in Vienna next year.

English , , , , , , ,

ISWC 2008 call for participation: register online by Oct 9

September 26th, 2008

Register online for the 7th Int. Semantic Web Conf. by Oct 9

There is still time to register online for ISWC 2008, the Seventh International Semantic Web Conference, which will be held in Karlsruhe Germany on on 26-30 October 2008. Register online by October 9 to save 100 eu and take advantage of special ISWC hotel offers which are good if made four weeks in advance.

The ISWC program booklet has the complete schedule with further details on the ISWC site. The program includes much for practitioners, researchers, students, and business people, including both experts and those new to the concepts and technology. The days before the start of the main conference include the Doctoral Consortium, 11 half and full-day tutorials, and 13 workshops. The conference features 10 keynote and invited talks, 57 submitted paper presentations, 55 poster presentations, 32 demonstrations, a panel, a series of lightning talks, exhibits of products and services, and social events including guided tours, a reception, a banquet and dancing.

English ,

Wall Street’s collapse may be IT program’s gain

September 26th, 2008

Virtually all information technology programs in the US and Europe saw their enrollments drop significantly after the dot com bubble deflated in 2001. Students decided to pursue other majors, even though the IT job market remained strong — it just wasn’t insanely strong.

At UMBC, the number of our Computer Science majors fell by almost 50%, even though the number of BS degrees we produced declined only slightly. Our Information Systems Department suffered an even greater decrease in their undergraduate programs.

One of the popular alternatives students moved toward was business, especially finance, banking and trading, where young people with good analytic skills who were willing to work hard could do very well.

Computer World has an article, Wall Street’s collapse may be computer science’s gain, that speculates the flow will reverse.

“The collapse of Wall Street may help make computer science and IT careers attractive to students who abandoned these fields in droves after the pop of the last big bubble, the dot-com bust of 2001.
    William Dally, chairman of the computer science department at Stanford University, said that for the last several years, he has watched some students interested in technology go into banking and finance because those fields could be more lucrative.
    ”Many thought they could make more money in hedge funds,” Dally said. He said students are returning to computer science because they like the field and not because it can necessarily make them rich.”

My only regret is that the IT industry (including the academic sector) didn’t get a multi-hundred-billion dollar federal bailout back in 2000.

(h/t to Marie desJardins)

English , , , , , , , , ,

Why Faviki is able to suggest tags in 13 languages

September 26th, 2008

Just got in touch with Vuk Miličić from Faviki recently - Faviki has been selected as a featured project on Google code, and in that context, Vuk describes the process of how Faviki retrieves its suggestions in a little more detail. It’s really interesting! It also sheds more light on the way that DBpedia is used in Faviki: Not immediately for the retrieval of tags, but for the translation of tags - long live the smartness of linked data!

  1. Faviki fetches a web page and extracts a core text (without HTML and non-relevant content).
  2. Then it tries to figure out if a content is in English. If it isn’t, it is sent to Google language API, which detects the original language automatically, translates it into English and returns the translation.
  3. The content is then sent to and analyzed by Zemanta API, which then finds relevant links. Faviki uses links from English Wikipedia - titles are used as semantic tags.
  4. If users language is not English, we must translate them. Using DBpedia datasets “Links to Wikipedia Article” , we can find names of Wikipedia’s titles in one of 13 languages. These datasets actually contain the connections between English Wikipedia articles and articles from Wikipedia in other languages.
  5. Finally, suggested tags are offered to a user.

Read the whole blog post on Vuk’s Faviki blog

Reblog this post [with Zemanta]

English , ,

Can you surf the web and chew gum at the same time?

September 26th, 2008

The NYT has an article (Get Off the Internet, and Chew Some Gum) on an ad campaign by Cadbury advocating that young people spend less time on the Internet and more being up close and personal, after, of course, sweetening their breath with Dentyne.

“The campaign, called “Make face time,” was created by McCann Erickson for Dentyne, a brand owned by Cadbury, the No. 2 gum maker in the United States after Wrigley. The ads feature happy people embracing and kissing — their breath presumably freshened by Dentyne — as an alternative to pounding their BlackBerrys or sending electronic messages to their Facebook friends.” (src)

Somewhat ironically, the ad campaign is going into high-gear now with a Web component. There is also a three minute version at www.makefacetime.com.

“It opens with a warning announcing that it will shut down after three minutes. “When people are surfing the Web, they’re missing the best part of life — being together,” it reads.”

When I viewed it, however, it was against a background of PHP error messages. Maybe that’s part of the message — get off the Web, it’s run by flaky machines that speak in strange and unnatural languages.

UMBC’s Zeynep Tufekci was quoted in the NYT article as a skeptic.

“That strategy could be a gamble, as the ads focus on exactly the people who are most passionate about these digital tools.

“I think most college kids would roll their eyes” at the ads, said Zeynep Tufekci, a sociologist at the University of Maryland, Baltimore County, who studies the way young people use technology to socialize. “In fact, they’re checking out these sites in the hopes that sooner or later it will end up in a hug or kiss.”

Ms. Tufekci said that the idea that social networking sites and other digital tools have separated people from those that matter in their lives will probably not sit well with the gum industry’s young customers.

“This is a false dichotomy,” she said. People use online tools as a way to be more social, she said, updating their acquaintances on what they are doing and making plans to meet in person. Her research has shown that people who use these tools have just as many offline friends and spend just as much time with them as people who do not socialize online.” (src)

Ths makes me wonder if the Dentyne lose its flavor on the Web post overnight.

(Apologies to The Happiness Boys)

English , , , , , , , , , , ,