Archive

Archive for October, 2008

Proyectos I+D en Europa para una nueva Web

October 31st, 2008

La comisión europea publica un resúmen de los proyectos I+D concedidos en el séptimo programa marco en el área de Software y Servicios.

Se resumen los 28 proyectos concedidos a partir de las 186 propuestas financiados con 120 millones de euros. Es un resúmen bastante útil para futuras propuestas.

Algunas estadísticas por entidad y país. Los campeones españoles son Universidad Politécnica de Madrid (7 proyectos), Telefonica I+D (6 proyectos) y ATOS Origin (4 proyectos).

El proyecto que más servicios promete es sin duda SOA4ALL. A ver como cunden los 120 millones.

(via madri+d)

Spanish

Grandes empresas en marcha hacía la Web 3.0

October 31st, 2008
Ademàs de las empresas que ya hemos nombrado hay otras que apuntan fuerte por la Web 3.0. Es el caso de Telefonica I+D. Aqui nos cuentan "Cómo será la web 3.0" en un resumen que va desde la Web 2.0 pasando por los microformatos y la tan experada Web Semántica para terminar con unas consideraciones sobre la movilidad (que seguramente mucho interesa a Telefonica).
Al otro lado del oceano no se quedan mirando. Project10X ha publicado hace unos dìas el"Web 3.0 Manifesto". La versi
ón gratuita (y muy reducida) se puede bajar desde aquí. Después del Semantic Wave 2008 publicado siempre por Project10X, no hay que perderse esta nueva entrega.
Yahoo en Santa Clara (California) hace tan sólo una semana a la conferencia sobre Web 3.0 aclaraba su posición acerca de esta tendendecia. Se ha hablado de la Yahoo! Open Strategy (Y!OS) cuyo objetivo es modificar el modelo de la empresa, adoptando un modelo abierto y orientado a la tecnología social. De allí la presentación de un sistema de gestion de perfiles centralizado para Yahoo y quizas para la web en general (Yahoo! Profiles).
Y si también los grandes se mueven..

Spanish

Video Explaining Semantic Search, Google’s Misnamed Open-ID Support, and Some Good Ol’ VC Bashing!

October 31st, 2008

Join us at Mashup Camp to Compete in the first Calais Mashup Contest

October 31st, 2008

 INTRODUCING THE CALAIS MASHUP CONTEST @ MASHUP CAMP

Please join us for Mashup Camp to compete in our first Calais Mashup Contest at the Computer History Museum in Mountain View, CA, Nov. 17-19, 2008.

THREE WAYS TO WIN
We are offering a 4TB Drobo as top prize for Calais mashups in three categories:

     

  1. Best Calais Mashup addressing the delivery and/or display of news
  2. Best Calais Mashup addressing the needs of business users
  3. Most creative Calais Mashup overall (in any category serving any user)

Mashup Camp is a unique opportunity to work directly with other Calais developers - including members of our expert internal team who are flying in from the ClearForest development center in Israel - to create new and compelling mashups.

See the complete rules for the Calais mashup contest on the Mashup Camp wiki.  You must be a Mashup Camp participant in order to compete.  An unlimited Calais API key as well as a variety of helpful APIs and feeds will be made available to entrants as well.

To participate, register for Mashup Camp and note your intention to enter the Calais Mashup Contest here.  If you want to get started early, and need the special Calais key, shoot us a note at MashupContest (at) OpenCalais (dot) com.

We look forward to seeing you there.

  

English , , , , , ,

We Won!

October 31st, 2008

So, a ton of stuff happened over the last 7 or 8 days (OWLWG meeting, Product Modeling XG meeting, OWLED, ISWC, RDB2RDF XG meeting, New Ever Awesomer Pellet) and I hope to carve out some time to post about it all, but the latest big news is that the following paper:

Matthew HorridgeBijan Parsia, and Uli SattlerLaconic and Precise Justifications in OWL

won the best (research) paper award! I can’t tell you what a pleasure this is on so many fronts. Matthew is just into the second year of his PhD with Uli and me and it’s amazing (but not unexpected) that he churned out this level of quality in his work. It’s very gratifying to have ISWC recognize this.

Also, I hope this changes perceptions amongst people (esp. people who review our papers) that explanation is a “finished” or trivial area. This is the second best paper award for an explanation paper (that I know of) in the past few years and it was extremely non-trivial. I, certainly, had been working on it since Aditya graduated (he had a first stab in his thesis). Other researchers had worked on it. This paper nails it and it took many months of work to get it. However, I’ve, over the years, gotten a surprising number of comments to the effect that explanation wasn’t interesting, or it was solved. In fact, when Matthew was considering starting a PhD on explanation, several people told him that they didn’t think there was enough to do on the topic for a PhD!

Take that!

Matthew did a brilliant job on all fronts: Theoretical, algorithm design, implementation, user interface design, and experimentation. We have some truly intriguing results. His talk was videoed and should show up on the web in due course.

Expect the groovy new explanation features to show up soon in Pellet. They’ll blow your mind.

English , ,

Algorithms vs. Data: The Seesaw Effect

October 30th, 2008

The Seesaw Effect of Algorithms vs. DataOver the years I've noticed that the importance of algorithms and data tends to shift back and forth, depending on which at the time is hardest to duplicate (often from a business perspective). This effect seems to be caused by the availability or demand of one side increasing or decreasing, shifting the balance of importance to the other. At one point the world of software was dominated by the proprietary. The organization with the best software (backend, algorithms, etc) was the dominant entity and data (from say, a Web 2.0 perspective) was generally not the focus. This may have partly been the responsibility of a mindset formed during an era with very little storage space and before mass user activity on the Web.

Things have changed and the word proprietary has become a sort-of developer faux pas. Open source has caused a paradigm shift away from the old proprietary software models and has allowed organizations to focus their attention on the other side of the equation: data. As a result of this shift we saw the start of the Web 2.0 era (perhaps with a few years of padding before the phrase started floating around). Now many organizations focus on the data they acquire and how they can leverage it to their advantage. As a result we see many walled gardens in an attempt to preserve this advantage.

However we may be seeing another shift, this time back to software once again. The Semantic Web calls for making data open and ubiquitous. This is a strong paradigm shift away from the walled garden mindset (and most people understand this, especially the business set). After writing about the cross-pollination of DBpedia and Freebase it occurred to me that the project with the most advanced proprietary information extraction algorithms would in a sense be the "dominant" project because it would be able to leverage its software in a space where data is becoming a commodity.

Freebase has a secret sauce and that is probably their biggest advantage over competing projects. In the Semantic Web/Linked Data Web/Web 3.0 (whatever we feel like calling it at the time), data may decrease in value as it spreads and becomes more commoditized; at least in the original sense of value it once had: as a tool that only the walled gardens could leverage.

We are seeing the walls come down, possibly to be replaced once again by proprietary algorithms.

Got something to say? Leave a comment!

English , ,

Quien es quien en la Web 3.0 (Centros de Investigación)

October 30th, 2008
Quienes son algunos de los grupos de investigación que piensan sobre estas cosas:

  • Laboratorio de Inteligencia Artificial (LIA), Universidad Politécnica de Madrid
  • Forschungszentrum Informatik (FZI)
  • Knowledge Media Institute (KMi) at The Open University
  • Center for the Development of Information and Communication Technologies in Asturias (CTIC Foundation)
  • Institute of Applied Informatics and Formal Description Methods (AIFB)
  • Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA)
  • Semantic Technology Institute Innsbruck (STI)
  • Semantic Web and Multimedia Group (SEMEDIA)
  • Laboratory for Applied Ontology (LOA)
  • Digitial Enterprise Research Institute (DERI)
  • German Research Centre for Artificial Intelligence (DFKI)

Spanish

Post to Twitter with Glue: Add Context to the Conversation

October 30th, 2008

twitter-message-10-30.pngGlue makes it easy to share the things you like with your friends on Twitter. Here’s a quick overview on how to make this happen.

Sharing 2cents on Glue is a light-weight way to express your thoughts about what you’re looking at. The 140 character restraint encourages the sharing of short, expressive thoughts and enables Glue to integrate into Twitter.

Sharing 2cents to Twitter makes it easy to share what you like with friends. It also adds context to the conversation.

2cents-10-30.pngEarlier today I visited the page on Wikipedia for a great new HBO show, True Blood, and wanted to share my thoughts with my friends on Twitter.

I had already set up Glue to automatically share my 2cents onto Twitter so I quickly hit the 2cents button, shared a short thought, and it posted automatically to Twitter.

It sent my 2cents and a link to the Wikipedia article, helping others see what I was talking about.

What’s cool about this is that it pulls your Twitter friends to the Wikipedia page where they are plugged into the Glue network.

When I returned to the page I saw that a number of my friends had clicked through. I also saw that some of my friends also liked the show and that one of them shared his 2cents.

true-blood-glue-bar-2-10-30.png

Here’s how to post to Twitter from Glue:

true-blood-actions-10-30.png

1) Mouse over the item’s image in the Glue bar and select ACTIONS

 

 

 

 

 

twitter-actions-10-30.png2) Select SHARE ON TWITTER

 

 

 

 

 

 

 

 

 

 

post-to-twitter-10-30.png3) Add you twitter message, select “Automatically Share on Twitter when I add 2cents” and select POST IT!

 

 

 

 

 

 

 

 

From now on whenever you add 2cents to the things that you like you’ll share your thoughts on Twitter and your friends will be able to click through to discover something new and continue the conversation in context.

English

hakia Polls Web Searchers: 83% Think Obama Is Better for the Future of the Internet

October 30th, 2008

2008 is viewed as the first “Internet” presidential election. As the Washington Post aptly put it, the 2008 presidential race got the “whole world blogging.” Everyone was “Twittering” or blogging right after the debates this month. Media matched voters’ online activity with live streaming videos of Barack Obama and John McCain debates. YouTube has captured SNL’s Sarah Palin skits and her recent interview with Katie Couric. Lastly but not least, presidential candidates aggressively used the Internet for their campaigns.

We were curious to find out what Web searchers thought and asked one question: “Who is better for the future of the Internet, John McCain or Barack Obama?” Our on-going poll, which can be found at the homepage of hakia.com (please click “more” if you cannot see the poll), of over 2,700 respondents as of now is striking: 83% of Web searchers said that Barack Obama is better for the future of Internet- a win by a large margin.

This result may come to some as no surprise, as it has become widely known in the election that Obama’s campaign has mastered the art of new media in its efforts. Although this poll is by no means an indication of the outcome of this historic election, it does provide a flavor of the perception when it comes to gauging the level of “tech-savvy” the political candidates possess (or, at the very least, hakia users!).

For more information on the political candidates, try the hakia galleries for your one-stop search:

Barack Obama

John McCain

Joe Biden

Sarah Palin

We will keep the poll open until the Election Day. If you have not already voted, please tell us what you think!

English , , , , , , , ,

Sticky Web App launched from Adaptiveblue

October 30th, 2008
Improved Workspace

Image by Zach_Beauvais via Flickr

For me, one of the things which makes the idea Semantic Web so fascinating is the idea that “stuff” online is treated as real. What I mean is that the things we like online, are very often the the things we like offline: people, books, wine, gadgets are obvious examples. When I ogle the new Macbook range, I’m not thinking about the page about the Macbook, but the Lapple itself. My mates on social networking sites, or from their own blogs, are more than the sum of their profile boxes!

I’ve been playing with glue, for a few weeks, and it was officially launched the other day (which means I’m finally allowed to talk about it!) I wrote a piece about it over on my blog, so I won’t be long-winded here, or explain exactly how it works beyond a brief introduction (as the developers, AdaptiveBlue has a better description of how it works, and we’re big on reuse here). Glue is a firefox plugin which allows you to begin to interact with the “stuff” online.

One more thing I will mention, however, is that the “stuff” can actually be non site-specific with Glue. So, if I rate a book on Amazon, the book is actually marked on other sites which carry it! It’s the first time I’ve ever seen this kind of interaction, and I think it ties in heavily with visualisation, interaction, and even development of the Semantic Web.

Enhanced by Zemanta

English ,

A (very personal) bit of ISWC08 trendspotting

October 30th, 2008

As ISWC08 is drawing to a close, it dawns to me that something which Frank van Harmelen has been forecasting for years is now happening, seemingless without conscious effort. He calls it Approximate Reasoning - have a look at his ESWC06 keynote. The basic idea behind it is to do reasoning over ontologies with a different focus, namely by giving up some reasoning correctness in order to gain better scalability.

And indeed, at ISWC08 I have seen a number of things which fit exactly into this corner (while at the same time the authors/programmers might not even be aware of it).

  • As part of the Billion Triple Challenge, Axel Polleres presented the SAOR system, which does approximate OWL reasoning by means of forward chaining rules. Now you can’t do OWL reasoning (in a sound and complete way) with forward chaining rules (and Axel knows this), so in the end you’re losing some consequences. But at the same time you do get some consequences when having to deal with large amounts of data.
  • Eyal Oren, also at the Billion Triple Challenge, presented the MARVIN system which performs approximate RDF reasoning by means of massive parallelisation. MARVIN comes out of the EU project LarKC, which is actually pursuing approximate reasoning on a large scale (pun intended).
  • Among the results presented at ISWC08, I found those by Claudia D’Amato on Statistical Learning for Inductive Query Answering on OWL Ontologies really amazing. She and her collaborators managed to do OWL instance retrieval without any deduction algorithm. Instead they used Support Vector Machines and learned which (named) OWL classes individuals belong to. The learning was done from a small sample set (generated by a reasoner), but the network was able to generalise from the data to achieve about 90% of coverage. In my opinion, this is something conceptually new and it is really remarkable that it works.
  • In a regular paper Eyal Oren also reported on using Evolutionary Algorithms for RDF query answering.

The above is only a selection of approximate reasoning related things at ISWC08. There was also the Workshop on Nature inspired Reasoning for the Semantic Web where related ideas were discussed. At the colocated Web Reasoning and Rule Systems conference, RR2008, there will be two papers on approximate reasoning (incidentially, with me as coauthor).

I foresee the importance of such approaches rising substiantially in the future (and I think it’s a safe guess since Frank also seems to think so). The Billion Triple Challenge series could become one of the driving forums for this. There are exciting times ahead!

English , , , , , ,

The Day after Freebase went RDF

October 30th, 2008

So what’s been happening on the blogosphere after John Giannandrea’s keynote at ISWC and the revelation that Freebase now produces Linked Data from an RDF service

Tetherless World sums up the Freebase facts (e.g. 156,000,000 assertions made; 1370 published types; 75 domains; graph model, identity, web based) and further points out that ontology creation “is a social process, and both freebase and semantic wiki are tools that enable users to create ontological vocabulary without worrying too much on building a comprehensive ontology.”

Inkdroid notes that the RDF service release “is important news because Freebase is an active community of content creators, creating rich data-centric descriptions with a wiki style interface, fancy data loaders, and useful machine APIs.” This is followed up by a quick and handy tutorial how you can get machine readable data back from freebase using a URI with Freebase. Conclusion:

So why is this important? Because following your nose in HTML is what enabled companies like Lycos, AltaVista, Yahoo and Google to be born. It allowed for agents to be able to crawl the web of documents and build indexes of the data to allow people to find what they want (hopefully). Being able to link data in this way allows us to harvest data assets across organizational boundaries and merge them together. It’s early days still, but seeing an organization like Freebase get it is pretty exciting.

Yves Raimond was the first to wonder on the public W3C LOD mailinglist: “now, to see whether it links to other datasets :-)” - the idea of having linked data without the linkage would indeed seem like love’s labour lost. Semantic Focus / James Simmons seconds: “One downside is the data doesn’t appear to link to external resources, in a sense walling itself in. It should be trivial to link the topics that came from Wikipedia back to Wikipedia as well as DBpedia (which would be killer, by the way).” This is followed up a later post, where James expresses concerns regarding the relationship DBpedia / Freebase: “Freebase may see a drop in userbase growth and participation if it becomes a mirror of DBpedia (or vice-versa) and the popularity once garnered by one project may shift towards the other, or away entirely.”

More News / Andrew Newman puts the Freebase RDF service release in context with Cathrin Weiss’ “250 million triples on your iphone” submission, iMoCo, to the Billion triples challenges, also DBpedia and Semaplorer, developed at the University of Koblenz:

DBPedia stood out because it was the only one that allowed you to write data to the Semantic Web rather than just read the carefully prepared triples. For a similar reason I though SemaPlorer was good because they tried to do more than just the standard triples but went that extra bit further by making it more generic like integrating flickr. But they were all excellent, all of them showing what you get with a billion or more triples and inferencing.

That combined with the guys at Freebase making all of their data available as RDF and it was a big day for the Semantic Web.

ARQtick / AndyS plays a bit with the Blade Runner example cited by Freebase, e.g. takes a look at the graph, looks for interesting properties and extracts author names

N.B. If you want to follow ARQtick’s example: use the Linked Data browser plugin Tabulator or go to the Marbles site to view the RDF - without a data browser you’ll be redirected to the HTML page. You will also need it to make sense of rdf.freebase.com.

English , , , , , , , , , ,

Utility computing in the Cloud

October 30th, 2008

It is usually more interesting and educational to see a good heart-felt debate than complete agreement so you are in for a treat if you take the time to read the following from Nick Carr, Tim O’Reilly and the Smoothspan blog.

You can see from the debate that economics is at the heart of the discussion yet not understood in the same way by the three. I find myself pretty much in agreement with Tim, but it might be worth pulling out some of the strands to clarify. I think there is real confusion between economies of scale, direct and indirect network effects.

In this post I will focus on the utility computing layer in the cloud. I think the economics of platform as a service (PAAS), especially the cruical distinction between direct versus indirect network effects for defensibility, needs its own post.  

It’s pretty clear that utility cloud computing is highly capital intensive so it should come as no surprise that there are powerful economies of scale to be had. But the bottom line is that you are talking about plant and power. These are rival goods, scarce resources that are created and consumed. This is not different from many utility industries with one exception: the distribution network has global reach, already exists and is very cheap compared to existing utility distribution networks. It is a lot cheaper to access a computing resource on the other side of the planet than it is to send electricity or gas across the globe. So maybe Hugh McLeod ) is right. What is to stop economies of scale turning this into a global natural monopoly?

Actually, unless there are some large network effects, quite a lot stops single companies ruling entire industries. For a start, without network effects, economies of scale tend to run out: the curve is usually U-shaped ( take a look at http://en.wikipedia.org/wiki/Economies_of_scale ). Telecoms, Gas, rail companies have strong network effects from their infrastructure—it makes little sense to have duplicate rail networks or gas networks in a country. Utility computing does not have this advantage because the distribution network is not owned by them.

Smoothspan argues there are two potential network effects that could cause a single winner.

1) Lower costs of data exchange between apps in the same cloud

2) Elasticity

There is a network effect based on increased costs for cross-cloud interoperability, exactly as we have with mobile phone networks today. I don’t think this is a significant, long-term issue because we are talking about a relatively small number of cloud providers thanks to capital costs. Ironically, that means the cost of providing massive high speed bandwidth BETWEEN different cloud providers is actually very small; especially when compared with the cost of providing large bandwidth to every single home and mobile phone in the world. And, of course, the backbone telecoms providers are already geared up to provide exactly this kind of point to point, high capacity infrastructure.

If a cloud provider artificially inflated their cross cloud costs, they would directly cut the available data-sharing applications for a customer and would suffer a big negative network effect compared to providers that ensured their cloud was as open to cross cloud use as possible. Would you choose the walled garden?

Regarding the second point: I think Smoothspan is confusing economies of scale with network effects. A larger provider can more easily deal with variation of demand, but this is an economy of scale (the cost of providing variable demand of size X to a customer is lower for a bigger player) and in fact is a negative network effect; just like your Internet connection at home. If every other customer stopped using the service there would be more capacity available for you. If everyone is using the service there is less capacity available: a negative network effect. Just as with the power grid, dealing with variation of demand is more easily managed with multiple providers that can be called on when require. In the single supplier model, they have no one to share demand peaks with and must over-provide capacity far in excess of a shared model.

For me the bottom line on utility computing is that it is very much like the provision of telecoms and power but without the network effect of owning the network. I would not be surprised to see backwards integration along the supply chain in this industry (i.e. a power generator and a bulk telecoms provider might have the infrastructure and capital structure to build data centres more cost effectively than Google, Amazon or MS as the market matures).

This market is no where near mature. I expect that Google, Amazon and MS are still there own biggest cloud customers.

With the rise of utility computing in the cloud, it will soon become very easy to create a PAAS offering because the utility computing provider absorbs the large fixed costs and rents the infrastructure to the PAAS provider on an incremental marginal cost basis. This is very similar to the virtual mobile network operators (like Virgin) which ride on the back of the network providers. The difference here is that the PAAS has the chance to create powerful network effects. 

So to summarise, utility cloud computing is firmly built on economies of scale where as I think cloud based platforms (PAAS) need to be firmly built on the economics of network effects to be defensible. An interesting battle ground for PAAS seems to be centred around the difference between software centric and data centric network effects, but more on that in a later post. 

English , , , , , , , , , , , , , , , , , ,

Cross-Pollinating DBpedia and Freebase

October 29th, 2008

Cross-Pollinating DBpedia and FreebaseNow that Freebase is available as Linked Data a big question that comes to mind is whether these two major projects will move to assimilate one another. DBpedia and Freebase – two endeavors primarily focused on curating unstructured and semi-structured data about everything and releasing it back into the wild (with structure) – get the bulk of their information from Wikipedia, so the amount of topical overlap is assumed to be extremely high. DBpedia gains new information when it extracts data from the latest Wikipedia dump, whereas Freebase, in addition to Wikipedia extractions, gains new information through its userbase of editors.

It is this incredible amount of overlap (with regard to content and purpose) which creates a sort of paradox, where it can be speculated that DBpedia and Freebase would both gain and lose value through efforts to cross-pollinate. Assimilating each other's updates would cause both to become "more complete" (in the same sense that an incrementing number is closer to infinity after each increment), thus gaining value. However, both may lose value as well if "value" is the perception of being "the most complete database about everything." Freebase may see a drop in userbase growth and participation if it becomes a mirror of DBpedia (or vice-versa) and the popularity once garnered by one project may shift towards the other, or away entirely.

This may not be an actual paradox since we're talking about mixing two different perceptions of value (value from the developer's point of view and value from the point of view of the project itself), but we must still look at it from both vantage points. This may simply be another issue of business interest vs. developer interest. All issues regarding popularity and ubiquity aside, cross-pollination is a Good Thing for the purposes of the Semantic Web and Linked Data in general.

Got something to say? Leave a comment!

English

Schedule Thursday, 30.10

October 29th, 2008

Thursday Oct 30

9.00-10.00 Keynote 3: Message in a Bottle or: How can the Semantic Web Community be more convincing? (Johannes Brahms hall)
Stefan Decker, DERI
10.00-10.30 Coffee Break
10.30-12.30 Research 1: Web Data and Knowledge (Johannes Brahms hall)
Chair: Luciano Serafini

  • RDFS Reasoning and Query Answering on top of DHTs
    Zoi Kaoudi, Iris Miliaraki, and Manolis Koubarakis
  • An Interface-based Ontology Modularization Framework for Knowledge Encapsulation
    Faezeh Ensan and Weichang Du
  • On the Semantics of Trust and Caching in the Semantic Web
    Simon Schenk
10.30-12.30 Research 2: Semantic Web Services (Johann Peter Hebel hall)
Chair: Vasant Honavar

  • Semantic Web Service Choreography: Contracting and Enactment
    Dumitru Roman and Michael Kifer
  • Formal Model for Semantic-Driven Service Execution
    Tomas Vitvar, Maciej Zaremba, and Adrian Mocan
  • Efficient Semantic Web Service Discovery in Centralized and P2P Environments
    Dimitrios Skoutas, Dimitris Sacharidis, Verena Kantere, and Timos Sellis
10.30-12.30 In Use: Knowledge Management (Alfred Mombert hall)
Chair: Mike Dean

  • Thesaurus-based search in large heterogeneous collections
    Jan Wielemaker, Michiel Hildebrand, Jacco van Ossenbruggen, and Guus Schreiber
  • Deploying semantic web technologies for work integrated learning in industry. A comparison: SME vs. large sized company
    Conny Christl, Chiara Ghidini, Joanna Guss, Viktoria Pammer, Stefanie Lindstaedt, Peter Scheir, Luciano Serafini, and Marco Rospocher
  • Creating and Using Organisational Semantic Webs in Large Networked Organisations
    Ravish Bhagdev, Ajay Chakravarthy, Sam Chapman, Fabio Ciravegna, and Vitaveska Lanfranchi
  • An architecture for semantic navigation and reasoning with patient data - experiences of the Health-e-Child project
    Tamás Hauer, Dmitry Rogulin, Sonja Zillner, Andrew Branson, Jetendr Shamdasani, Alexey Tsymbal, Martin Huber, Tony Solomonides and Richard McClatchey
12.30-14.00 Lunch break
14.00-15.30 Research 1: Semantic Social Networks (Johannes Brahms hall)
Chair: Fausto Giunchiglia

  • Exploring Semantic Social Networks using Virtual Reality
    Harry Halpin, David Zielinski, Rachael Brady, and Glenda Kelly
  • Semantic Grounding of Tag Relatedness in Social Bookmarking Systems
    Ciro Cattuto, Dominik Benz, Andreas Hotho, and Gerd Stumme
  • Semantic Modelling of User Interests based on Cross-Folksonomy Analysis
    Martin Szomszor, Harith Alani, Ivan Cantador, Kieron O’Hara and Nigel Shadbolt
14.00-15.30 Research 2: Rules and Relatedness (Johann Peter Hebel hall)
Chair: Michael Kifer

  • ELP: Tractable Rules for OWL 2
    Markus Krötzsch, Sebastian Rudolph, and Pascal Hitzler
  • Term Dependence on the Semantic Web
    Gong Cheng and Yuzhong Qu
  • Semantic relatedness measure using object properties in an ontology
    Laurent Mazuel and Nicolas Sabouret
14:00 Lightning Talks (Alfred Mombert hall)
15.30-16.00 Coffee break
16.00-17.30 Closing Ceremony and Best Paper Awards (Johannes Brahms hall)
Outlook on ISWC2009

English , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Calais 3.1 Goes to Production on Monday November 3

October 29th, 2008

Calais Community:

 

A quick note to let you know that the Calais Release 3.1 technology preview moves to Calais production this Monday 11/3. If you haven't had a chance to test with 3.1 - this is the time to do it.

Details on the 3.1 release are located here

English

Freebase Officially Linked Data with Release of RDF Service

October 29th, 2008

At ISWC2008 Freebase released its new RDF service for generating RDF representations of Freebase topics, allowing Freebase to be used as Linked Data! To obtain the RDF data for a topic send a GET request to http://rdf.freebase.com/rdf/some.topic.id where "some.topic.id" is replaced by the desired topic identifier (slashes in the identifier must be replaced by dots). Topic data can be represented as N3, RDF/XML or Turtle depending on the preferences expressed in your client's HTTP Accept header. Try it out with the Freebase topic Semantic Web.

You can also cater to clients that prefer HTML output by using the /ns end-point (http://rdf.freebase.com/ns/some.topic.id). The service performs the content negotiation automatically; delivering human-friendly HTML representations to Web browsers, and redirecting clients expecting RDF to the /rdf URL (via 302 redirect).

One downside is the data doesn't appear to link to external resources, in a sense walling itself in. It should be trivial to link the topics that came from Wikipedia back to Wikipedia as well as DBpedia (which would be killer, by the way).

Got something to say? Leave a comment!

English , , , , ,

Interview for Journalism.co.uk… Journalists get to know the Semantic Web!

October 29th, 2008

I was interviewed last week by Colin Meek from Journalism.co.uk on the topic of “Web 3.0″ and what it means for journalists… You can read the full article in two parts (1, 2). My original answers are part of an interview on their Insite blog. I also had the chance to talk about various DERI offerings in the Semantic Web area including SIOC, SWSE, Sindice, Semantic Radar, etc.

Colin also asked me about other readable data that is being crawled by Semantic Web search engines like Sindice, SWSE or Swoogle. These search engines can usually match keywords in any data that has been crawled or integrated into a semantic store, not just people. It could be from structured information about people, places, dates, library documents, blog items or topics, whatever. In fact, there is no limit to the types of things that can be indexed and searched - since RDF (an open data model that can be adapted to describe pretty much anything) is used as the data format. Anyone can reuse existing RDF vocabularies like SIOC to publish data, or they can publish data using their own custom vocabularies (e.g. to describe stamp collecting or Bollywood movie genres or whatever), or they can combine public and custom vocabularies (e.g. take FOAF and your own vocabulary about soccer to describe players and managers on a soccer team). Geotemporal information is particularly useful across a range of domains, and provides nice semantic linkages between things. For example, having geographic information and time information is useful for describing where people have been and when, for detailing historical events or TV shows, for timetabling and scheduling of events, etc., and for connecting all of these things together (”I’m travelling to Edinburgh next week: show me all the TV shows of relevance and any upcoming events I should be aware of according to my interests…”).

The keyword searches in the Sindice search engine allow you to find more information on where resources of interest are (searching for “john breslin” will point to all public pages that contain semantic information about yours truly). Sindice also has an API that can provide results in a resuable (semantic) format that can be leveraged by other applications. Alternately, SWSE (Semantic Web Search Engine) shows you semantic information about the object of interest (e.g. my phone number, my friends, etc.) which may be derived from multiple sources (e.g. this information on me comes from tens of sources consolidated together via unique identifiers for me or through what’s called “object consolidation”).

For me, this article highlighted the fact that the Semantic Web community needs to be very aware that one of the key features of the Social Web for journalists and for many others is the ability to find a lot of personal and sensitive information on people, and with the advent of “Web 3.0″, we need to realise that (”with great power comes great responsibility”) the availability of contextual and semantically-related information is going to become even more apparent, and people will talk about it in both positive and negative terms. Educating site owners about what semantic data they may be publishing (knowingly or unknowingly, even if it’s just RSS feeds) is needed, and developers should determine exactly what opt-in or opt-out mechanisms are required before implementing semantic solutions. Users also should be aware of the benefits and other potential uses of their semantic data.

I think now is the time to avert any scares, because in reality, the data that is on the Web or the Social Web can be used in new ways anyway, whether metadata is present or not (some facts can be derived). Google have recently implemented some discussion forum parsing algorithms to determine how many posts are on a thread, how many users posted on that thread and when the last post was made. You can see this in a search result I did for “irish pubs boards.ie” below. It’s not complete, and probably relies on identifying certain HTML structures for non-Google discussion sites, e.g. you can see two threads in the middle that don’t show details of the total posts or commenters. But it’s moving towards the SIOC vision of providing more metadata about discussions on the Web to help you in finding more relevant information - whether the site owners want to provide Semantic Web data or not!

Making data available semantically enables computers to help us do things we cannot easily do (or cannot do at all) right now, and this is what makes it so powerful. We also need to think more towards educating people about the benefits as well as how we can minimise any hazards. Is this a job for W3C SWEO? As my colleague James Cooley said: “I think scientists thought the benefits of GM food were so obvious that there was no case to make. Then you got Frankenstein Food and the game was up.”

For journalists interested in the Semantic Web, I’d recommend reading this paper entitled “SemWebbing the London Gazette” by Jeni Tennison and John Sheridan which describes how they have exposed information from their newspaper website using RDFa so that it becomes easy to re-use (slides here). You can also view some interesting slides by Colin Meek from a seminar he gave to journalists about the Social Web in Olso a few days ago. It’s in three parts (1, 2, 3). I’ve embedded the third part (on the Semantic Web) below…

Other posts referencing this article:

English , , , , , , , , , , , , , , , , , , , , , , , , , ,

Complementos, plugins de humanización de la web semántica, colaboración en LBV.

October 29th, 2008

Os dejo la prometida segunda parte o aplicación práctica de los conceptos que introducíamos hace un par de días. Extraigo parte del contenido aquí de la colaboración publicada hoy en La brújula verde:

Sería la continuación del artículo: Web semántica y redes sociales. Más allá de los buscadores “Google Killer” . Hablábamos allí de SIOC y lo hacemos ahora de cómo implementar SIOC en nuestros blogs, webs o perfiles en diversas redes sociales:

WordPress SIOC Exporter: genera metadatos SIOC instalando 2 archivos en la carpeta de plugins de nuestro Wordpress autoalojado. Es plugin más utilizado como SIOC exporter.

Existen desarrollos, también, para Drupal, PHPbb, B2Evolution e incluso Twitter que podéis consultar desde esta página del W3C.

Los resultados, si os animáis, podéis comprobarlos descargando la segunda herramienta que quería presentaros hoy:

Semantic radar, un fantástico complemento para Firefox que nos permitirá ver los contenidos RDF de cualquier página que los contenga (además de realizar un ping automático a Pingthesemanticweb, servicio que recoge actualizaciones al respecto.

En la barra de tareas, previo reinicio típico del navegador y al visitar vuestro blog, los símbolos, activados, de SIOC y de contenido semántico indicarán que lo habéis hecho bien y el perfil de vuestro espacio ya es parte de la web semántica.

Spanish

You Are a Brilliant Knowledge Engineer?

October 28th, 2008






Challenge your knowledge engineering skills and
solve three challenging tasks!

The Grand Prize is an iPod Nano!

Discover more about Ontoprise’s contest
in the SMW User Forum.

English

Schedule Wednesday, 29.10

October 28th, 2008

Wednesday Oct 29

9.00-10.00 Keynote 2: Freebase: An Open, Writable Database of the World’s Information (Johannes Brahms hall)
John Giannandrea, Metaweb Technologies Inc.
10.00-10.30 Coffee Break
10.30-12.30 Research 1: OWL (Johann Peter Hebel hall)
Chair: Frank van Harmelen

  • Combining a DL Reasoner and a Rule Engine for Improving Entailment-based OWL Reasoning
    Georgios Meditskos and Nick Bassiliades
  • Improving an RCC-Derived Geospatial Approximation by OWL Axioms
    Rolf Grütter, Thomas Scharrenbach, and Bettina Bauer-Messmer
  • OWL Datatypes: Design and Implementation
    Boris Motik and Ian Horrocks
  • Laconic and Precise Justifications in OWL
    Matthew Horridge, Bijan Parsia and Ulrike Sattler
10.30-12.30 Research 2: Ontology Alignment (Johannes Brahms hall)
Chair: Harith Alani

  • Learning Concept Mappings from Instance Similarity
    Shenghui Wang, Gwenn Englebienne, and Stefan Schlobach
  • Instanced-based mapping between thesauri and folksonomies
    Christian Wartena and Rogier Brussee
  • Collecting Community-Based Mappings in an Ontology Repository
    Natasha Noy, Nicholas Griffith, and Mark Musen
  • Algebras of ontology alignment relations
    Jérôme Euzenat
10.30-12.30 Industry Talks 1 (Alfred Mombert hall)
Chair: Joel Sachs

  • Semantic Wikis: Fusing the two strands of the Semantic Web
    Mark Greaves (Vulcan Inc.)
  • Internet of Services
    York Sure (SAP Research)
  • Semantic Web @ BBN
    Mike Dean (BBN)
  • Data Intelligence
    Evelyne Viegas (Microsoft Research)
12.30-14.00 Lunch break
14.00-15.30 Research 1: Description Logics (Johann Peter Hebel hall)
Chair: Gerd Stumme

  • Scalable Conjunctive Query Evaluation Over Large and Expressive Knowledge Bases
    Julian Dolby, Achille Fokoue, Aditya Kalyanpur, Li Ma, Edith Schonberg, Kavitha Srinivas, and Xingzhi Sun
  • A Kernel Revision Operator for Terminologies- Algorithms and Evaluation
    Guilin Qi, Peter Haase, Zhisheng Huang, Qiu Ji, Jeff Z. Pan, and Johanna Voelker
  • Description Logic Reasoning with Decision Diagrams: Compiling SHIQ to Disjunctive Datalog
    Sebastian Rudolph, Markus Krötzsch, and Pascal Hitzler
14.00-15.30 Research 2: User Interfaces (Johannes Brahms hall)
Chair: Natasha Noy

  • RDF123: from Spreadsheets to RDF
    Lushan Han, Tim Finin, Cynthia Parr, Joel Sachs, and Anupam Joshi
  • Evaluating long-term use of the Gnowsis Semantic Desktop for PIM
    Leo Sauermann and Dominik Heim
  • Bringing The IPTC News Architecture into the Semantic Web
    Raphael Troncy
14.00-15.30 Industry Talks 2 (Alfred Mombert hall)
Chair: Fuesane Cheng

  • Semantic Web in Asia: Example Use Cases
    Tony Lee (SaltLux)
  • Making the Web searchable
    Peter Mika (Yahoo Inc)
  • SemanticWeb from an industry perspective
    Jürgen Angele (Ontoprise)
15.30-16.00 Coffee break
16.00-18.00 Semantic Web Challenge & Billion Triple Challenge (Johannes Brahms hall)
19.30-20.00 Reception & Get Together (Foyer)
20.00-01.00 Dinner & Dancing (Weinbrennersaal)

English , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

OCLC Updated Record Use Policy - clarification

October 28th, 2008

I appended an update to my previous posting on the impending OCLC announcement about changes to their policy on record usage.    The update pointed to a response from Karen Calhoun, Vice President, WorldCat and Metadata Services, that was posted in the comments to a Tim Spalding posting on the Thingology Blog about today’s rumors.

I believe Karen’s comments deserve a bit more analysis than just a link, so here goes…

From the following, it is clear that the OCLC Member’s Council delegates were not happy with the situation, as reported, either:

I regret NYLINK’s premature messages to its member libraries during this week. This week was intended as the updated policy’s discussion period with Members Council delegates and directors of OCLC’s regional networks. NYLINK’s message expresses the understanding that "access and use of OCLC online systems by any institution after November 2nd will be taken as acceptance of the new policy." Because of concerns expressed by the Members Council delegates and network directors since last Monday, October 20, the sentence to which the NYLINK message refers is no longer in the policy.

Karen goes on to say that she believes the updated policy when it is announced will be generally welcomed by the OCLC membership…

… as it opens WorldCat records to new, noncommercial uses by members and non-member libraries alike.

The intent of the updated policy, due for release next week, is to:
* Respond to the changing information landscape
* Modernize the language of the Guidelines
* Make it clearer how WorldCat records can be used and shared
* Expand the opportunities for record sharing among members and non-members
* Clarify OCLC’s support for WorldCat data sharing that encourages innovation and benefits libraries while protecting OCLC members’ investment in WorldCat

Further on she assures us that November 2nd isn’t a day to get your lawyer out of bed early to confirm the safety of continuing to use OCLC services, but the day to check out their new new Web pages to inform members about the updated policy and provide its text.

No doubt this will not decrease the interest in what is about to emanate from Dublin, but it may allay some of the worst fears about imposed change that have been circulating the lists, channels, and blogs today.

 

Technorati Tags: ,,