Archive

Archive for November, 2009

Raptor 1.4.20 RDF syntax library released

November 28th, 2009

Released Raptor 1.4.20 as a bug fix release – no ABI or API changes but fixes for wrong-to-spec bugs, crashes and performance. Raptor 2 will contain ABI/API changes and have new features when it is released – no ETA. My main development focus returns to Rasqal, it’s new query engine and full SPARQL 1.0 support, which is coming along well.

The main changes in the new Raptor version are:

  • Turtle serializing performance improvement by Chris Cannam
  • librdfa RDFa parser updates to fix empty datatype, xml:lang and 1-char prefixes by Manu Sporny
  • Fix a crash when the GRDDL parser reported errors
  • Enable large file support for 32-bit systems
  • Several resilience improvements by Lauri Aalto
  • Other minor portability and bug fixes
  • Fixed reported issues: 0000306 0000307 0000310 and 0000312.

See the Raptor 1.4.20 Release Notes for the full details of the changes.

Download it from: raptor-1.4.20.tar.gz

SKOS Primer translated to Spanish

November 28th, 2009
Juan Antonio Pastor Sánchez and Javier Martínez Méndez have published a Spanish translation of the SKOS Primer, under the title “Manual de SKOS (Simple Knowledge Organization System, Sistema para la Organización del Conocimiento simple)”.

Activity news, English

Karen Calhoun completes a conversation with Talis

November 27th, 2009

sm_calhoun_karen When recording my previous Talking with Talis podcast with OCLC’s Karen Calhoun, in a hotel lobby over the road from the British Library in London, we suffered a technology failure loosing the last third of our conversation.

Karen kindly agreed to spend some time in a follow up conversation so that listeners could get to hear her thoughts on a couple of further questions I asked, including one about the future for library metadata formats. 

In addition I also gained the opportunity to ask her reflect upon the presentation she gave on that day.  The slides for which are available to view from the OCLC site.  The other benefit being that we were not competing with the music, staff, and hotel guests during the recording.

Technorati Tags: ,

English, Uncategorized

Atictes, Jornades Internet mòbil: Web 3.0 mobile.

November 27th, 2009

Os dejo la presentación que realizaré el próximo domingo desde Citilab Cornellá para las jornadas Atic 2a Internet Mòbil.

Incluye alguna referencia sobre Mobile learning (enlaza la presentación al respecto de Hugo Pardo en Virtual Educa).

En la intersección de este con la Realidad Aumentada, como decíamos en Construcción colaborativa de la Realidad Aumentada, creo que están muchas de las claves del Futuro del e-learning y la educación:

Compártelo



  • BarraPunto
  • del.icio.us
  • Tumblr
  • Twitter
  • Facebook
  • Google Bookmarks
  • Meneame
  • MisterWong
  • StumbleUpon
  • Technorati
  • LinkedIn
  • Wikio
  • Bitacoras.com
  • Diigo
  • FriendFeed
  • Netvibes
  • Ping.fm
  • Posterous
  • PDF
  • Print

Planeta educativo, Redes sociales, Spanish, Videotutoriales, Web Semántica, cloud computing-web 4.0, futurismo, herramientas para blogs, herramientas semánticas, medios, móviles, periodismo ciudadano, web3.0

The POWDER Working Group is now closed

November 26th, 2009
After having successfully published the POWDER Recommendation, the POWDER Working Group is now closed. For more details, see the group's (last) blog entry...

Activity news, English

Slow Adoption of Linked Data: Why?

November 25th, 2009


During the November issue of Semantic Web Gang podcast, we also talked about DBpedia, which led me to share new thoughts about Linked Data. And more specifically, two things: 1- my growing concerns that adoption of Linked Data is going slower than expected (an observation I will test further) given the R&D investment and focus of a large community including the inventor of the web and the W3C, which led us to anticipate a faster growth curve. 2 - the possibility this slow adoption rate might be due to inherent flaws in the design of Linked Data. Even if adoption is in fact rapid (which I don't think it is), I am nonetheless concerned those flaws could also limit the potential of Linked Data to "turn the Web into a database", as per the main promise associated with the Semantic Web by its apostles (to this day...).

English, Knowledge Management, Knowledge Representation, Linked Data, Resource Description Framework, Semantic Web, Swirrl, Talis, Thought Express, Tim Berners-Lee, artificial intelligence, commercialization, dbpedia, market adoption, market development

Discussing the Semantic Portals of Eqentia with the Gang

November 25th, 2009


Paul Miller has published the Semantic Web Gang podcast we recorded in November, in which we first talk about Eqentia with its CEO (and also friend of mine) William Mougayar. Eqentia is a web application offering pre-made vertical portals leveraging an ontology-based search engine to extract and distribute relevant information.

English, Knowledge Management, Ontologies, Semantic Web, Web 3.0

Nokia en 2015 – mirador al futuro de la Internet móvil

November 25th, 2009

Me envían este interesante mirador a cómo imagina el futuro, la Internet móvil, Nokia.

Estaré hablando sobre Web 3.0 y dispositivos móviles en Atictes el próximo domingo, así que con la charla de la consultora Morgan Stanley en el Web 2.0 Summit (La próxima web es la de las redes sociales móviles) y las imágenes que sobre el futuro de la web (la web al cuadrado), además de un artículo de la propia Betina Lippenholtz (que nos envía el vídeo), de  educ.ar (La realidad aumentada. Educación e inmersión. Una buena dupla para reflexionar sobre las posibilidades de las nuevas tecnologías) y algunas otras fuentes e ideas propias, creo que podré elaborar algo interesante.

Sobre el vídeo, comentar lo acertado de su estrategia en los Social Media, impulsando la conversación y el feedback de sus actuales o futuros clientes mediante una pregunta clave: ¿Cuáles de las funciones, de los servicios  os atraen, impresionan más?:

Las primeras opiniones interpretan un futuro en que la información, el conocimiento y las relaciones parecen ser más valoradas que  cuestiones relacionadas con métodos de pago, banca online, etc…

Parece, así, que fue la posibilidad de identificar personas, obteniendo mediante dispositivos de AR (Realidad aumentada) los datos de sus perfiles en redes sociales, su localización, etc… la que más daba que hablar.

Aparece también  lo que creo que cuando esté más desarrollado será un hito importante en la evolución de las redes sociales: la posibilidad de compartir el visionado de televisión (sí, parece que sobrevive a la imaginación de un futuro cercano) con amigos.

La proyección de películas de alta definición en cualquier lugar y la búsqueda contextual (según tiempo, lugar y relación en el grafo social) inteligente también destacan entre los intereses de los Prosumers de Nokia. A ver qué nos parece a nosotros…

Compártelo



  • BarraPunto
  • del.icio.us
  • Tumblr
  • Twitter
  • Facebook
  • Google Bookmarks
  • Meneame
  • MisterWong
  • StumbleUpon
  • Technorati
  • LinkedIn
  • Wikio
  • Bitacoras.com
  • Diigo
  • FriendFeed
  • Netvibes
  • Ping.fm
  • Posterous
  • PDF
  • Print

2009, Campañas, Planeta educativo, Redes sociales, Spanish, Videotutoriales, Vídeos, Web 3.0, cibercultura, cloud computing-web 4.0, curiosidades, dispositivos, educación 2.0, empresa 2.0, filtrado de contenidos, futurismo, futuro web, innovación, lifestreaming, medios, multimedia, móviles, nokia, redes sociales móviles, social media, video-documentales, video-publicidad, videos-creacion, web 2015, web móvil, web3.0

OCLC’s Karen Calhoun Talks with Talis

November 23rd, 2009

sm_calhoun_karen british library I caught up with Vice president of OCLC WorldCat and Metadata Services, Karen Calhoun, in the lobby of a hotel across the road from the iconic British Library building in London.  Karen was preparing for her presentation at the 2009 OCLC Tech Forum to be held in the Library conference centre.

I took the opportunity to talk to her about the last twelve months since the announcement about changes to the OCLC record reuse policy.  We then moved on to discuss how new entrants, Biblios and SkyRiver, in to the record supply sector may alter that landscape.

As well as discussing the themes for her presentation later that morning, we also explored the blurring of the boundaries between OCLC’s traditional record supply focus and the ILS vendor community offering library automation software.

Technorati Tags: ,

English, Libraries, OCLC, Podcast

Llegaron los implantes cerebrales, somos infornívoros y la singularidad va en serio.

November 20th, 2009

Pensamos en dispositivos, en teléfonos móviles, cuando nos referimos a Ubicuidad,  a la posibilidad de acceder a Internet desde cualquier lugar. Y es que, aunque sabíamos ya de la existencia de fármacos actuales para aumentar la inteligencia, parecía que los implantes iban a tardar algo más en llegar.

Me sorprendía hace unas horas la noticia de que IBM había logrado replicar la Inteligencia de un gato. Recordábamos entonces los vídeos de Hergueta: Cuando los ordenadores puedan emular la capacidad de procesamiento de la información de nuestros cerebros, podría emerger incluso la conciencia.

Reflexionábamos en el mismo sentido acerca de  un artículo de Jamais Cascio sobre la Inteligencia aumentada:

A diferencia de otras especies, no dependeremos de la evolución natural que incremente nuestra inteligencia para sobrevivir:  Tecnología, farmacología impulsarán aquello que nos hace únicos: nuestra inteligencia.

Los científicos hablan de que hace 12,000 años de la última era glaciar (holoceno). Fue cuando emergió la civilización humana y nuestra co-evolución con herramientas y tecnologías que nos permiten adaptarnos mejor al entorno físico.

Si el aumento de la inteligencia tiene el impacto esperado, pronto entraremos en una nueva era. El foco de nuestra evolución tecnológica estará en menor medida en cómo adaptarnos al entorno físico y en mayor en cómo adaptarnos a la enorme cantidad  de conocimiento que hemos creado.

Es la Singularidad, definida en Wikipedia así:

“En futurología, la singularidad tecnológica (algunas veces llamada simplemente la Singularidad) es un evento futuro en el que se predice que el progreso tecnológico y el cambio social acelerarán debido al desarrollo de inteligencia superhumana, cambiando nuestro ambiente de manera tal, que cualquier ser humano anterior a la Singularidad sería incapaz de comprender o predecir.”

Dicho en otras palabras, lo que vivimos es un proceso evolutivo en por el que nosotros y sistemas digitales nos convertimos en más rápidos, más sofisiticados y más capaces a la par.

Diría Danny Hillis: “A largo plazo, Internet llegará a ser una infraestructura muy rica, tanto, que las ideas podrán evolucionar fuera de la mente humana”. La relación con el Conectivismo es evidente, aportando un elemento más para poder considerar este último como la teoría del aprendizaje más consecuente con la evolución que vivimos de la web.

Estamos, según Cascio, co-evolucionando: aprendemos para adaptar nuestro pensamiento, nuestras expectativas a esos sistemas digitales y a la vez nuestra inteligencia crea sistemas cada vez más complejos (fármacos que aumentan las capacidades cognitivas, implantes tecnológicos como el que motiva este post) y poderosos para satisfacer las expectativas del ser Infornívoro:

El término es acuñado por Schirrmacher, a partir de un comentario de George Dyson: “Y si el precio de que las máquinas piensen es que las personas dejemos de hacerlo?

Creo que pronto podré presentaros una traducción del artículo completo, pero como adelanto me sorprendía la noción de que podemos ver las plataformas computacionales como sistemas sociobiológicos que permiten repensar tres de los conceptos más importantes del siglo XIX: El Taylorismo (multitasking), El Marxismo (contenidos libres, copyright) y el Darwinismo (algoritmos de búsqueda y “foraging” -que traduzco como devorar- de información).

La perspectiva Darwiniana es la más interesante: La información es una ventaja adaptativa para los informávoros y el software que la codifica imita los hábitos de alimentación del hombre prehistórico).


Pues bien, volviendo al tema que motiva este post,  se añadiría a la perspectiva informávoro-adaptativa lo que leo en RWW, sobre investigadores de Intel en Pittsburgh que  explicaban cómo se están probando implantes tecnológicos en cerebros voluntarios con el objetivo de ayudarnos a navegar la red, manipular documentos, etc… Surfear la web a través del poder del pensamiento…..

La tecnología al servicio del ser humano (somos tecnología) dibuja un escenario de Singularidad que me parece mucho más probable (y atractivo) que el que imagina las máquinas tomando el control e incluso reemplazando la vida basada en el DNA, como hacía recientemente Stephen Hawking (“Humans Have Entered a New Stage of Evolution”, Los humanos hemos entrado en un nuevo estadio de la evolución).

Compártelo



  • BarraPunto
  • del.icio.us
  • Tumblr
  • Twitter
  • Facebook
  • Google Bookmarks
  • Meneame
  • MisterWong
  • StumbleUpon
  • Technorati
  • LinkedIn
  • Wikio
  • Bitacoras.com
  • Diigo
  • FriendFeed
  • Netvibes
  • Ping.fm
  • Posterous
  • PDF
  • Print

2009, Evolución, Planeta educativo, Spanish, Web 3.0, cognitivismo, darwinismo, futurismo, información, infornivoros, lifestreaming, singularidad, web3.0, zeitgeist evolución

Twitter API enables geotagging

November 20th, 2009

Twitter turned on its API for geotagging tweets yesterday, as announce in in a post on their blog, Think Globally, Tweet Locally. Currently, geographic information will only be associated with your tweets if you use an application that adds it and will only be used to display your tweets when viewed with an application that can exploit it. Here’s the way Twitter described it.

“This release is unique in that it’s API-only which means you won’t see any changes on twitter.com, yet. Instead, Twitter applications like Birdfeed, Seesmic Web, Foursquare, Gowalla, Twidroid, Twittelator Pro and others are already supporting this new functionality (go try them out now!) in interesting ways that include geotagging your tweets and displaying the location from where a tweet was posted.”

Examining Twitter’s status update API description shows how one associates a location with a Tweet. Pretty simple.

Since disclosing your location raises privacy concerns, Twitter has made geotagging an opt-in service and also allows users to delete all of the location information associated with their tweets. Moreover, their policy, as described here, says

“We require application developers to be upfront and obvious about when they are Geotagging an update. If you ever find that an application is doing it without notifying you, please let us know.”

Twitter has updated its privacy policy to cover location information.

You can read more on ReadWriteWeb and Techcrunch.

English, Semantic Web, social media, twitter

Web Integrated Data

November 19th, 2009

Last Friday I spoke at the Open Knowledge Foundation Open Data & The Semantic Web event. I was giving the opening talk of the day and thought that I’d take the opportunity to lay out a view that I’ve been meaning to articulate for some time: that integrating data with the web maximises its utility. Moving from data dumps, through APIs, and to Linked Data we maximize utility by reducing the amount of effort required to interact with data.

While there’s clearly still a lot of work to do around creating ways to visualise and explore Linked Data, the simply utility of being able to browse a dataset means that we move beyond publishing for a developer audience to publishing for anyone who can wield a browser. This is the angle to the Semantic Web vision that is most often overlooked in my opinion.

Developers often claim that “I can do the same thing using technology X, so why use technology Y”. In this early adopter phase of the Semantic Web its perfectly valid and important to critique the technology; to measure its ease of use and benefits for developers. But for me the end game is to move to a world where anyone can easily do complex manipulations on data — without resorting to writing code — because there’s enough machine support to make it achievable. That’s what standard vocabularies and a common data model enables. And its a natural part of the evolution towards increasingly declarative ways of manipulating information.

I’ll do a proper write-up of the presentation some other time, but for now here are the slides:

English, Semantic Web, Web

Linked Data Liminal Zones

November 19th, 2009

One of the things that has interested me for some time now is how RDF and Linked Data enables communities to enrich information published by organizations, e.g. by annotating it with additional properties and relationships (links). This is after all, one of the intended goals of the technology: to make it easier for people to converge on common names for things and collectively share data about those things.

The ability to publish URIs for things, and then have those URIs decorated by a motivated community with additional metadata, provides organizations with an interesting way to take advantage of Linked Data. The enriched data can be reused by the organization to improve its own datasets and used to drive improved processes, new product development, etc.

The interesting angle is that while both the organization and the community directly benefits from the sharing (both gain access to data they wouldn’t have normally, or at least without extra expense) there are some asymmetries in the relationship. Specifically, an organization worried about its brand is likely to have higher, or at least different, standards for reliability and quality than its community; especially so it we consider only the non-commercial users in that community. Before the organization may ingest and republish this data (e.g. on its website) then those standards and a certain amount of filtering may need to be applied.

I like to think of these contributions as being in a “liminal zone” between the authoritative content that is completely owned and managed by the publisher and the stuff that exists out there on the Wild Wild Web which is only tangentially related (at best). There’s a zone of transition between the two spaces, where the data and the URIs start out being owned by the publisher then embraced, adopted (and even co-opted) by a community. A user may want to freely navigate between these different areas and apply their own rules about quality, reliability or general bozo filtering. And they can end up in a very different space to where they started. An organization may want to act quite differently; in terms of what and how much they fetch, and how they use what data they collect.

The following diagram attempts to sketch out this liminal zone from the perspective of the BBC.

Its the user annotations that annotate or relate to the BBC URIs that form the liminal zone between the authoritative publisher-sourced data, and the rest of the content on the web. You could put almost any organization into that central space and the same relationship would hold. Its the strong identifiers associated with Linked Data that connects up the internal and external views of the data.

I recently commissioned a project at Talis called Fanhu.bz which aims to help surface content and contributions that exist in this liminal zone. I see it as a first step towards exploring some of these subtle data sharing issues. Mapping out the fringes of Linked Data sets, as exemplified by BBC Programmes and Music, and then exploring how that data can be remixed and reused not only by the community but also by the publisher themselves, is an attempt to explore models for consuming Linked Data that goes beyond simple re-publishing and visualisation. The technology has a lot more to offer. And when we talk about “Linked Data for the Enterprise” I think we need to be thinking beyond just internal data integration.

English, Semantic Web

Adrian Dale looks forward to Online Information 2009

November 19th, 2009

online09 adrian-dale The twelve months that have elapsed since the previous Online Information Conference has seen an explosion in technologies that influence the information world and life in general.  What was being talked about as up coming trends last year, are now core to the agenda of this years conference.

Conference Chair, Adrian Dale, joins me in conversation to discuss these trends an to explore his hopes for the highlights of this years conference.

Technorati Tags: ,

English, Online Information, Podcast

SKOS Reference translated to Chinese

November 18th, 2009
范炜 (Fan Wei) has published a Simplified Chinese translations of the SKOS Reference document, under the title “SKOS 简单知识组织系统参考”.

Activity news, English

XKCD on the difference between academia and business

November 17th, 2009

Pellet 2.0 Release

November 16th, 2009

We’re happy to announce the release of Pellet 2.0, the first OWL 2 DL reasoner available commercially. Pellet 2.0 is available for use in open source projects under the AGPL v.3 license; for commercial usage, alternative license terms are available.

During the past 13 months, we closed 199 tickets as part of the 2.0 release candidate cycle, including numerous enhancements and bug fixes. Please see CHANGES.txt in the distribution for a complete change log; some highlights include:

  • full OWL 2 support (modulo a few bugs that will be fixed in the 2.1 release)
  • supports domain & range axioms, class expressions, qualified cardinality restrictions, literal constants, annotations, and nested class expressions in SPARQL queries
  • support for all SWRL builtins, including previously missing builtins (substring, tokenize, and optional precision parameters for roundHalfToEven)
  • optimized support for OWL 2 EL reasoning; OWL 2 EL reasoner is autoselected based on data input
  • supports automated ontology module extraction
  • supports incremental classification
  • supports fine-grained inference extraction
  • enhanced SWRL rules performance
  • OWLAPI v3 support
  • lots of improvements, cleanups to Pellet’s command line tools
  • updated to work with Jena 2.6.2 — Pellet is the only DL reasoner available from Jena
  • supports explanations via Jena
  • support autoselecting best SPARQL query engine based on input query
  • user-defined timeouts for reasoning
  • switch to dual license model to support commercial and open source projects

This release marks a change in Pellet development process: starting with 2.1, Pellet will be released according to a time-based development cycle. We will do four quarterly releases per year. We will make point releases between the quarterly releases, as necessary, to fix critical bugs only. Thus, the release schedule for the 2.x series will be 29 March 2010, 28 June 2010, 27 September 2010, 20 December 2010.

We believe this new development and release process will further accelerate the commercialization of Pellet, with no undue impact on its utility for either research or other non-commercial applications.

Finally, with the release of Pellet 2.0, we will no longer support previous versions via the pellet-users mailing list.

English, OWL 2, Pellet

Online educa 2009, Enrédate Alicante, Atictes, Edusol, más eventos 2009

November 15th, 2009

Tenía pendiente terminar la lista de eventos en los que estaré los próximos días:

top

Invitada por amigos twitteros como Francisco Páez y Miguel Sánchez de León, CEO de Perros Viejos Network, participaré en la mesa – taller número 7, sobre Redes sociales y empresa.

1. Las empresas en movimiento.
2. Internacionalización como estrategia de crecimiento de las PYMES.
3. Oportunidades de Negocio en Tecnologías Limpias.
4. Taller de Networking para emprendedores.
5. Tendencias del mercado.
6. La innovación en la oferta y el universo micro.
7. Cómo utilizar las redes sociales en la empresa de una manera efectiva.

PLENARIO: “Cómo reinventar la empresa”

Espero poder desvirtualizar a muchos amigos allí. Aprenderemos mucho, todos/as, seguro.

dc5q75gd_23x9hq3zhq_b

La revolució que suposa l’accés a Internet amb mobilitat: accés des dels mòbils, els mapes al mòbil, xarxes sense fils, vídeo, gps i les seves diferents opcions de futur. Aquest serà el tema central de les sisenes Jornades sobre Tecnologies de la Informació i la Comunicació “àtic 2a”, els dies 28 i 29 de novembre. El debat es realitzarà a un espai virtual, www.atic2.cat, suportat per tres seus físiques: Tortosa, Tarragona i Cornellà de Llobregat. El públic podrà intervenir des dels tres espais i també des d’Internet, on novament s’emetran en directe les ponències i debats.

Mi intervención, en mayor medida que otras, va a requerir de mi la exploración de nuevas fuentes: Web semántica móvil. Un desafío, sin duda, del que hay mucho que aprender y al que no dudaré en dar respuesta.


a_branding

Allí presentaré la esencia y las oportunidades de futuro del innovador programa de Comunidades de Práctica Compartim, impulsado desde la Administración de Justicia catalana, su Centro de Estudios Jurídicos y formación especializada y el responsable del programa, Jesús Martínez Marín.

Me encontraré allí con Irene Zurborn Fernández, de la Fundacion CEDDET, que moderará la mesa “Improving Efficiency and Effectiveness in Public Institutions”.

“In times of recession, it is critical that public institutions make the tax payers’ money go further and achieve more. This session highlights how public institutions can lead the way in practices that improve both the efficiency and impact of skills provision in the workplace.”

Hablamos ya del encuentro, así que allí os remito para ver la presentación (en inglés) que realizaré el 3 de diciembre en Berlín.


  • Por último, tiene lugar desde el 9 de Noviembre el Quinto encuentro en línea sobre educación, cultura y software libre, Edusol.

rotate.php

Invitada por Alejandro Miranda Díaz,  mi intervención será similar a la ya que presenté hace unos días en Virtual Educa 2009 sobre Open Social Learning. Quienes no pudisteis asistir podéis hacerlo desde este congreso online. Será el18 de Noviembre a las 18:30 hora española.

En fin…tengo por confirmar algún evento más, que os anunciaré en su momento. Espero, además de aportar ideas a cada uno de ellos, tener tiempo y fuerzas para poder trasladaros, desde aquí y desde Twitter, lo mejor de lo aprendido.

Compártelo



  • BarraPunto
  • del.icio.us
  • Tumblr
  • Twitter
  • Facebook
  • Google Bookmarks
  • Meneame
  • MisterWong
  • StumbleUpon
  • Technorati
  • LinkedIn
  • Wikio
  • Bitacoras.com
  • Diigo
  • FriendFeed
  • Netvibes
  • Ping.fm
  • Posterous
  • PDF
  • Print

2009, Anuncios generales, Aprendizaje, PLEs, Planeta educativo, Spanish, Web Semántica, colaboraciones, compartim, comunidades, conferencias, cops, dolors reig, e-learning2.0, educación 2.0, edupunk, empresa 2.0, eventos, futurismo, periodismo ciudadano, seminarios, social media, talleres, web 2.0, web semántica móvil, web3.0

Wikipedia infobox template coherence

November 15th, 2009

Wikipedia has an interesting RFC on approaches to achieve and maintain better coherence in its infobox templates. This is significant because Wikipedia is becoming the new CYC — a broad, practical KB filled with general purpose background knowledge. The RFC was kicked off by discussions on dbpedia template annotations. The RFC defines the problem as:

“Wikipedia uses hundreds of infobox templates for describing various entity types like NFL teams, schools in Canada, train stations etc. These infoboxes are separated and do not use a common vocabulary. Several different spellings of attributes are used for them, which all stand for the same meaning (e.g. birth_place, birthPlace, origin). This poses limitations to checking consistency within Wikipedia infoboxes, amongst different language editions, and it makes it hard for external tools to reuse the information in infoboxes.”

The goals mentioned in the RFC include (1) establishing the currently missing links between synonymous template attributes, (2) enabling authors to use template annotations to check for for factual inconsistencies (e.g., outdated population figures), and (3) providing consensus about which properties should be used in templates and what data they should contain.

English, KR, Semantic Web, Wikipedia, cyc, dbpedia, social media

Exposing Government Data at the ISWC

November 13th, 2009
One of the goals of the organizers of the International Semantic Web Conference this year was to expose government information managers and contractors to the Semantic Web. To some extent this succeeded; there were a number of attendees who came to the conference from government agencies. On the other hand, it failed; many of them were already Semantic Web enthusiasts, so there was some element of preaching to the choir.

I felt that my own major session, a tutorial on building semantic web applications for government data, was very successful in this regard. I was originally disappointed with the registration, but discovered on the day that most people signed up for 'tutorials' in general, then attended whatever they liked. The room made it to SRO before the first coffee break, and many of the people there were from government agencies or contractors, as desired (many were from other places, but I'm not going to complain about that!).

Probably the biggest measure of success for this goal was the exposure in the Government Computer News. Senior Technology Editor Joab Jackson seemed to like my elevator pitch about the Semantic Web (though it only works for veerrryyy slloooww elevators) enough to repeat it in one of his articles about the event. In another article, Joab told me something I didn't know - that the report we generated in the tutorial is actually interesting to government IT managers, and would somewhat labor intensive without linked open government data.

GCN's 13 resources seems like an intentional flaunting of superstition, since one could easily come up with many more. I am flattered that one of my own pages made it to the list; many of the omissions  are available there, and include US Gov XML and OEgov.

All in all (thanks to a great extent to Mr. Jackson's efforts), I think we managed to achieve some exposure for semantic web technology for government information managers.

Goverment

EU approves law requiring user consent for Web cookies

November 13th, 2009

This ought to be fun.

According to an article in the WSJ, Europe Approves New Cookie Law, “the Council of the European Union has approved new legislation that would require Web users to consent to Internet cookies..”

The law could have broad repercussions for online ads. “Almost every site that carries advertising should be seeking its visitors’ consent to the serving of cookies,” wrote Struan Robertson, a lawyer specializing in technology at Pinsent Masons and editor of Out-Law.com. “It also catches sites that count visitors — so if your site uses Google Analytics or WebTrends, you’re caught.”

This hit slashdot (“Breathtakingly Stupid” EU Cookie Law Passes) this morning.

By the way, our ebiquity site uses cookies. Send mail to no-more-ebiquity-cookies at cs.umbc.edu if you want to opt out.

Hmmmm. I wonder how we would implement cookie opt-out. I think setting a cookie to indicate that the user has opted out of your site’s cookies would be a good approach.

English, Web, privacy, social media

Lyndon Nixon (STI): “Clear guidelines on how to best make use of Linked Open Data by enterprises is needed”

November 13th, 2009

The European Semantic Technology Conference 2009 will take place in Vienna at the beginning of December 2009. Andreas Blumauer (Semantic Web Company) talked with Lyndon Nixon who is the program advisor of this conference:

estc09_logo

SWC: In its own saying the European Semantic Technology Conference brings together the smartest minds in Semantic Technologies. What will be the highlights of this year’s conference?

There are many highlights this year! We have a full program of presentations, workshops, panels and a keynote by Susie Stephens from Johnson&Johnson. On top of this, the first day will see the first ever ESTC Innovation Seed Camp, where enterpreneurs and young start ups are invited to pitch their ideas to a panel of venture capitalists and there will be cash prizes! Besides the main program, an open demo space will continually offer new showcases of semantic technologies and products, while a networking zone gives attendees a relaxed space to make business away from the conference hectic. We will also be holding matchmaking sessions, where attendees can schedule one-to-one meetings with other attendees organized by a handy online tool. Finally, in line with ESTC’s focus on semantics and innovation – during the two days we will give participants the chance to check out two innovative conference tools: an electronic vCard exchange and the “Web Comparator”. So, too much to explain, but you can get full information on every aspect of the conference at its webpage www.estc2009.com.

SWC: How would you describe the state of the art in semantic technology business especially in Europe?

We are at a very exciting period in the enterprise uptake of semantic technologies, which can be seen in the growth in attendance at events such as ESTC. Semantic technologies are finally maturing and can be used away from toy examples in real, critical business processes. Technologies are being standardized and tools aligned to those standards, while progress in being made in supporting the sorts of extensions that businesses need (see OWL 2, or SPARQL 1.1). This year, the case studies and the business applications that will be presented are going to reflect that. We are still inside the early adopter phase in semantic uptake, with the critical mass of companies still checking out semantics at a research and prototyping level. However, the balance is shifting and enabling the technology transfer to real business projects is key; ESTC’s focus on direct contact between the technology vendors and the business clients – the networking zone, the open demo space, or the matchmaking sessions – is a reflection of the importance of an event such as ESTC to bring these two groups together.

SWC: One of the big issues at the moment is Linking Open Data. How do you perceive this development and how can you start?

 

Yes, Linked Open Data is an interesting development, making a significant amount of semantic data about a broad subject range available to everyone. I think it has real value in the research community where large data sources have been needed. For industry, I would say its value is less straight-forward: the data is not always so clean and care needs to be taken before building business applications on top of it. Clear guidelines on how to best make use of Linked Open Data by enterprises is needed. ESTC picks up on this in its program this year: we will have an expert panel precisely on this subject! Of course, leveraging the Linked Data Cloud in the enterprise is already a topic for many organisations – one of our paper sessions is on Linked Data and I am sure there will be plenty mentions of it elsewhere during the conference!

SWC: In recent years large IT-companies and system integrators have rather been playing around with semantic web technologies than identifying the semantic web
 as a market opportunity. Do you think that this situation has changed already?

I think a lot of these companies are being cautious – the Semantic Web was so hyped to industry in its first years that a certain level of cynicism grew. Now, that we have in my opinion very real and valuable tools and technologies built on semantics, the companies are being careful in how to present this to the market. There are already many very encouraging examples of semantics making inroads in key markets where data heterogeneity, integration, and management have become key issues: I think Health Care and Life Sciences is simply the market which is being most open about it (the ESTC keynote speaker Susie Stephens will also report on semantic technologies in this market). There are further examples we just don’t know about, because companies don’t want to let their competitors know, or mention that it is semantics which is being used.

SWC: Please add another statement which is important for you!

ESTC 2009 will be *the* meeting place in Europe this year for semantic technology vendors and users – don’t miss it!

Conferences & Events, ESTC, English, Vienna, estc09

XKCD on elections and voting

November 12th, 2009

A review of the Google Go programming language

November 11th, 2009

Mark Chu-Carroll is a Google software engineer who’s written a long, detailed and informed review of Google’s new programming language Go. It’s worth a read if you are interested in understanding what it’s like as a programming language. Here’s a few points that I took note of.

    “The guys who designed Go were very focused on keeping things as small and simple as possible. When you look at it in contrast to a language like C++, it’s absolutely striking. Go is very small, and very simple. There’s no cruft. No redundancy. Everything has been pared down. But for the most part, they give you what you need. If you want a C-like language with some basic object-oriented features and garbage collection, Go is about as simple as you could realistically hope to get.”

    “The most innovative thing about it is its type system. … It ends up giving you something with the flavor of Python-ish duck typing, but with full type-checking from the compiler.”

    “Go programs compile really astonishingly quickly. When I first tried it, I thought that I had made a mistake building the compiler. It was just too damned fast. I’d never seen anything quite like it.”

    “At the end of the day, what do I think? I like Go, but I don’t love it. If it had generics, it would definitely be my favorite of the C/C++/C#/Java family. It’s got a very elegant simplicity to it which I really like. The interface type system is wonderful. The overall structure of programs and modules is excellent. But it’s got some ugliness. … It’s not going to wipe C++ off the face of the earth. But I think it will establish itself as a solid alternative.”

Go sounds like a language that will help you grow as a computer scientist if you use it. That’s a good enough recommendation for me.

English, Google, Programming, go, programming languages

CFP: JWS special issue on semantic search

November 11th, 2009

Yong Yu and Rudi Studer are editing a special issue of the Journal of Web Semantics on semantic search that will appear in the summer 2010. The special issue will cover interdisciplinary topics between Semantic Web and search. See the call for papers for a list of relevant topics and details on how to submit papers, which are due by 20 January 2010

AI, English, Semantic Web, search

Google VP on semantic search and the Semantic Web

November 11th, 2009

PCWorld has a story, Google VP Mayer Describes the Perfect Search Engine, with some interesting comments on semantic search from Marissa Mayer, Google’s vice president of Search Products & User Experience.

“IDGNS: What’s the status of semantic search at Google? You have said in the past that through “brute force” — analyzing massive amounts of queries and Web content — Google’s engine can deliver results that make it seem as if it understood things semantically, when it really functions using other algorithmic approaches. Is that still the preferred approach?

Mayer: We believe in building intelligent systems that learn off of data in an automated way, [and then] tuning and refining them. When people talk about semantic search and the semantic Web, they usually mean something that is very manual, with maps of various associations between words and things like that. We think you can get to a much better level of understanding through pattern-matching data, building large-scale systems. That’s how the brain works. That’s why you have all these fuzzy connections, because the brain is constantly processing lots and lots of data all the time.

IDGNS: A couple of years ago or so, some experts were predicting that semantic technology would revolutionize search and blindside Google, but that hasn’t happened. It seems that semantic search efforts have hit a wall, especially because semantic engines are hard to scale.

Mayer: The problem is that language changes. Web pages change. How people express themselves changes. And all those things matter in terms of how well semantic search applies. That’s why it’s better to have an approach that’s based on machine learning and that changes, iterates and responds to the data. That’s a more robust approach. That’s not to say that semantic search has no part in search. It’s just that for us, we really prefer to focus on things that can scale. If we could come up with a semantic search solution that could scale, we would be very excited about that. For now, what we’re seeing is that a lot of our methods approximate the intelligence of semantic search but do it through other means.”

I interpret these comments to mean that Google’s management still views the concept of semantic search (and the Semantic Web) as involving better understanding of the intended meaning of text in documents and queries. The W3C’s web of data model is still not on their radar.

AI, English, Google, Semantic Web, nlp, search

Can cloud computing be entirely trusted?

November 10th, 2009

The Economist has been running a series of online Oxford Union style debates on topical issues — CEO pay, healthcare, climate change, etc. The latest one is on the cloud computing: This house believes that the cloud can’t be entirely trusted.

In his opening remarks, moderator Ludwig Siegele says

“The participants in this debate, including the three guest speakers, all agree that computing is moving into the cloud. “We are experiencing a disruptive moment in the history of technology, with the expansion of the role of the internet and the advent of cloud-based computing”, says Stephen Elop, president of Microsoft’s business division, which generates about a third of the firm’s revenues ($13 billion) and more than half of its profits ($4.5 billion) in the most recent quarter. Marc Benioff, chief executive of Salesforce.com, the world’s largest SaaS provider with over $1.2 billion in sales in the past 12 months, is no less bullish: ‘Like the shift [from the mainframe to the client/server architecture] that roiled our industry in decades past, the transition to cloud computing is happening now because of major discontinuities in cost, value and function.’”

While the debate’s proposition suggests that security or privacy is its focus, it’s really a broader argument about how software services will be delivered in the future in which security is just one aspect.

“Whether and to what extent companies and consumers elect to hand their computing over to others, of course, depends on how much they trust the cloud. And customers still have many questions. How reliable are such services? What about privacy? Don’t I lose too much control? What if Salesforce.com, for instance, changes its service in a way I do not like? Are such web-based services really cheaper than traditional software? And how easy is it to get my data if I want to change providers? Are there open technical standards that would make this easier?”

English, High performance computing, Security, Semantic Web, cloud computing, privacy, services

Application of semantic technologies in Internet on 2020 (II): education

November 10th, 2009

Yesterday I was reading El caparazón, one of the most relevant blogs about semantic applications in Spanish language, when I found this post about “Education and Web 2.0”. Undoubtedly this is an area where semantic technologies will have an important say in the future.

Some experts state that most of the knowledge that an elementary school student will need to perform his job when he will grow up, don’t exist yet. How can we focus the education in so a rapidly changing environment?. Certainly knowledge is advancing so quickly that it is an almost impossible task trying to keep the pace. This raises a fundamental change in the education approach, as its key task will be to transmit information to the students, to help them to manage all this information, to help them to distinguish useful one from useless, to help them to extract knowledge from all this information… This is, learning how to learn.
No doubt Internet will change education approach at all levels. No longer students will go to university to pick up some notes, or to listen a one way explanation in which the teacher talks and students listen. Because to access to information we have Google, and to hear lectures, we can easily access to those of the outstanding experts in each subject.
Any country that wants to maintain a high level in the knowledge society must be capable of integrating technologies within education systems at all levels. We’re going to be bombarded along all our lives with millions and millions of information bytes, this is a real fact we must live with. In this situation it will be paramount to extract useful knowledge from this information, indeed this will mark the difference among efficient and no efficient people. In this environment semantic technologies will play an important role, because they will help us to navigate through information and to adapt it to our needs, that is to contextualize it. Nowadays, semantic technologies have got an important level of madurity and standards as RDF, or OWL will help us to give the jump form a “textual” management of information to a “concept” treatment of this information. This is a first step and a very important achievement. However, some years will be required to settle these concepts, and to develop technologies allowing us to extract knowledge from all the information around us: this means tools to show us to learn.

Education, English, Spanish, Uncategorized, Web 3.0, semantic technologies, virtual education

Demozone for semantic applications launched

November 10th, 2009

The Semantic Web Company compiled a suite of some of the best semantic web applications and put them in one place for you to try out: The SWC Demozone.

swc demozone logo

We selected tools pertaining to the different application areas of the Semantic Web – be it for finding, creating, linking and/or publishing information.

The showcased applications and services so far are:

Have a look at the demos and try them out for yourself – we provided explanations and links to screencasts teaching you how to use them.

We will add more demos in the future. If you are the owner of or a contributer to an application that you’d like to see showcased in the demozone, too, please drop us a line and we’ll try to add a demo for your software.

English, Tools & Software, Videos & Tutorials, semantic web applications

Survival with the fittest: the story of a Google library partner

November 9th, 2009

Universidad ComplutenseI hadn’t previously come across any of Google’s library partners, so it was great to listen to the experiences of Manuela Palafox from the Universidad Complutense in Madrid, Spain at the Eurolis seminar Doom or bloom: reinventing the library in the digital age. Complutense originally signed its digitisation agreement with Google back in 2006, and was the first non-Anglo-Saxon (her words) library to join the programme.

Based on books in the public domain, the agreement enables Complutense to offer universal free of charge full-text access to a large number of books. So I, for example, a former student from a Spanish university, can now explore a rich vein of Cervantes books without having to endure the punishing euro-sterling exchange rate.

In digitising Complutense’s public domain books, Google assumed all the costs of digitisation and transportation. Google also created an interface, something they do for all their library partners. In return, Complutense selected and provided the books, as well as technical staff.  The overarching aim was to offer access to the university’s library heritage. It was also perceived as an important part of selling the Spanish language abroad – providing access to the vast number of Spanish speakers in the world.

The process started with an analysis of the collection to determine how many books were out of copyright. They then catalogued 70,000 books and established selection criteria – publication year and physical condition – and formulated workflows and logistics for digitisation. Using PDAs, for example, the selection team stored details of the physical condition of books against the book barcode.

As a result of this herculean effort, thousands of Complutense’s digitised books are already accessible in Google Books. It’s possible to navigate directly to the full text from the catalogue record. There are also links enabling users to buy the book. This is truly how to extract optimal value from materials that were formerly languishing in the library. And even in the short time that they’ve been available, 34% of the materials have already been used.

Google logoMeanwhile, Jason Hanley, one of Google’s partner managers who spoke immediately after Manuela, seemed anxious to dispel a number of myths about Google and its work with libraries. On the predominance of English language materials, he pointed out that of all Google’s library partners, 8 are outside the US – 2 being in Japan and the rest, such as Universidad Complutense, in Europe. He also believed the predominance of language, linguistics and literature over STEM subjects to be surprising – I’m not sure why.

The question and answer session at the end, involving both Manuela Palafox and Jason Hanley, may have inadvertently answered the question of Google’s motives in this. It’s not the library world that should be afraid of Google – it’s the competing search engines. Google’s longstanding mission – to organise the world’s information and make it universally accessible and useful – has clear benefits not only to library partners such as Universidad Complutense, but to the library world as a whole, and to bibliophiles like me. But Google will be imposing limits on the availability of digitised materials for indexing by other search engines for a certain (undefined in this session) period of time, although Hanley denied that Google was trying to be exclusive (which came across as being more than slightly defensive).

The session was a clear window into the aims and experiences of a library partner, and maybe into Google’s motives as well… As one speaker from the floor noted, what are the chances of any other search engine being able to compete fully with Google in the foreseeable future?.

English, Google Book Settlement, Libraries, digitisation, eurolis