Archive

Archive for July, 2009

I-Semantics 2009 – Call for Participation

July 30th, 2009

isemantics_logo** About 250 participants have already registered. Please note that the deadline for early registration will end tomorrow, July 31. **

From September 2 – 4, 2009 I-SEMANTICS brings together LOD aficionados in Graz/Austria. Chris Bizer (FU Berlin) will give an invited talk on “The Emerging Web of Linked Data” and the winners of the Triplification Challenge ‘09 will be honoured.

Submissions to the Triplification Challenge ‘09 are still possible till August 9, 2009. The descriptions should be submitted electronically via email to Michael Hausenblas with the subject Triplification Challenge.

Further highlights of the conference will be …

Keynote presentations from

  • Paolo Traverso (FBK, Italy)
  • Eric Tsui (The Hong Kong Polytechnic University, China)
  • Peter Kropsch (Austrian Press Agency, Austria)

Pragmatic Web Track

We are delighted to host the 4th AIS SigPrag International Pragmatic Web Conference Track at I-SEMANTICS. The special track  is centered around the study of “pragmatics” in the Semantic Web. It draws attention to how communicative actions with a pragmatic context are performed via Web media and illuminates how mutual understanding and commitments to actions can evolve in conversations. For additional information see http://www.pragmaticweb.info/

International Cooperation Event

Due to the huge success in the last two years the Styrian Economy Funding Agency (SFG) in cooperation with the Enterprise Europe Network (EEN) again decided to organize an International Cooperation Event. You can sign up for the International Cooperation Event free of charge by registering on the official cooperation event website.

I-KNOW ‘09 – International Conference on Knowledge Management and Knowledge Technologies

I-SEMANTICS ‘09 will be held concurrently with I-KNOW ‘09. This special concept aims at bridging the gaps between the various communities and their technology fields. The overall program includes about 90 scientific presentations from all over the world. A German-speaking industry track offers about 30 industry presentations and an exhibition.

We look forward to welcoming you to Graz in September 2009!

Calls & Competitions, Conferences & Events, English, Linked Data & Open Data, i-semantics i-know lod

JISC Grasp the Marc Record Re-use Legality Nettle

July 29th, 2009

The JISC Information Environment Team have just announced a study to explore the legal and ownership implications of making catalogue records available to others when this involves copying, transferring them into different formats.

The JISC has just commissioned a study to explore some of these issues as they apply to UK university libraries and to provide practical guidance to library managers who may be interested in making their catalogue records available in new ways. Outcomes are expected by the end of 2009.

The specific objectives of the study are to:
•    Establish the provenance of records in the catalogues of a small but representative sample of UK university libraries and in the national Copac and SUNCAT catalogues;
•     Identify any rights or licences applying to the records and assess how these apply to re-use in the Web environment. This work should include clarifying the legal status of MARC records and copies of MARC records, and the legal implications of translating records between different formats such as MARC and MODS XML;
•     Provide practical guidance to UK university libraries about the legal issues to be considered in making catalogue records available for re-use in Web applications such as social networking sites – drawing on the findings from the sample;
•     Make recommendations to the JISC and the UK higher education community about any initiatives which could usefully be undertaken to facilitate the re-use of catalogue records in Web applications in a way which respects legal rights and business interests.

The core nugget of this being clarifying the legal status of MARC records and copies of MARC records.  Without establishing that anything else would be building castles on sand.

One of the many things that was never fully clarified in the OCLC record re-use saga earlier in the year was the legal status of a Marc record – can it, or parts of it, be considered as a creative work and therefore be applicable for copyright and a concept of ownership.

I wish whoever is undertaking the JISC study (the announcement does not indicate any study group members) well as they set foot in to this minefield of assumption, traditional practice, legal interpretation, and commercial interest and bias.  Let’s hope they do a thorough job and carry enough weight from legal, library, and publishing backgrounds to deliver advice and opinion that will clarify these particularly murky waters well beyond the UK University sector.

Copyright, English, JISC, Libraries, Licensing, Open Data

Governance with TopBraid Ensemble

July 28th, 2009

For those of us who have been doing Knowledge Representation for decades, we judge a modeling tool on its power: How many whiz-bang shortcuts for complex OWL restrictions or mass editing of similar items or refactoring does it have? But when we try to get Modeling to the Masses, or at least Modeling in the Enterprise, we find that it isn't the power tools that they are interested in. Enterprise knowledge workers will prefer pretty simple model editing tools. But they insist that they have strict control over version governance.

What exactly is version governance? Often the people who want it aren't quite sure, but they know it when it isn't there. Someone makes a change to a part of a model on someone else's turf. Someone wants to try out a long-transaction 'better idea' to see how it works - but we want to be able to toss it later on if we don't like it. Or we find something wrong in a category - who changed it? When? What was the model like when they changed it?

Some of this stuff comes for free when you use a version control system like SVN or CVS. But these solutions, which are great for managing versions of java code, aren't intuitive to a team that is organizing, say, a vocabulary project. They want something a bit finer grained (who changed this term?) and with a bit of process ("I can propose a change, but only John can approve it").

That's why the biggest part of TopQuadrant's Enterprise Vocabulary Management System (EVMS) is a system for collaborating on model changes. You don't just use the EVMS to change a vocabulary; you use it to build a sandbox in which you make your changes. The changes then enter a (configurable) workflow, where, if they get approved, they are committed to the shared version. If not, well, then they aren't.

Now, that's pretty cool. After all, it lets teams collaborate on their vocabulary management, lets them manage territory on a term-by-term basis, and even provides a process for moving the changes along. But the thing that I find most cool about this is that it was all built using the TopBraid Ensemble assembly platform.

You see, I never got the hang of coding Java, and I'm not really a programmer. But I like making systems do what I want them to do, so I am a big scripter. The entire EVMS collaboration control system is written as a TopBraid Ensemble application.

What does this mean? It means a lot of things, but for this project it means that when I was talking to a colleague about how to display the changes that had been made to a vocabulary. He said, "to my mind, I want to be able to click on a term, and see all the people who have changed it, and why!" Well, all that information is modeled in the system - it is just a matter of querying it out with SPARQL.

Governance

In the figure, we see the final step of this. We are looking at a fragment of the NCI Thesaurus regarding Organisms. The change log shows a rather silly argument over what we should call lab mice by two of the taxonomists. Every change was made through the EVMS, so we can track back the whole story about each term. Adding this to the system was as easy as writing a SPARQL query and wiring it up to the display components (a grid in the upper-right and a form in the lower-right) so that the changes relevant to a chosen term would be shown.

Uncategorized

Knowledge Management and the Semantic Web

July 28th, 2009

That’s the title of my diploma thesis and first of all, thanks to SWC for the possibility to say some words about it. My interest in knowledge management reaches back some time now and I decided to make it the subject of my diploma thesis in my first attempt to write one back in 2001. The semantic web “came to me” in the last one or two years and the TRIPLE-I conference last year was somehow the trigger for me to connect the two topics.

My basic idea was very simple. When you read about the Semantic Web you are confronted right away with connections to creating knowledge and knowledge management. But in my understanding the Semantic Web is a technical thing and knowledge management is primarily a cultural and organisational thing. So the research questions for my thesis where:

  • What relevance do knowledge management and semantic technologies have in the daily work of people working in knowledge intensive domains?
  • Which possibilities lie in the adoption of knowledge management and semantic technologies?
  • Are semantic technologies already fit for practical use?

The basis of the empirical part of my thesis are group discussions held in different organisations. As a result I developed starting points for an understanding of the topics “Knowledge Management” and “Semantic Web” and their relevance in organisations. The empirical results, in short, provide the following answers to the research questions:

  • The “theoretical relevance” of both topics is high, the “practical relevance” on the other hand is rather low. Neither do structured concepts for knowledge management exist in the studied organisations, nor are there attempts at using semantic technologies
  • Most of the participants have not heard of the “semantic web” prior to the discussions. After having been introduced to the topic, the relevance of the semantic web and of semantic technologies is rated high
  • Possibilities are seen in a better management of information or knowledge in organisations and, especially for semantic technologies, in the improvement of search functionality’s and search results
  • Semantic technologies are not yet seen as fit for practical use
  • The connection between knowledge management and semantic web is taken as a fact without giving any justification for it.

In my conclusion I tried to match my results with the results of the Semantic Web Barometer 2009 and it was very interesting for me, that there were several similarities. I also found that talking to the people that have to work with technologies that are developed for them can be quite interesting and that group discussion are a great way to do that.

I wrote most parts of my diploma thesis in a wiki (and the rest is available as PDF) so you can find it on my wiki.

Your comments and annotations are very welcome!

Thanks for reading as far as this, Helmut

Reblog this post [with Zemanta]

English, Knowledge Management, Knowledge Representation, Literature & Publications, Semantic Web

New W3C Rule Interchange Format (W3C RIF) standard published

July 28th, 2009

The W3C Working Group working on W3C Rule Interchange Format (RIF) has recently launched a new standard for the interchange of rules. Some guys from the Coporate Semantic Web Working Group of Freie Universität Berlin have been heavily involved. An interview on the practical aspects of RIF will follow in August.

Reblog this post [with Zemanta]

Corporate Semantic Web, English, Free University of Berlin, RIF, Rule Interchange Format, Vocabularies & Languages

Who won the Netflix Prize? Ensemble or BellKors Pragmatic Chaos?

July 27th, 2009

Who won the Netflix Prize? Ensemble or BellKors Pragmatic Chaos

Who won the Netflix Prize? According to a post in the NYT Bits blog, Netflix Challenge Ends, But Winner Is In Doubt, it’s still very much up in the air.

” So The Ensemble won, right? Not necessarily. In an e-mail message Sunday night, Chris Volinsky, a scientist at AT&T Research and a leader of the BellKor’s team, said: “Our team is in first place as we were contacted by Netflix to validate our entry.” And in an online forum, another member of the BellKor team, Yehuda Koren, a researcher for Yahoo in Israel, said his team had “a better Test score than The Ensemble,” despite what the rival team submitted for the leaderboard.

So is BellKor the winner? Certainly not yet, according to a Netflix spokesman, Steve Swasey. “There is no winner,” he said.

A winner, Mr. Swasey said, will probably not be announced until sometime in September at an event hosted by Reed Hastings, Netflix’s chief executive. The movie rental company is not holding off for maximum P.R. effect, Mr. Swasey said, but because the winner has not yet been determined.

The Web leaderboard, he explained, is based on what the teams submit. Next, Netflix’s in-house researchers and outside experts have to validate the teams’ submissions, poring over the submitted code, design documents and other materials. “This is really complex stuff,” Mr. Swasey said.

A leading member of The Ensemble, Domonkos Tikk, a Hungarian computer scientist, did not sound too hopeful. “We didn’t get any notification from Netflix,” Mr. Tikk said in a phone interview from Hungary. “So I think the chances that we won are very slight. It was a nice try.”

It seems strange that Netflix called the Bellkor team first, since according to the Leaderboard the Ensemble team submitted the top entry.

UPDATE 2/28: Today’s NYT has a good article on the Netflix Prize and the role of teamwork for developing machine learning systems, Netflix Competitors Learn the Power of Teamwork.

English

OLE – $5.2m to get from Diagrams to an ILS Replacement in two Years

July 27th, 2009

The OLE Project I’m currently reading my way through the final draft of the OLE Project Final Report.  The one year Mellon Funded Open Library Environment (OLE) project which “convened a multi-national group of libraries to analyze library business processes and to define a next-generation library technology platform

the project planners produced an OLE design framework that embeds libraries directly in the key processes of scholarship generation, knowledge management, teaching and learning by utilizing existing enterprise systems where appropriate and by delivering new services built on connections between the library’s business systems and other technology systems.

We at Talis, along with some 200 other organisations, participated in the process by feeding back our experiences in implementing live integrations between Library Management Systems and other institutional entities that the report authors recognise as being key to delivering a seamless workflow.  Our experience indicated that successful integration between systems is as much to do with local departmental motivations, understanding, and politics as it is to do with technology. This was discussed in more depth on the March Library 2.0 Gang Show with Tim McGeary from the OLE project and Talis’ Andy Latham  were guests.

The body of the report consists of many process model diagrams, describing the required interactions between library and other processes/components, which when brought together will enable the construction of library associated workflows for the next-generation library service that will utilise this next-generation library technology platform.

This first year project is in it’s own terms a success “The OLE Project met all of its objectives and was completed on time and within budget”.  One cannot deny the thought, effort, commitment and enthusiasm that has gone in to the production of this report.   Without rerunning the analysis they undertook, it would be difficult to criticise the model they have described.  The proof of the pudding of course will come in the next phase, when they move on from describing a new technology platform to start building it.

The planning phase of this project is complete. The next steps are to identify a group of build partners to provide investment funds and to develop and test the initial software. A build partner  can be an individual library, a consortium or a vendor.

The total partnership cost of the OLE Project over two years is projected to be $5.2 million, a figure that includes all programming effort as well as project management and quality assurance staffing. In addition to OLE Project costs, costs of participation would include some local staff, governance and travel funding. Project partners intend to contribute half of the OLE partnership costs and seek the other half from The Andrew W. Mellon Foundation.

ole diag Viewing the process diagrams in the report takes me back to 1990, in a snow covered hut in the grounds of the University of Birmingham.  I shared that hut for several weeks with Talis (then BLCMP) staff and a group of folks from a Dutch library system vendor (long since subsumed in to the OCLC global organisation) with the objective of designing the next-generation library technology platform.  Several years, and a few £ million in investment, led to the development a very successful library system from which the current Talis Library System, Alto, has since evolved.

There are many parallels between that 1990 development process and the road that OLE are about to embark upon, if their bids for continued funding are successful.  Not only that BLCMP was a library cooperative during that period, but also that we had the luxury of being able to step back from previous systems and start with a clean set of library process requirements. 

I wish the OLE project continued success.  Whatever achieved, I believe the exercise they are undertaking is massively valuable to the whole library domain. 

Will they be able to translate their clean [uncluttered by interaction issues with systems over which they have little influence, or uncoloured by local institutional inter-departmental politics, and ‘traditional practices] diagrams in to an installable, manageable, collection of components suitable to deliver format agnostic library services? – possibly.  Will they be able to do it in 2 years for a mere $5.2? – Experience tells me to be a little more sceptical on that last point.

English, Service Oriented Architecture, Systems and technologies, Talis, open source

A New Commercial Ontology from hakia

July 27th, 2009

Perhaps the world’s first, we are proud to announce our upcoming Commercial Ontology (CO). What is a commercial ontology? If you asked this question you have just touched on an important distinction: fantasy versus reality. In the context of World Wide Web, the CO is the realistic version of an ontology for the reasons explained below.

The Realities of the Web

We have accomplished two important innovations in building the CO. First, the development of concepts and lexicons followed a strict guideline of the realities of Web operations. What were these realities? Most of the search queries on the Web reflect a single dimension of intent, almost exclusively relevant to commercial topics. Note that the interpretation of “commercial topics” must be taken in the broadest sense possible. For example, if you were looking for “the benefits of foot massage” or “the director of the movie Last Emperor” your queries fall into the same commercial pattern. One particular distinction of the commercial pattern is that they come in short packages including a name (onomasticon), or always referring to something sold, bought, watched, heard, etc.

In contrast, many ontologies (if not all) that have been built to date, or claimed so, are focused on the use of language in the general sense, but not in the sense of commercial patterns on the Web. Therefore, their usefulness when tackling the Web search queries is greatly compromised, sometimes to the point of absolute failure. If such an ontology could disambiguate a dozen of different senses of the word “kill”, it would be sad news that the last 100,000 queries in the search logs did not include a single occurrence of the word “kill”. Like drowning in 2 inches-deep water, such ontologies will not utilize their disambiguation skills nearly 80% of the queries because the queries include nothing but onomasticons and/or they are too short (under-articulated).

The Sequence Approach

The second innovation used in the CO is the use of sequences instead of single words. A single word, like “kill”, is the most ambiguous state of information and is hardly used in human communication without a strong underlying/implied context. As a result, building a natural language processing (NLP) systems by taking single word as the unit of computation is an invitation for disaster.

In contrast, word sequences (2 or more words) are inherently safe and highly descriptive. Take “road kill”, for example. This sequence describes a corpse of an animal killed on the road by a passing vehicle. If a language processing system takes the sequences as unit of computation, 99% of the ambiguity problem vanishes. There is no need to process the word “kill” and “road” separately, trace their senses, and locate convergence to identify the meaning of “road kill” if you can just take the sequence “road kill” itself as your unit of computation for mapping. This is depicted below:

road kill

Note the number of traces required in a conventional ontology approach compared to the sequence approach. The sequence approach requires a lot of data storage space (which is dirt cheap) whereas the conventional ontology approach requires a lot of CPU for a simple mapping task (which is expensive). But the bad news does not stop there. The trace routes in conventional ontology requires manual work (impossible to automate) whereas sequence-based ontology can be easily built via automation.

I realize only a handful of people will understand the second point above. Nevertheless, the scalability and performance of the end product will speak for itself when we put the testing platform on-line.

Usage of the Commercial Ontology

The immediate use of the CO is related to search queries, or document characterizations, that are not tied to any advertising in conventional systems. This unrecognized domain of search queries and characterizations means loss of revenue. hakia’s CO is designed to fill this gap. For example, if the search query or page characterization is “beat generation” the CO can map it to “literature” on the fly. As a result, systems using the CO will have much deeper understanding of the incoming terms, thus will be able to recognize the underlying intent beyond the face value of the words. The same capability can be used in a number of places other than advertising with the same effect.

Stay tuned for the release of the first version of our commercial ontology.

English, News, Technology

Netflix Prize contest closes; Ensemble wins

July 26th, 2009

Netflix has announced that the Netflix Prize contest is now closed. Presumably, The Ensemble is the winner, subject to final qualification.

“We are delighted to report that, after almost three years and more than 43,000 entries from over 5,100 teams in over 185 countries, the Netflix Prize Contest stopped accepting entries on 2009-07-26 18:42:37 UTC. The closing of the contest is in accordance with the Rules — thirty (30) days after a submitted prediction set achieved the Grand Prize qualifying RMSE on the quiz subset.

Qualified entries will be evaluated as described in the Rules. We look forward to awarding the Grand Prize, which we expect to announce in a few weeks. However if a Grand Prize cannot be awarded because no submission can be verified by the judges, the Contest will reopen. We will make an announcement on the Forum after the Contest judges reach a decision.”

So what’s left for the judges to do. The rules say that “a panel of senior Netflix engineers and qualified independent judges” need to “ensure that the provided algorithm description and source code could reasonably have generated the prediction sets submitted”. To do this, the candidate winner must produce the algorithm along with a description of who it works. And, of course, before receiving the prize the winner has to grant Netflix

“an irrevocable, royalty free, fully paid up, worldwide non-exclusive license under the Participants’ copyrights, patents or other intellectual property rights in the winning algorithm (”Winning Algorithm”) to reproduce, distribute, display, and create derivative works from the Winning Algorithm and also to make, have made, use, sell, offer for sale, and import products that would otherwise infringe the Winning Algorithm.”

The Netflix Prize was a great idea and generated a lot of interest around the world. It’s been good for the field of AI and its machine learning sub-field, especially. Congratulations to the Ensemble team and condolences to BellKor’s Pragmatic Chaos. I wish there could have been two winners.

UPDATE 2/27: Wait! The winner is still in doubt.

English

Ensemble leads Netflix Prize contest, besting BellKors Pragmatic Chaos

July 26th, 2009

The race for the Netflix Prize is still on.

With just one day left in the 30 day last call period before BellKor’s Pragmatic Chaos (BKPC) was awarded the $1M Netflix Prize for a better movie recommender system, another team has broken the 10% improvement threshold and taken the lead by one hundredth of one percent — The Ensemble.

The Ensemble was formed by the merger of two existing Netflix Prize teams that had been ranked second and third behind BKPC: ‘Grand Prize Team’ and ‘Opera Solutions and Vandelay United’. Here’s how The Ensemble describes it’s genesis.

The crowd is indeed wiser than the individual.

The 10% barrier once seemed distant and insurmountable. But when the contest’s “last call” heralded the heroic achievements of BellKor’s Pragmatic Chaos, the rest of the crowd pondered, and asked why the barrier couldn’t be broken twice.

And lo, as if powered by gravity, Grand Prize Team and Vandelay Industries! began to draw in more and more members. And Vandelay went on to join forces with Opera Solutions, and then Vandelay and Opera united with Grand Prize Team, and then … and then … well, things got so complex we decided just to call ourselves The Ensemble.

We can be sure that there will be a lot of Netflix Prize activity in the coming weeks and maybe months as these two teams compete and perhaps more mergers create super-teams. BKPC and Ensemble could even decide to merge and share the prize. Watch the Netflix Leaderboard for the latest ranking.

UPDATE: I had assumed the 30 day last call would reset with each new leader, like auctions on ebay. Not so. The prize will be won (and lost) today! Here’s the relevant section in the rules:

“To qualify for the Grand Prize the RMSE of a Participant’s submitted predictions on the test subset must be less than or equal to 90% of 0.9525, or 0.8572 (the “qualifying RMSE”). After three (3) months have elapsed from the start of the Contest, when the RMSE of a submitted prediction set on the quiz subset improves beyond the qualifying RMSE an electronic announcement will inform all registered Participants that they have thirty (30) days to submit additional candidate prediction sets to be considered for judging. At the end of this period, qualifying submissions will be judged (see Judging below) in order of the largest improvement over the qualifying RMSE on the test subset. In the case of tied RMSE values on the test subsets, the submission received earliest by the Site will be judged first.”

The August 2009 CACM has a short note, Just for You (pdf), on recommender systems and the Netflix prize by BKPC member Don Monroe that includes a visualization by Ensemble member Chris Hefele.

Spotted on Hacker News. See Techcrunch also.

UPDATE II: The Netflix Prize contest has closed.

English

AAAI study examines long-term AI futures and impact on society

July 25th, 2009

John Markoff has an article for tomorrow’s New York Times, Scientists Worry Machines May Outsmart Man on a recent AAAI study on the future of AI.

“A robot that can open doors and find electrical outlets to recharge itself. Computer viruses that no one can stop. Predator drones, which, though still controlled remotely by humans, come close to a machine that can kill autonomously. Impressed and alarmed by advances in artificial intelligence, a group of computer scientists is debating whether there should be limits on research that might lead to loss of human control over computer-based systems that carry a growing share of society’s workload, from waging war to chatting with customers on the phone.”

The study was commissioned by AAAI to “to explore and address potential long-term societal influences of AI research and development”. Look for a report published by AAAI later this year. The study involved twenty-five participants who were divided into three subgroups: on concerns, control and guidelines, the nature and timing of disruptive advances, and ethical and legal issues.

There was a panel session earlier this month at IJCAI where some of the study participants discussed highlights from the study. Hopefully this was filmed and the results will be added to the videolectures.net IJCAI09 collection.

While I am generally skeptical of an impending technological singularity, which seems to sum up many of the concerns some have, there are aspects of the future that I do wonder about. At the top of my list is what will happen when virtually all of human knowledge is published on the Web (as it nearly is now) in a for that machines can understand. I’m pretty sure that this will happen in the next decade or two, either through the current Semantic Web approach (as a web of data) or by gradually improving techniques for machine understanding of human languages and images.

English

Recomendados en twitter

July 25th, 2009
El caparazón como mejor blog en la categoría de educación
Votar en los Premios Bitacoras.com
Medio Oficial Premios Bitacorasúltimas noticias

Compártelo



  • BarraPunto
  • del.icio.us
  • Tumblr
  • Twitter
  • Facebook
  • Google Bookmarks
  • Meneame
  • MisterWong
  • StumbleUpon
  • Technorati
  • LinkedIn
  • Wikio
  • Bitacoras.com
  • Diigo
  • FriendFeed
  • Netvibes
  • Ping.fm
  • Posterous
  • PDF
  • Print

Facebook, George Siemens, Gerd Leonhard, Nancy White, Planeta educativo, Social network, Spanish, comunidades, conversacion, enlaces, recomendaciones, rss, social media, twitter, web3.0

Submit a talk proposal for BlogTalk 2009, Social Software Conference in Korea this September! (Final Call)

July 24th, 2009

Here's an update regarding BlogTalk 2009, the 6th International Conference on Social Software. BlogTalk Asia will be held in Jeju, Korea from 15th-16th September.

read more

Asia, BlogTalk, Blogs, Danah Boyd, David Weinberger, English, Hong Kong, Jeju Island, Jeju-do, Korea, South Korea

Submit a talk proposal for BlogTalk 2009, Social Software Conference in Korea this September! (Final Call)

July 24th, 2009

Here's an update regarding BlogTalk 2009, the 6th International Conference on Social Software. BlogTalk Asia will be held in Jeju, Korea from 15th-16th September.

read more

Asia, BlogTalk, Blogs, Danah Boyd, David Weinberger, English, Hong Kong, Jeju Island, Jeju-do, Korea, South Korea

Interview with wetoku about BlogTalk 2009 (Jeju, Korea) and social software

July 24th, 2009

I was interviewed by David Lee, founder of video interview site wetoku, this morning about the forthcoming BlogTalk 2009 in Korea.

read more

Backchannel, BlogTalk, Blogs, English, Korea, Online Communities, Text box, Web, Webcam, twitter, web browser

Interview with wetoku about BlogTalk 2009 (Jeju, Korea) and social software

July 24th, 2009

I was interviewed by David Lee, founder of video interview site wetoku, this morning about the forthcoming BlogTalk 2009 in Korea.

read more

Backchannel, BlogTalk, Blogs, English, Korea, Online Communities, Text box, Web, Webcam, twitter, web browser

Submit a talk proposal for BlogTalk 2009, Social Software Conference in Korea this September! (Final Call)

July 24th, 2009

Here’s an update regarding BlogTalk 2009, the 6th International Conference on Social Software. BlogTalk Asia will be held in Jeju, Korea from 15th-16th September.

We hope that you can submit a proposal to speak at BlogTalk: the deadline is the 31st July 2009. A one-page abstract (less than 600 words) is required. http://2009.blogtalk.net/callforproposals

Why attend?

BlogTalk provides a unique interdisciplinary opportunity for academics, developers and practitioners to come together and discuss social software projects, ideas, research prototypes or success stories.

What is the structure?

As with previous events, we will have a mixture of peer-reviewed presentations, keynote speakers, discussion panels and special sessions (including one on the Korean Social Web). Previous events in the series have featured prominent speakers such as David Weinberger, Mena and Ben Trott, Matt Mullenweg, Suw Charman, Danah Boyd, Salim Ismail and Nova Spivack.

Why Jeju?

Jeju’s temperate climate, natural scenery, and beaches make it a popular tourist destination for both South Koreans and many visitors from Japan, China, northern and southern Asia. The Cheonjeyeon and Cheonjiyeon waterfalls, Mountain Halla, Hyeobje Cave, Hyeongje Island are popular places for tourists. Jeju Island was a finalist in the new ‘Seven Wonders of Nature’, and contains a Natural World Heritage Site (Jeju Volcanic Island and Lava Tubes). In the conference venue (Jungmum Resort Complex) and associated hotels, there are also many bars, casinos, pools, etc.

Jeju is easily accessible from many parts of Asia, with flights to Tokyo, Beijing, Osaka and Hong Kong. You can also fly to Seoul and from there take a one hour flight to Jeju. http://2009.blogtalk.net/travelling

We will have special hotel rates in top-class hotels at the conference venue. http://2009.blogtalk.net/accommodation

Also, BlogTalk will be held just before the Lift Asia event on the 17th and 18th, so you have double the reason to attend! There is a special joint registration rate for both events.

What people have said about BlogTalk

“Discussed what blogs are useful for and why they are changing…”
“Good to see what academics and those in business have in common…”
“Detailed and informative!”
“Inspiring!”
“Highly relevant. Small. Great mix of people from different backgrounds.”
“Well-organised and a great selection of speakers and topics. A useful and productive time.”

Programme Committee

Gabriela Avram, University of Limerick
Anne Bartlett-Bragg, Headshift
Mark Bernstein, Eastgate Systems Inc.
Stephanie Booth, Climb to the Stars
Rob Cawte, Web 2.0 Japan
Josephine Griffith, National University of Ireland, Galway
Steve Han, Korea Advanced Institute of Science and Technology
Conor Hayes, Digital Enterprise Research Institute
Jin-Ho Hur, NeoWiz
Ajit Jaokar, FutureText Publishing
Alexandre Passant, Digital Enterprise Research Institute
Robert Sanzalone, pacificIT
Jan Schmidt, Hans Bredow Institute
Hideaki Takeda, National Institute of Informatics

Contact us

blogtalk2009@gmail.com
@blogtalk on Twitter

Thanks!

John Breslin, Thomas Burg, Honggee Kim
Channy Yun, Haklae Kim

Reblog this post [with Zemanta]

Asia, BlogTalk, Blogs, Danah Boyd, David Weinberger, English, Hong Kong, Jeju Island, Jeju-do, Korea, South Korea

Interview with wetoku about BlogTalk 2009 (Jeju, Korea) and social software

July 24th, 2009

I was interviewed by David Lee, founder of video interview site wetoku, this morning about the forthcoming BlogTalk 2009 in Korea.


(I apologise for the echo, I didn’t have any headphones so was causing some feedback.)

wetoku is a very interesting service that anyone can use, whereby interviews are simply carried out through a web browser that requests a connection to your webcam and mike. It shows the interviewer on one side and the interviewee on the other, and in a backchannel, the interviewee can ask questions of the interviewer via a text box (for clarifications, etc.: these are not shown in the final video). A nice review of wetoku was recently published on Read/Write Web. You can also follow @wetoku on Twitter.

Reblog this post [with Zemanta]

Backchannel, BlogTalk, Blogs, English, Korea, Online Communities, Text box, Web, Webcam, twitter, web browser

Scripting “Find My iPhone” from Ruby

July 23rd, 2009

When the iPhone OS 3.0 came out with new Mobile Me features allowing you to remotely discover the location of your iPhone and send it a message and an alarm, I hoped that there’d be an API. While there’s no official way to access it, the enterprising Tyler Hall and Sam Pullara dug out their HTTP sniffers and figured out how the javascript on me.com talks to its backend service.

Their code is written in PHP and Java respectively, two languages I’m not particularly comfortable in. Translating from their source code, I’ve produced a ruby version and packaged it as a very simple gem. It lacks real documentation or elegant error handling, but it’s easy to figure out.

Use it like this to locate your phone:

$ sudo gem install mattb-findmyiphone --source http://gems.github.com

>> require 'rubygems' ; require 'findmyiphone'
>> i = FindMyIphone.new(username,password)
>> i.locateMe
=> {"status"=>1, "latitude"=>51.546544, "time"=>"8:06 AM", "date"=>"July 23, 2009", "accuracy"=>162.957953, "isLocationAvailable"=>true, "isRecent"=>true, "isLocateFinished"=>true, "statusString"=>"locate status available", "isAccurate"=>false, "isOldLocationResult"=>true, "longitude"=>-0.05744}

Important Message on the iPhoneAnd to send a message:

>> i.sendMessage("Unimportant message")
=> {"status"=>1, "time"=>"8:17 AM", "date"=>"July 23, 2009", "unacknowledgedMessagePending"=>true, "statusString"=>"message sent"}

Finally, if you look in the examples directory you’ll find a short script that uses the location data to update Fire Eagle via its API. Fill in the example YAML files with the appropriate credentials and it’ll do the rest.

Of course the code’s all open source and contributions via Github are very welcome.

English, Uncategorized

What kind of Semantic Web researcher are you?

July 23rd, 2009

It’s hard to keep secrets in today’s increasingly interconnected, networked world. Social network megasites, mobile phones, webcams and  inter-site syndication can broadcast and amplify the slightest fragment of information. Data linking and interpretation tools can put these fragments together, to paint a detailed picture of your life, both online and off.

This online richness creates offline risk. For example, if you’re going away on holiday, there are hundreds of ways in which potential thieves could learn that your home is vacant and therefore a target for crime: shared calendars, twittered comments from friends or family, flickr’d photographs. Any of these could reveal that your home and possessions sit unwatched, unguarded, presenting an easy target for criminals.

Q: What research challenge does this present to the Semantic Web community? How can we address the concern that Semantic and Social Web technology have more to offer Burglar Bill than to his victims?

A1: We need better technology for limiting the flow of data, proving a right to legitimate access to information, cross-site protocols for deleting leaked or retracted data that flows between sites, and calculating trust metrics for parties requesting data access.

A2: We need to find ways to reconnect people with their neighbours and neighbourhoods, so that homes don’t sit unwatched when their occupants are away.

ps. Dear Bill, I have my iphone, laptop, piggy bank and camera with me…

English, FOAF, Project ideas, SocialWeb, burglarbill, ggg, privacy

Linked Data, a Brand with Big Problems and no Brand Management

July 22nd, 2009


This post builds on the discussion that started on Twitter between Paul Miller and Ian Davis and quickly expanded to a few other folks. Paul subsequently wrote a blog post Does Linked Data need RDF?, and so did Ian Davis with The Linked Data Brand. My turn: I like Davis's suggested approach of looking as Linked Data as a brand. It made me realize something: this brand hasn't been properly managed.

Brand image, English, Ian Davis, Linked Data, Paul Miller, RDFa, Resource Description Framework, SPARQL, Semantic Web, The New York Times Company, Web 3.0, branding, tech marketing, tech ventures

ISWC Workshop on Semantic Web Applications in Scientific Discourse

July 22nd, 2009

http://esw.w3.org/topic/HCLS/ISWC2009/Workshop

The Health Care and Life Sciences Interest Group has had a workshop accepted at ISWC 2009 on Semantic Web Applications in Scientific Discourse.

Scientific research is becoming both increasingly interdisciplinary, and dependent for dissemination on the Web. Yet the form of the discourse has remained for the most part, a digital analog of the paper research article. This situation persists despite the emergence of Web 2.0 paradigms (blogs, wikis, online communities), application of Semantic Web technologies to problems in biomedicine, and the introduction of virtual research environments in certain areas. We will bring together experts in semantic technology, scientific informatics, virtual research environments, Web communities and scientific publishing to contribute to the development of new thinking on how scientific research can be communicated, characterized, annotated, searched and shared on the Web.

Details about the workshop including logistics, the deadline for paper submissions, and program committee members are available on the Web site for the workshop.

English, HCLS

Hidden Barriers for Semantic Technologies for the Enterprise

July 22nd, 2009

As part of a long term study for European Commission, Ovum has conducted in depth interviews with IT managers to determine the drivers and barriers of adopting semantic software within the enterprise. While there is some resistance to the term “Semantic,” a more substantial barrier gets back to basics: knowing the business. Semantic software vendors should take note.

Some Resistance to “Semantics” and new technology

Semantic software is growing in popularity, and cover a range of applications that understand meaning and context within a business, and can therefore be more effective in joining data, systems, and processes together to serve employees.

However, various preconceptions about technology often spring from the technology being launched prematurely or bandied too early, slowing uptake when it finally matures. Ovum has taken a close look at the use of the word, asking IT managers whether it carried negative baggage.

Overall, the word “semantic” is not a severe hindrance to the development of STEs, but does alienate some people. As one interviewee explained:

“[the term] Semantic web is a barrier. It’s associated with a vision of the web for 10 years to come. I make a point of not using ‘semantic web’”

Ovum found that some interviewees felt that while the term “semantic” may have been overused a few years ago, technology has progressed to the point of legitimacy.

The real barrier: finding people the know the business

A more critical barrier towards adopting Semantic technologies is an old-fashioned one: having people that know the business. Generally, In order to implement semantic technologies, organizations, processes, and associated data and terminology must be mapped out into a set of binding concepts, often embodied in an “ontology.”

However, it is hard to find employees who have both breadth of oversight across various departments within a company, and the depth of knowledge of the processes and terminology within those departments. Therefore, it is simply hard to find the person expert enough to populate an overarching scheme or ontology which would map out individual processes and terms from different departments.

One interviewee specifically pointed this issue out:

“My biggest concern is that we wouldn’t have the resources and find the right skills to deal with such kind of information — because you have to define and describe something, and how things should work together, and how they should be combined. To do this is difficult and the risk is we wouldn’t have the right people.

Lesson to software vendors

From this, the advice to software vendors is relatively straightforward: go a bit easy on the lingo, and most importantly: find the people within the company that know the business well enough to map it all out. Then, make sure these knowledgeable people are somewhere in the gameplan for implementation. Given that change management is assumed by most IT managers to accompany a rollout of this technology, there might be possibility to get these people in as part of the program.

English, Enterprise Software, European Commission, Ontology problems, Ovum Research, Spanish, Technologies, Value-it project, semantic technologies

1.0, 2.0 y 3.0 en 4 minutos

July 22nd, 2009
Aquí va un video de la holandesa EPN donde en 4 minutos se muestra la evolución desde la Web "prehistorica" a la Web del futuro.
Enjoy.




y si os ha gustado.. este otro de STI es incluso mejor :) lo encontrais aqui

Spanish

1.0, 2.0 y 3.0 en 4 minutos

July 22nd, 2009
Aquí va un video de la holandesa EPN donde en 4 minutos se muestra la evolución desde la Web "prehistorica" a la Web del futuro.
Enjoy.




y si os ha gustado.. este otro de STI es incluso mejor :) lo encontrais aqui

Spanish

Integrated library management systems: what we need

July 21st, 2009

blcmpAs part of the “Shock of the New” strand at the UK Umbrella conference this year, Lucy Tedd from Aberystwyth University led a session entitled “Integrated library management systems: what we need”. Attendance of this session turned out to be very supplier-heavy, and I’m not sure that’s what she anticipated. I was moderately surprised too, but thinking about it afterwards, I felt that the lack of interest from practitioners was reflective of the growing irrelevance of the traditional library management system (or ILS if you’re North American) to the needs of the modern library, particularly in academia.

It’s not that the library technology landscape has stood still, of course. Lucy was able to list quite a few innovative products– from the now-established Aquabrowser to Talis’ own Aspire resource list tool – a great product that we’re all very proud of here. But taking one step back and looking at what the library has to deliver in 2009, the library technology marketplace as a whole is failing to keep up with the pace of change.

Lucy Tedd highlighted some of the key developments of this decade. Some of them, though – such as the consolidation of the library technology marketplace with mergers, acquisitions and the increasing intervention of venture capitalists in the businesses of existing suppliers – may be symptomatic of underlying trends rather than drivers.

I felt that to get a firmer grip on the fundamental shifts in our world, I had to refer back to a session I saw last month at the annual SCONUL conference, given by Marshall Breeding (a member of Talis’Library 2.0 Gang). For the uninitiated, Marshall Breeding is an American library technology guru, author of an ongoing series of library technology guides. Where he wins out over other commentators such as Lucy Tedd is his ability to look behind headline trends, take them apart, examine the implications and project them forward. So although both Tedd and Breeding identify industry consolidation as a key trend, Breeding will go on to alert us to the disruptive impact that this has on product development, and the adverse effect this has on the lead time that libraries have to plan for a product enhancement.

Marshall Breeding hears a lot of frustration with LMS products and vendors, and is adamant that systems are not keeping up with the pace of change in libraries. Innovation, then, is falling below expectations, and Marshall reports that many US libraries are unhappy with the current state of affairs. He admitted that he wasn’t so sure about UK libraries, but following the group activity at the end of Lucy Tedd’s session, I’m quite clear that the mood here is similar to that of the US. In my group there was one librarian from Open University and one from University of Hertfordshire. Each group was asked to identify its most pressing requirement of the LMS. Both librarians agreed that the inadequacy of the LMS in managing e-resources was the biggest problem in an era in which the issuing of books is no longer the primary activity.

Marshall Breeding described the conventional LMS as untenable, now that a whole series of products required to manage fundamental library processes – such as ERM systems and knowledgebases – are located outside the LMS. In the electronic era, circulation becomes fulfilment, cataloguing is no longer MARC-centred, for example. So as the traditional modules of the LMS become less important, we need to think more in terms of SOA (Service-Oriented Architecture) – dividing functionality into small chunks that can be fitted together for multifarious purposes (a shift that my colleague Richard Wallis identified back in 2007 on this blog). This is very much the thinking of the OLE (Open Library Environment) Project, of which Marshall Breeding is a proponent.

But it’s not just a back-office problem, of course. The library OPAC, traditionally another module LMS, also suffers from the same problem, in failing to reflect the eJournals and digital objects that libraries spend so much money on. Breeding did identify further issues with library OPACs, highlighting their clunky interfaces, poor eCommerce facilities, and more worryingly, relatively weak search engines and poor relevancy ranking.

Open Source has, in the context of these difficulties, generated a lot of interest, though more in the US at present. However Breeding pointed out that Open Source offerings currently rank middle to low in terms of customer satisfaction, and the only libraries that are interested are the ones that are already doing it. There is no groundswell of interest, despite the pockets of evangelistic fervour.

Marshall Breeding also turned his attention to Web 2.0 tools, and argued persuasively against the tendency to adopt disparate tools without a broader strategy in place, which has the effect of “jettisoning library users away from our websites”. Instead, he says, Web 2.0 capabilities need to be built into the guts of our systems. I’m assuming here that he doesn’t mean library vendors reinventing social networking tools in a creepy treehouse kind of way, and that instead he’s advocating seamless integration with applications such as the VLE and Web 2.0 tools such as Twitter. Incidentally, Richard Wallis has recently been demonstrating a Juice extension enabling integration between Twitter and the OPAC.

Breeding looks forward to a future in which the library can offer a single point of access to the inside of all the eJournals that the library subscribes to. Scale is not the issue, he argues, and cites OCLC’s Lorcan Dempsey as pointing out that the whole of WorldCat will now fit on an iPod. Instead we should be looking at what the world outside the library is doing – searching the deep content directly, and identifying and examining the tools that people are using to do this. In this way, it becomes clear that the likes of Google Scholar, Amazon, Waterstones and ask.com are the competitors of the library in the 21st century, and it is incumbent upon the vendor community to help libraries with that gargantuan challenge if they are to survive.

English, Juice, Libraries, Library 2.0, Service Oriented Architecture, Talis, web 2.0

The Linked Data Brand

July 21st, 2009

Paul Miller has kicked off a twitstorm with his simple question: does linked data require RDF?. My contention is that Linked Data does absolutely require RDF. This is not a technical issue and its not one of zealots or pragmatists: its a marketing and branding issue.

The term Linked Data was coined to brand a specific class of practices: namely assigning HTTP URIs to abitrary things and making those URIs respond with RDF relating the things to other things.

Here very few of the ‘things’ are documents, instead they are people, places, objects and concepts.

That deliberately excludes many other practices of publishing data on the web such as atom feeds, spreadsheets, APIs and even many existing RDF use cases.

The purpose of giving things a brand is to engender recognition, familiarity and trust. When you open a can of Pepsi you know exactly what you are going to get. You know you will get a great user experience whatever Apple product you buy. When you buy Lego you can rely on all the pieces fitting together.

The Linked Data brand makes similar promises of quality and consistency. When you consume Linked Data you know it will be RDF so your tools will work correctly. You also know that the data will be using HTTP URIs to refer to real-world things so you can determine what the data is about. You can trust that you’re not suddenly going to be given some XML in a proprietary schema or CSV with text headings you will have to guess the meaning of.

The Semantic Web community has been notorious for its poor marketing over the past decade. Now just when it seems the community has found the right balance between technology and mass appeal it feels like people are trying to rip away that success for their own purposes. That is deliberately emotive language because brands are all about emotion.

I don’t want to see the Linked Data brand weakened because it destroys trust. That’s why I pushed back on Twitter. As all involved know I am a huge advocate of making more data available on the web for reuse. It makes me glad whenever I see people invest their time in publishing data in any format, but my heart sings when I see more Linked Data.

There are many situations where there are better approaches than Linked Data e.g. I would rather have a midi file than the RDF version. In many circumstances I would be glad of a spreadsheet – simple and convenient.

But we should not confuse these forms of data publishing with Linked Data. That would sow confusion and be counterproductive. The coming web of data will be a rich and varied space full of content and data in every format imaginable. A large part of that we will call Linked Data and when you encounter it you will be justified in expecting RDF and HTTP.

I welcome anyone who wants to share data on the web in any way. But play fair and use the Linked Data brand only when it uses the Linked Data rules.

English

RIN’s Michael Jubb Talks with Talis about bibliographic records in a networked world

July 21st, 2009

michael-jub Dr Michael Jubb, Director of the Research Information Network, is my guest for this podcast.

The RIN was established by the higher education funding councils, the research councils, and the national libraries in the UK to investigate how efficient and effective the information services provided for the UK research community are.

As part of their role, they publish many reports to inform and create debate to lead to real change.  Our conversation focuses on the recently published “Creating catalogues: bibliographic records in a networked world”, which explores the production and supply chain of bibliographic metadata for physical and electronic books, journals, and articles.  We discuss the need for the report, and therefore change in this area, its recommendations and possible ways forward.

English, Metadata, Networked linked environment, Podcast, Research, Resource sharing

PelletDb Beta Available for Download

July 20th, 2009

I won’t say much about this here, since the web page is full of details, but the PelletDb Beta Release is now available for download.

Inquiries about sales and licensing? inquiries@clarkparsia.com or +1 202 408 8770.

English, PelletDb

Raptor RDF Parser Library Version 1.4.19 Released

July 19th, 2009

It has been 13 months and thus long overdue for a new release of Raptor my RDF parsing and serializing library – if it involves going between syntax and triples, Raptor can do it.

(My excuse for the long delay I was busy working on Rasqal which is also due a release. Plus it’s lovely living in California :) )

WARNING: FUTURE ABI and API CHANGES.
The next release of Raptor 1.4.x will include bug fixes only and no new features. New development will move to Raptor 2 where a planned ABI and API break will happen. There may be preview releases of Raptor 2 with 1.9.x numbering.

The changes were as follows:

  • Many improvements to RSS tag soup (RSSes and Atom) parser and the RSS 1.0 and Atom serializers
  • Several fixes and improvements to the N-Triples, RDFa and RDF/XML parsers and Turtle serializer
  • Improved the use and configuration of static libxml functions for better compatibility
  • Several Win32 portability fixes – Lou Sakey
  • Many internal changes for upcoming Raptor V2 – primarily by Lauri Aalto
  • Many other fixes and resilience improvements.
  • Fixed reported issues: 0000259, 0000262, 0000263, 0000266, 0000269, 0000270, 0000276, 0000277, 0000287, 0000288, 0000289, 0000290, 0000293, 0000296, 0000299 and 0000303.

See the Raptor 1.4.19 Release Notes for the full details of the changes.

English, comment