Semantic Web Company talked with Kingsley Idehen who is CEO of OpenLink Software and probably one of the most profound experts on data integration issues about “Linked Data”.

The interview covers questions like:

  • How can Linked Data help to make companies more productive?
  • Do you think that the Linked Data Initiative can build upon a stable architecture or will it face more and more problems the bigger the “cloud” will grow?
  • What´s the ultimate argument for an Enterprise Architect to use languages like SPARQL at least in addition to SQL?
  • How will a “Real Time Semantic Web” change the whole game?
  • How will the “Semantic Web” be called in 10 years? Will there still be a “Semantic Web”?

Read the full version of the interview here.

The AKSW research group is pleased to announce that OntoWiki 0.9.5 is now available for download.
OntoWiki is a web-application enabling the collaborative creation and (linked data) publication of RDF knowledge bases.
More information about OntoWiki can be found at http://ontowiki.net. You can download OntoWiki in our google code file section. Enhancements in this release include:
  • Support for Semantic Pingback, a protocol which enables OntoWiki to communicate named links from linked data resources or blog systems like WordPress.
  • Support for the publication of provenance information via Linked Data.
  • A new navigation module which support the configuration and usage of arbitrary navigation hierarchies (e.g. based on classes, SKOS elements, geospatial entities or FOAF groups).
  • A bookmarklet for collecting RDFa-based information into a specific OntoWiki knowledge base.
  • More editing widgets, e.g. for phone number and mailto: resources.
  • A new mapping module for the resource visualisation and filtering based on maps.
  • Attribute / Tag clouds based on selected RDF properties.
  • A GUI for complex SPARQL filter (contains, larger, smaller, between and bound)
  • A JSON/RPC server as an additional interface (e.g. for the command line client)
  • A plugin to create nice URIs based on the content of a new resource.
A detailed log of the over 200 enhancements and bug fixes of this release is available at our issue tracker. Many thanks to the contributors of this OntoWiki release (in alphabetical order): Atanas Alexandrov, Christian Maier, Christoph Riess, Jonas Brekle, Marvin Frommhold, Michael Haschke, Michael Martin, Michael Niederstätter, Natanael Arndt, Norman Heino, Philipp Frischmuth and Tim Ermilov best regards Sebastian Tramp
I still remember when I was publishing HTML for the first time in my life: It took place in 1996 and I used Microsoft Frontpage. It was exciting because then “I was on the Internet”. Yesterday, around 15 years later something similar happened: I published Linked Data for the first time actively! Eureka! linked-data-frontend Sure, by using Semantic MediaWiki or Wordpress’s SIOC plugin “I was already on the Semantic Web” – but a lot of data which is produced by such tools is not Linked Data but simple RDF. A closer look at all the datasets in the LOD cloud also reveals that none of them can be edited with an ease, except upcoming DBpedia Live which offers “real-time semantic web”. Conclusio: So far most of the linked data in the LOD cloud was generated by DB2RDF mapping tools like D2R which can only be handled by semantic web experts and technicians. Don´t get me wrong – this is a very important basic layer for the LOD world. All automatically generated datasets like DBpedia are kind of “highways” on the linked data map. Now it´s time to pave the side streets. Just imagine, a teacher would like to publish his knowledge about Italian painters in a way it can be re-used as linked data. Should we tell him to “open an editor, to start typing RDF triples and to upload the file via FTP”? When we started to design PoolParty in 2007 we had people in our minds who would like to contribute actively to producing data for the semantic web. People working for organizations with special domain knowledge are not only able to connect the dots from the linked data highways but also know how to customize such data for their own applications. PoolParty 2.7 offers the following features and functionalities for such tasks:
  • Linked Data editing: users generate linked data to describe their resources (concepts) on top of SKOS
  • Linked data lookup: mapping between own thesauri and additional facts from the semantic web The following resources can be used at the moment: DBpedia, Umbel, Yago, DMOZ, LCSH, Geonames & Wordnet; this service is highly configurable – also internal linked data sources can be mapped and used to enrich local thesauri; the lookup service makes use of the very fast TuQS server
  • Linked data publishing: based on linked data patterns any resource can be published as linked data, ready to re-use for any linked data application; example: http://open.poolparty.punkt.at/Wine/13 which can also be viewed by linked data browsers like Zitgist’s DataViewer
  • SPARQL endpoint: another way how PoolParty’s RDF data can be accessed by semantic web developers
In addition to these features PoolParty 2.7 comes with some other new features:
  • Translation support: works for nearly any language and domain with high accuracy – thanks to Google Translate
  • Online Documentation: PoolParty’s end-user manual is open for the public, easy to access and searchable; since PoolParty 2.7 it is available not only as PDF document but also as browsable Wiki
  • Flexible Reporting Tool: As we have already blogged before, PoolParty’s new reporting tool is flexible enough to manage to export formats like, for example, Google Synonyms; also “traditional” thesaurus reports like hierarchical reports are available
  • iPhone front-end: If you have to do research using your thesauri while you are somewhere outside of the office, this could be a possible solution for you – see this screenshot!
If you also want to publish some linked data (for the first time in your life :-) ) register to get a PoolParty demo account and go for it! It´s really easy.

Lyndon Nixon

From June 9 – 11, 2010 the EuroITV Conference discusses latest advances and research of media technology, HCI, media studies, and the content creation community. Tassilo Pellegrini talked to Lyndon Nixon, STI International, about the future role of semantic technologies in the television industry and how a Social Semantic Web might influence the traditional television experience.


At this year’s EuroITV conference you will hold a workshop on the EU project NoTube. Can you give us a brief insight what this project is about?

NoTube is all about the future of television! We are seeing a significant shift in viewing patterns driven by the Web, which breaks the linear programming model and makes TV or video on demand a reality, whether it is being provided directly by the broadcasters or via a third party like Hulu or YouTube. The Web-based model taken up by viewers using their PC is being transferred back to the TV set in the lounge by IPTV applications running on Set Top Boxes or Internet TVs which come with Web access built into them. The strong interaction between the desires of users and technology has had its impact on the Web and as the gap between the Web and TV experience grows, we aim to translate features of the Web to TV, such as the personalised and community aspects. The NoTube European project puts the TV user back in the driver’s seat by generating user profiles from data the user creates on the Social Web, and in this way facilitating a personalised TV experience without an intrusive user profiling process.

What promises does the Social Semantic Web hold with respect to innovate the television experience? What is the vision?

With the hundreds of channels available via modern TV providers, content selection and dealing with the vast amount of TV-related information become significant challenges for users. TV metadata is created and distributed by a small group of people, as a result of the closed-source information exchange protocols that are the standard for providing electronic programme guide (EPG) data to users. Yet people often have several clusters of personal data on the Web, such as their profiles on social networks, or ratings of videos on YouTube and IMDB.

Analogously, there are many isolated clusters of broadcast data on the Web, such as broadcast data on EPGs and background information on Wikipedia. Within the NoTube vision context, we speculate that the conjunction of all these bits and pieces of data provide accurate information on someone’s interests, which is suitable for generating relevant recommendations on TV broadcasts. We see progress on opening up this data with open standards and APIs such as Google’s OpenSocial, Facebook’s OpenGraph, DBPedia, the BBC ontologies and FOAF. Further, we assume that Semantic Web technologies provide important building blocks for realizing this vision, as they enable the global identification mechanism of URIs and the means to define relations between data anywhere on the Web. By integrating these different pockets of data, we can provide TV viewers with personalised recommendations for their viewing.

What economic effects on the value chain do you expect from semantically empowered television? Will there be new revenue opportunities with respect to advertising or Pay TV models?

Our primary focus is on open source and open standards, so for example we are extending the open source MythTV media centre to develop first scenarios of personalised EPGs. However, down the road there are clearly commercialisation opportunities.

Another scenario in the project looks at personalised advertising, which is clearly somewhere where there are revenue opportunities. However, we take user privacy very seriously, and one aspect we need to tackle in NoTube is the fine line between analysing user activity (in order to personalise their TV experience) and using that analysis commercially.

The third NoTube scenario involves pushing personalised news streams to TV viewers. Here, one could imagine that such a service could be packaged within a Pay TV offer, and used to give competitive advantage or justify a higher fee.

Despite many attempts experience has shown that television is a rather conservative and innovation-averse medium. What can be done to stimulate the uptake of semantic technologies in the television sector?

That’s true; in the traditional broadcasting sector the larger companies are extremely slow to adopt new technologies. However, I think Web video and TV has really shook up the sector – traditional broadcasters are seeing that they lose viewer share to Web-based offers and have been quick to take their video material to the Web. There is a clear demand for this, look at the viewing numbers for BBC’s iPlayer in the UK for example.

IPTV also means that new applications and services can be built on top of traditional TV. I think once the broadcasters see the added value of offering applications and services tied into the content of their programming – such as through semantic analysis of the program metadata, which NoTube is doing – they will be encouraged to support better these efforts. The BBC is really taking a lead in this, publishing a lot of their data already in RDF.

Workshop Information

The NoTube workshop on Future Television: integrating the Social and
Semantic Web
will take place at the EuroITV 2010 conference in Tampere, Finland on June 9, 2010.
For more information please see

http://www.euroitv2010.org

and

http://www.notube.tv/news/73-futuretv-2010

For more information about NoTube, please see

http://notube.tv and follow our blog, at http://blog.notu.be

About Lyndon Nixon

Dr. Lyndon Nixon joined STI International as senior postdoctoral researcher in November 2008. Previously he was a researcher at the FU Berlin, where he acted as Industry Area Co-Manager of the EU Network of Excellence KnowledgeWeb and double Workpackage Leader in the EU project TripCom. In KnowledgeWeb, Dr. Nixon organized and led activities promoting the transfer of semantic technology to industry. He received his PhD in January 2007 with the topic ‘Semantic Web enabled Multimedia Presentation system’. His research focus is Web-based TV/video and the semantically guided integration of Web-based content, and he has several publications and has organized a number of workshops around related themes.

Extended Semantic Web Conference started yesterday in Hersonissos, Crete. AKSW is involved in this years ESWC in various ways: We co-organized the 6th Workshop on Scripting and Development (SFSW10) probably for the last time this year at ESWC, since the original aim of promoting more light-weight, pragmatic semantic web applications of the SFSW workshop series became now rather mainstream. Sören was one of the panelists of the panel on “Linked Data: Now what?”. With the two papers “LESS - Template-Based Syndication and Presentation of Linked Data” and “Improving the Performance of Semantic Web Applications with SPARQL Query Result Caching” AKSW is also well represented in the main scientific conference programme.
PoolParty 2.7 offers new and comfortable ways to enrich any SKOS thesaurus with additional facts from the semantic web (see: LOD cloud). This functionality (which was extended significantly with version 2.7 in June 2010) supports any thesaurus manager to generate much richer knowledge models (ontologies) around specific domains than ever before (without facing high extra costs due to additional research). There are at least three arguments why one should consider building such “extended thesauri”: skos-linked-data
  1. Use even more metadata to describe your resources and improve navigation and semantic search functionalities significantly
  2. Publish (at least) parts of your metadata / knowledge models as linked (open) data to stimulate innovative services around your contents on top of network effects
  3. Use linked data for data integration and semantic mashups; combine your own contents with contents from the web to improve your business intelligence
» Read the rest of this entry «

Uncovering the Secrets of Synergy

Well, in the previous section we mentioned in passing that our technology was based on a synergistic approach, combining syntax, semantics and pragmatics. In this final part of the survey, we’ll explain just how we do this, and why our system yields unparalleled results. In doing so we’ll do our best to abstract away from the underlying mathematics and details of the machine-learning algorithms, and instead present the linguistic principles by which our algorithms work using examples. To wrap things up, we’ll end this review with a snapshot of what Digital Trowel’s Sentiment Analysis looks like in action.

Our technological approach begins with the observation that sentiment is conveyed on three interacting levels of increasing structural complexity. Namely the lexical, phrasal and semantic-event level of structure. We’ll explain.

» Read the rest of this entry «

We recently heard that Oracle 11G loaded RDF faster than we did. Now, we never thought the speed of loading a database was as important as the speed of query results, but since this is the sole area where they have reportedly been tested as faster, we decided it was time loading was addressed. Indeed, without Oracle to challenge us on query performance, we would not be half as good as we are. So, spurred on by the Oracular influence, we did something about our RDF loading.

Read more on Orri Erling’s Weblog…

A short while ago the Semantic Web Journal was launched. Pascal Hitzler and Krzysztof Janowicz are editors-in-chief and AKSW’s Sören Auer serves on the editorial board. The journal published by IOS Press differs from other journals, in particular, since it follows an open and transparent peer-review process, which engages a wider community besides expert reviews. Supported by its young and agile editorial board, the SWJ can be expected to bring a lot of fresh wind to an already aging Semantic Web community. Since SWJ just started, it is right now a perfect time to submit an article to SWJ or propose a special issue. Please check out: http://www.semantic-web-journal.net.