The AKSW research group is pleased to announce that OntoWiki 0.9.5 is now available for download.
OntoWiki is a web-application enabling the collaborative creation and (linked data) publication of RDF knowledge bases.
More information about OntoWiki can be found at http://ontowiki.net. You can download OntoWiki in our google code file section. Enhancements in this release include:
  • Support for Semantic Pingback, a protocol which enables OntoWiki to communicate named links from linked data resources or blog systems like WordPress.
  • Support for the publication of provenance information via Linked Data.
  • A new navigation module which support the configuration and usage of arbitrary navigation hierarchies (e.g. based on classes, SKOS elements, geospatial entities or FOAF groups).
  • A bookmarklet for collecting RDFa-based information into a specific OntoWiki knowledge base.
  • More editing widgets, e.g. for phone number and mailto: resources.
  • A new mapping module for the resource visualisation and filtering based on maps.
  • Attribute / Tag clouds based on selected RDF properties.
  • A GUI for complex SPARQL filter (contains, larger, smaller, between and bound)
  • A JSON/RPC server as an additional interface (e.g. for the command line client)
  • A plugin to create nice URIs based on the content of a new resource.
A detailed log of the over 200 enhancements and bug fixes of this release is available at our issue tracker. Many thanks to the contributors of this OntoWiki release (in alphabetical order): Atanas Alexandrov, Christian Maier, Christoph Riess, Jonas Brekle, Marvin Frommhold, Michael Haschke, Michael Martin, Michael Niederstätter, Natanael Arndt, Norman Heino, Philipp Frischmuth and Tim Ermilov best regards Sebastian Tramp
I still remember when I was publishing HTML for the first time in my life: It took place in 1996 and I used Microsoft Frontpage. It was exciting because then “I was on the Internet”. Yesterday, around 15 years later something similar happened: I published Linked Data for the first time actively! Eureka! linked-data-frontend Sure, by using Semantic MediaWiki or Wordpress’s SIOC plugin “I was already on the Semantic Web” – but a lot of data which is produced by such tools is not Linked Data but simple RDF. A closer look at all the datasets in the LOD cloud also reveals that none of them can be edited with an ease, except upcoming DBpedia Live which offers “real-time semantic web”. Conclusio: So far most of the linked data in the LOD cloud was generated by DB2RDF mapping tools like D2R which can only be handled by semantic web experts and technicians. Don´t get me wrong – this is a very important basic layer for the LOD world. All automatically generated datasets like DBpedia are kind of “highways” on the linked data map. Now it´s time to pave the side streets. Just imagine, a teacher would like to publish his knowledge about Italian painters in a way it can be re-used as linked data. Should we tell him to “open an editor, to start typing RDF triples and to upload the file via FTP”? When we started to design PoolParty in 2007 we had people in our minds who would like to contribute actively to producing data for the semantic web. People working for organizations with special domain knowledge are not only able to connect the dots from the linked data highways but also know how to customize such data for their own applications. PoolParty 2.7 offers the following features and functionalities for such tasks:
  • Linked Data editing: users generate linked data to describe their resources (concepts) on top of SKOS
  • Linked data lookup: mapping between own thesauri and additional facts from the semantic web The following resources can be used at the moment: DBpedia, Umbel, Yago, DMOZ, LCSH, Geonames & Wordnet; this service is highly configurable – also internal linked data sources can be mapped and used to enrich local thesauri; the lookup service makes use of the very fast TuQS server
  • Linked data publishing: based on linked data patterns any resource can be published as linked data, ready to re-use for any linked data application; example: http://open.poolparty.punkt.at/Wine/13 which can also be viewed by linked data browsers like Zitgist’s DataViewer
  • SPARQL endpoint: another way how PoolParty’s RDF data can be accessed by semantic web developers
In addition to these features PoolParty 2.7 comes with some other new features:
  • Translation support: works for nearly any language and domain with high accuracy – thanks to Google Translate
  • Online Documentation: PoolParty’s end-user manual is open for the public, easy to access and searchable; since PoolParty 2.7 it is available not only as PDF document but also as browsable Wiki
  • Flexible Reporting Tool: As we have already blogged before, PoolParty’s new reporting tool is flexible enough to manage to export formats like, for example, Google Synonyms; also “traditional” thesaurus reports like hierarchical reports are available
  • iPhone front-end: If you have to do research using your thesauri while you are somewhere outside of the office, this could be a possible solution for you – see this screenshot!
If you also want to publish some linked data (for the first time in your life :-) ) register to get a PoolParty demo account and go for it! It´s really easy.
Extended Semantic Web Conference started yesterday in Hersonissos, Crete. AKSW is involved in this years ESWC in various ways: We co-organized the 6th Workshop on Scripting and Development (SFSW10) probably for the last time this year at ESWC, since the original aim of promoting more light-weight, pragmatic semantic web applications of the SFSW workshop series became now rather mainstream. Sören was one of the panelists of the panel on “Linked Data: Now what?”. With the two papers “LESS - Template-Based Syndication and Presentation of Linked Data” and “Improving the Performance of Semantic Web Applications with SPARQL Query Result Caching” AKSW is also well represented in the main scientific conference programme.
PoolParty 2.7 offers new and comfortable ways to enrich any SKOS thesaurus with additional facts from the semantic web (see: LOD cloud). This functionality (which was extended significantly with version 2.7 in June 2010) supports any thesaurus manager to generate much richer knowledge models (ontologies) around specific domains than ever before (without facing high extra costs due to additional research). There are at least three arguments why one should consider building such “extended thesauri”: skos-linked-data
  1. Use even more metadata to describe your resources and improve navigation and semantic search functionalities significantly
  2. Publish (at least) parts of your metadata / knowledge models as linked (open) data to stimulate innovative services around your contents on top of network effects
  3. Use linked data for data integration and semantic mashups; combine your own contents with contents from the web to improve your business intelligence
» Read the rest of this entry «

We recently heard that Oracle 11G loaded RDF faster than we did. Now, we never thought the speed of loading a database was as important as the speed of query results, but since this is the sole area where they have reportedly been tested as faster, we decided it was time loading was addressed. Indeed, without Oracle to challenge us on query performance, we would not be half as good as we are. So, spurred on by the Oracular influence, we did something about our RDF loading.

Read more on Orri Erling’s Weblog…

A short while ago the Semantic Web Journal was launched. Pascal Hitzler and Krzysztof Janowicz are editors-in-chief and AKSW’s Sören Auer serves on the editorial board. The journal published by IOS Press differs from other journals, in particular, since it follows an open and transparent peer-review process, which engages a wider community besides expert reviews. Supported by its young and agile editorial board, the SWJ can be expected to bring a lot of fresh wind to an already aging Semantic Web community. Since SWJ just started, it is right now a perfect time to submit an article to SWJ or propose a special issue. Please check out: http://www.semantic-web-journal.net.
Martin Hepp just published a short tutorial on using the GoodRelations vocabulary for product, price, and company data together with LESS. Martin explains, how a public SPARQL endpoint holding GoodRelations data can be accessed and results displayed using LESS. The tutorial is available at: http://www.ebusiness-unibw.org/wiki/GoodRelationsAndLESS

Turing’s Test & The Stock Market

A Non-standard Introduction to Sentiment Analysis in 3 Parts

Part 1 – CAPTCHA to Gotcha:

A Brief History of Artificial Intelligence

Alan Turing was a prominent British mathematician and one of the most inspiring pioneers of modern computer science. In 1950, at the age of 38, he published his seminal paper Computing Machinery and Intelligence, which till this day remains probably the single most influential paper in the field of Artificial Intelligence (AI).

Since Digital Trowel’s core technology is based on machine learning, a modern offshoot of AI, it would be conducive (and nice!) to get back to the basics, and learn a bit about the history that continues to shape both the science itself and the challenges we face at DT.

Big words and complications aside, Turing begins his paper with the simple yet perplexing question: “Can machines think?” Nevertheless, realizing that “thinking” is a highly ambiguous term, Turing immediately proposed an alternative question that would be free of obscurities and eschew obfuscations. Instead of dealing with machines’ capacity for thinking, he focused on their capacity to emulate human thought. In simplified terms the question he suggested was:

Could machines be made to simulate human thought well enough so as to fool a person into believing they were actually human?

This question is the essence of what has come to be called the Turing Test. It proceeds as follows: a human judge engages in a natural language conversation with one human and one machine, each of which tries to appear human. All participants are placed in isolated locations. If the judge cannot reliably tell the machine from the human, the machine is said to have passed the test. In order to test the machine’s intelligence rather than its ability to render words into audio, the conversation is limited to a text-only channel such as a computer keyboard and screen.

At the time of its publication, many people viewed the prospect of machines ever reaching the level of human computational power an impossibility, but in his paper, Turing, armed with his visionary intuition and razor-sharp mathematical analysis, set out to invalidate contemporary objections, ending with a speculation of his own, that one day machines would indeed emulate human thought, thereby passing the Turing Test!

Inspired by the challenge, Digital Trowel’s groundbreaking technology has taken several huge steps forward in proving that Turing was right. The technology we’ve developed allows computers to extract not only the facts communicated by the text, but also the underlying sentiment or, if you will, the attitude associated with the message conveyed. In simple words, we’re enabling computers to understand the full meaning not only of the text, but of the subtext – just as a human would. But hold your horses! Before we continue, let’s try to explain why the problem is so difficult, so we can more fully appreciate the profundity of Digital Trowel’s achievement and its extraordinary implications.

» Read the rest of this entry «