<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>“KWARC was!” &#187; Krextor</title>
	<atom:link href="http://kwarc.info/blog/category/krextor/feed/" rel="self" type="application/rss+xml" />
	<link>http://kwarc.info/blog</link>
	<description>KWARC research group's blog</description>
	<lastBuildDate>Mon, 30 Jan 2012 15:40:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Microdata vs. RDFa – What does it mean to us?</title>
		<link>http://kwarc.info/blog/2009/10/28/microdata-vs-rdfa/</link>
		<comments>http://kwarc.info/blog/2009/10/28/microdata-vs-rdfa/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 14:58:48 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[JOMDoc]]></category>
		<category><![CDATA[Krextor]]></category>
		<category><![CDATA[OMDoc]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=919</guid>
		<description><![CDATA[Only today I became aware of microdata, the proposed way of embedding semantic annotations into HTML5. (Yes, they adopted the syntax that Michael also prefers for OMDoc, and which I personally hate, but I will get used to it.) Microdata are not to be confused with microformats, a poor man&#8217;s way of annotation that (ab)uses [...]]]></description>
			<content:encoded><![CDATA[<p>Only today I became aware of <a href="http://www.whatwg.org/specs/web-apps/current-work/multipage/microdata.html">microdata</a>, the proposed way of embedding semantic annotations into HTML5. (<a href="http://blog.whatwg.org/spelling-html5">Yes, they adopted the syntax</a> that Michael also prefers for OMDoc, and which I personally hate, but I will get used to it.) Microdata are not to be confused with <a href="http://microformats.org">microformats</a>, a poor man&#8217;s way of annotation that (ab)uses CSS classes and thus is compatible with HTML 4. Microdata are something like RDFa but</p>
<ol>
<li>are slightly easier to use for people who don&#8217;t understand XML namespaces
<ul>
<li>granted, RDFa&#8217;s excessive reliance on XML namespaces makes it hard to parse, and makes it unbearably complex to copy/paste a fragment, which is an important use case for HTML5</li>
</ul>
</li>
<li>allow for ad hoc pseudo-semantic markup when you do not use an ontology
<ul>
<li>What&#8217;s the point in annotating at all, then?</li>
</ul>
</li>
<li>compatible with the non-XML syntax of HTML5 (which should have been ditched IMHO, but, well, in the interest of reactionary users and software, <a href="http://www.smashingmagazine.com/2009/07/29/misunderstanding-markup-xhtml-2-comic-strip/">they decided differently</a>)</li>
</ol>
<p>The fight for the future of RDFa in HTML is going on, but what does that mean to KWARC? We have incorporated RDFa into <a href="http://omdoc.org">OMDoc</a> as <a href="https://svn.omdoc.org/repos/omdoc/trunk/doc/blue/foaf/note.pdf">a means of extending the metadata vocabularies</a>. RDFa, originally designed for XHTML, is prepared for being integrated into any XML language, including OMDoc. HTML5 microdata are an integral part of the HTML5 specification and would not work in other XML languages. OK, but we present OMDoc documents as HTML to make them human-readable. In this output, we want to preserve the semantics of the OMDoc markup, and for that we had always been thinking about using RDFa. (<a href="http://jomdoc.omdoc.org/ticket/266">We know exactly how to do it</a>, but just have not yet implemented that step, though.) We could use HTML5 microdata instead, but:</p>
<ol>
<li>RDFa has little software support so far, but microdata have none (beyond proofs of concept)</li>
<li>We generate XML-compliant HTML. <a href="http://www.w3.org/TR/html5-diff/#mathml-svg">The non-XML syntax of HTML5 supports embedded MathML</a>, but I doubt that it will support parallel <a href="http://www.openmath.org">OpenMath</a> markup, where elements from yet another namespace are embedded into the MathML formulae.</li>
<li>We <em>generate</em> HTML. The embedded annotations need not be authored manually, so they do not have to be easy to author.</li>
<li>We are interested in using well-defined ontologies to express semantics, so we don&#8217;t need ad hoc “semantic” markup.</li>
</ol>
<p>What do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2009/10/28/microdata-vs-rdfa/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Krextor Publicity</title>
		<link>http://kwarc.info/blog/2009/07/20/krextor-publicity/</link>
		<comments>http://kwarc.info/blog/2009/07/20/krextor-publicity/#comments</comments>
		<pubDate>Mon, 20 Jul 2009 12:25:16 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Krextor]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=911</guid>
		<description><![CDATA[I was surprised to find the following search result for Krextor The document “Krextor – An extensible XML→RDF extraction framework.pdf” is no longer available on docstoc. It has either been removed by the original owner of the document or by the docstoc staff due to copyrighted or inappropriate content. Isn&#8217;t that actually a proof of [...]]]></description>
			<content:encoded><![CDATA[<p>I was surprised to find <a href="http://www.docstoc.com/docs/8384154/XML-Krextor- -An-Extensible-XML�RDF-Extraction-Frameworkpdf">the following search result</a> for Krextor</p>
<blockquote><p>The document “Krextor – An extensible XML→RDF extraction framework.pdf” is no longer available on docstoc.<br />
It has either been removed by the original owner of the document or by the docstoc staff due to copyrighted or inappropriate content.</p></blockquote>
<p>Isn&#8217;t that actually a proof of success, in this new age of the Pirate Party? <img src='http://kwarc.info/blog/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p><a href="http://www.slideshare.net/langec/krextor-an-extensible-xmlrdf-extraction-framework">Here is where it was stolen from</a>, and <a href="http://www.semanticscripting.org/SFSW2009/short_2.pdf">here is the paper</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2009/07/20/krextor-publicity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google likes us</title>
		<link>http://kwarc.info/blog/2009/01/12/google-likes-us/</link>
		<comments>http://kwarc.info/blog/2009/01/12/google-likes-us/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 17:07:03 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Krextor]]></category>
		<category><![CDATA[curie]]></category>
		<category><![CDATA[rdfa]]></category>
		<category><![CDATA[uri]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=849</guid>
		<description><![CDATA[I&#8217;m really not the only one who has ever implemented compact URIs (CURIEs), but when I googled for “safe curie” today, Krextor was the first hit, far ahead of the RDFa specification. Cool!]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m really not the only one who has ever implemented <a href="http://www.w3.org/TR/rdfa-syntax/#s_curies">compact URIs (CURIEs)</a>, but when I googled for “<a href="http://www.google.de/search?q=&quot;safe+curie&quot;">safe curie</a>” today, Krextor was the first hit, far ahead of the <a href="http://www.w3.org/TR/rdfa-syntax/">RDFa specification</a>. Cool!</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2009/01/12/google-likes-us/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Reinventing the XML→RDF wheel?</title>
		<link>http://kwarc.info/blog/2009/01/11/reinventing-the-xml%e2%86%92rdf-wheel/</link>
		<comments>http://kwarc.info/blog/2009/01/11/reinventing-the-xml%e2%86%92rdf-wheel/#comments</comments>
		<pubDate>Sun, 11 Jan 2009 14:30:43 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Krextor]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=847</guid>
		<description><![CDATA[When researching into related work for Krextor, I discovered this paper about XSDL (XML Semantics Definition Language). (Note that by XSDL the authors do not mean the new name of W3C XML Schema, as the latter has only been renamed recently.) XSDL is a language that allows for solving very similar problems as Krextor – [...]]]></description>
			<content:encoded><![CDATA[<p>When researching into related work for <a href="http://kwarc.info/projects/krextor/">Krextor</a>, I discovered <a href="http://www.is.pku.edu.cn/~mayyam/papers/XSDL Making XML Semantics Explicit.pdf">this paper about XSDL (XML Semantics Definition Language)</a>. (Note that by XSDL the authors do not mean the new name of <a href="http://www.w3.org/XML/Schema">W3C XML Schema</a>, as the latter has only been renamed recently.) XSDL is a language that allows for solving very similar problems as Krextor – extracting RDF in terms of some ontology from XML documents. I had always been looking for a nice declarative way of doing so, and there it is.</p>
<p><span id="more-847"></span></p>
<p>I should have known earlier. I <em>knew</em> it earlier! This paper already existed in my document collection, added on 2007/08/08, and in my ever-growing to-do list there was a neglected entry “read documents added on 2007/08/08”!</p>
<p>For a moment it seemed to me that I had reinvented the wheel, but actually it&#8217;s not that bad. While XSDL has a solid formal specification (which Krextor does not have), it seems that it has never been implemented. Krextor has been implemented, as it emerged from a very concrete need to get a concrete task done, and it has been evaluated in various settings. Therefore, I can use XSDL as an inspiration w.r.t. the theoretical background and a nicer declarative syntax.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2009/01/11/reinventing-the-xml%e2%86%92rdf-wheel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>XML Pattern Matching and Functional Programming</title>
		<link>http://kwarc.info/blog/2008/12/02/xml-pattern-matching-and-functional-programming/</link>
		<comments>http://kwarc.info/blog/2008/12/02/xml-pattern-matching-and-functional-programming/#comments</comments>
		<pubDate>Tue, 02 Dec 2008 18:00:37 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Krextor]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[xml]]></category>
		<category><![CDATA[XSLT]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=842</guid>
		<description><![CDATA[I&#8217;m currently reconsidering whether it was a good idea to implement my XML→RDF extraction library Krextor in XSLT. Writing down my actual requirements, I realized that I need a language that supports pattern matching on XML elements and attributes, using a syntax that is close to literal XML or to XPath (for easily writing extraction [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m currently reconsidering whether it was a good idea to implement my XML→RDF extraction library <a href="http://kwarc.info/projects/krextor/">Krextor</a> in XSLT. Writing down <a href="https://trac.kwarc.info/krextor/wiki/DevelopmentNotes">my actual requirements</a>, I realized that I need a language that supports</p>
<ul>
<li>pattern matching on XML elements and attributes, using a syntax that is close to literal XML or to XPath (for easily writing extraction rules, which should also be done by other developers in future)</li>
<li>functional programming (in some way), as the whole idea of mapping XML to RDF (and thus XML nodes to URIs) can be modeled most elegantly using a functional approach. (This is rather a requirement for me implementing the generic core of Krextor, but also for extraction module developers once the XML input language is a bit more complex.)</li>
</ul>
<p>Having looked a bit into XQuery, Scala, and JavaScript (and a little bit into Haskell), I decided to stick to XSLT for now. Functional programming is awkward <a href="http://fxsl.sourceforge.net">but possible</a>, and XML pattern matching is awkward or non-intuitive in most other languages.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2008/12/02/xml-pattern-matching-and-functional-programming/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Documenting XSLT</title>
		<link>http://kwarc.info/blog/2008/11/26/documenting-xslt/</link>
		<comments>http://kwarc.info/blog/2008/11/26/documenting-xslt/#comments</comments>
		<pubDate>Tue, 25 Nov 2008 22:47:47 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Krextor]]></category>
		<category><![CDATA[documentation]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[XSLT]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=828</guid>
		<description><![CDATA[A considerable part of the implementation of my research prototype(s) is done in XSLT. Now that the extraction of RDF from semantic markup is more and more turning in to a project of its own, more software engineering was needed – including proper documentation. It turned out that XSLTdoc is a really nice solution for [...]]]></description>
			<content:encoded><![CDATA[<p>A considerable part of the implementation of my research prototype(s) is done in <a href="http://www.w3.org/TR/xslt20">XSLT</a>. Now that the extraction of RDF from semantic markup is more and more turning in to <a href="https://trac.kwarc.info/krextor/">a project of its own</a>, more software engineering was needed – including proper documentation.</p>
<p>It turned out that <a href="http://www.pnp-software.com/XSLTdoc/">XSLTdoc</a> is a really nice solution for that: Just put a few additional XML elements in front of every template or function and run a special XSLT to generate the documentation. Works like javadoc and <a href="https://trac.kwarc.info/krextor/export/556/trunk/doc/xsltdoc/extract/util/rdfa.xsl.xd.html">looks nice</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2008/11/26/documenting-xslt/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

