<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>“KWARC was!” &#187; semantic documents</title>
	<atom:link href="http://kwarc.info/blog/category/semantic-documents/feed/" rel="self" type="application/rss+xml" />
	<link>http://kwarc.info/blog</link>
	<description>KWARC research group's blog</description>
	<lastBuildDate>Mon, 30 Jan 2012 15:40:16 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>I-Semantics Conference and Linked Data Cup</title>
		<link>http://kwarc.info/blog/2012/01/30/i-semantics-conference-and-linked-data-cup/</link>
		<comments>http://kwarc.info/blog/2012/01/30/i-semantics-conference-and-linked-data-cup/#comments</comments>
		<pubDate>Mon, 30 Jan 2012 15:40:16 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[cfp]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[i-semantics]]></category>
		<category><![CDATA[isem2012]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=1003</guid>
		<description><![CDATA[I-Semantics is a very nice conference on applied semantic web research, excellent for networking with the European research community and industry. Call for Papers I-SEMANTICS 2012 8th International Conference on Semantic Systems Graz, Austria, 5 &#8211; 7 September 2012 including Call for Submissions 5th Linked Data Cup Latest News: Wolters Kluwer Germany main sponsor of [...]]]></description>
			<content:encoded><![CDATA[<p>I-Semantics is a very nice conference on applied semantic web research, excellent for networking with the European research community and industry. </p>
<div id="outline-container-1" class="outline-2">
<h2 id="sec-1">Call for Papers</h2>
<div class="outline-text-2" id="text-1">
<p>  <a href="http://www.i-semantics.at">I-SEMANTICS 2012</a>   8th International Conference on Semantic Systems   Graz, Austria, 5 &#8211; 7 September 2012 </p>
<p>   including   Call for Submissions   5th Linked Data Cup </p>
</p></div>
</p></div>
<div id="outline-container-2" class="outline-2">
<h2 id="sec-2">Latest News:</h2>
<div class="outline-text-2" id="text-2">
<ul>
<li>Wolters Kluwer Germany main sponsor of I-SEMANTICS 2012 </li>
<li>I-SEMANTICS proceedings published by ACM ICPS </li>
<li>Important Dates (Research &amp; Application Papers &amp; I-Challenge)
<ul>
<li>Abstract Submission Deadline : April 2, 2012 </li>
<li>Paper Submission Deadline : April 13, 2012 </li>
<li>Notification of Acceptance: May 7, 2012 </li>
<li>Camera-Ready Paper: June 4, 2012 </li>
</ul>
</li>
<li>Important Dates (I-Challenge)
<ul>
<li>Paper Submission Deadline : April 13, 2012 </li>
<li>Notification of Acceptance: May 7, 2012 </li>
<li>Camera-Ready Paper: June 4, 2012 </li>
</ul>
</li>
<li>Important Dates (Posters &amp; Demo Papers &amp; PhD Track)
<ul>
<li>Submission Deadline: May 21, 2012 </li>
<li>Notification of Acceptance: June 18, 2012 </li>
</ul>
</li>
</ul>
<p>     <span id="more-1003"></span>  </div>
</p></div>
<div id="outline-container-3" class="outline-2">
<h2 id="sec-3">Scope</h2>
<div class="outline-text-2" id="text-3">
<p>  <a href="http://www.i-semantics.at">I-SEMANTICS 2012</a> is the 8th International Conference within the I-SEMANTICS series. I-SEMANTICS 2012 brings together both researchers and practitioners in the areas of Semantic Technologies, Linked Data and the Semantic Web in order to showcase cutting edge research, demonstrators and applications for the Corporate and Social Semantic Web. I-SEMANTICS 2012 is proud to announce the new format “I-CHALLENGE”, which brings to you the <b>5th Linked Data Cup</b> (formerly Triplification Challenge), the Best Paper Award and the Best Poster Award.  As in the past years the I-SEMANTICS Conference will be complemented by <a href="http://www.i-know.at">I-KNOW</a>, the 12th International Conference on Knowledge Management, aiming to reflect the increasing importance and convergence of knowledge management and semantic systems.  </p>
</div></div>
<div id="outline-container-4" class="outline-2">
<h2 id="sec-4">Topics</h2>
<div class="outline-text-2" id="text-4">
<p>  As a conference aiming to bring together science and industry, I-SEMANTICS encourages scientific research and application-oriented contributions in the field of Semantic Technologies, Semantic Web and Linked Data. The topics of interest for this year&#8217;s conference include but are not limited to:  </p>
<ul>
<li>The Web of Data
<ul>
<li>(Large scale) triplification of existing (structured) data </li>
<li>Vocabularies, taxonomies and schemas for the Web of Data </li>
<li>Querying, searching and browsing over the Web of Data </li>
<li>Data integration and interlinking for the Web of Data </li>
<li>User interaction and innovative visualizations for the Web of Data </li>
<li>Languages, tools and methodologies for representing, managing and reasoning on the Web of Data </li>
<li>(Mashup) applications utilizing (large scale) Linked Data resources </li>
<li>Recommender systems making use of the Web of Data </li>
<li>Integrating microposts into the Web of Data </li>
<li>Linked Enterprise Data and (Open) Linked Government Data </li>
<li>Connecting the Web of Data with real world sensor data </li>
<li>Location-based services and mobile semantic applications </li>
</ul>
</li>
<li>Quality of Semantic Data on the Web
<ul>
<li>Provenance information for the Web of Data </li>
<li>Large scale ontology inspection and repair </li>
<li>Co-reference detection and dataset reconciliation </li>
<li>Maintenance of Linked Data models </li>
<li>Trust, privacy and security in Semantic Web applications </li>
</ul>
</li>
<li>Corporate Semantic Web
<ul>
<li>Corporate thesauri, business vocabularies, ontologies and rules </li>
<li>Semantic business, e-commerce and m-commerce systems </li>
<li>Semantic procurement for enterprises and governments </li>
<li>Semantics, pragmatics and semiotics in organizations </li>
<li>Enterprise trust and reputation management         </li>
</ul>
</li>
<li>Semantic Content Engineering
<ul>
<li>Collaborative ontology engineering </li>
<li>Ontology modularity, alignment and merging </li>
<li>Ontology design patterns and life cycle management </li>
<li>Ontology learning and knowledge acquisition  </li>
<li>Quality criteria for collaboratively generated semantic content </li>
<li>Semantic annotation and tagging </li>
<li>Making sense of microposts </li>
<li>Semantic content management systems </li>
</ul>
</li>
<li>Semantic Multimedia
<ul>
<li>Semantic-driven multimedia applications </li>
<li>Multimedia ontologies and infrastructures </li>
<li>Content-based semantic multimedia analysis and data mining </li>
<li>Semantic-driven multimedia indexing and retrieval </li>
<li>Named entity recognition and disambiguation in multimedia documents </li>
<li>Human-computer interfaces for multimedia data access </li>
<li>Smart visualization and browsing of multimedia documents </li>
<li>User-generated semantic metadata for multimedia documents </li>
</ul>
</li>
<li>Studies, Metrics &amp; Benchmarks
<ul>
<li>Case studies of and benchmarks in semantic systems usage </li>
<li>Evaluation perspectives, methods and Semantic Web research methodologies </li>
<li>Technology assessment, acceptance/media choice theories </li>
<li>Usability and user interaction with semantic technologies </li>
<li>Applications with clear lessons learned or evaluations </li>
</ul>
</li>
<li>(Linked) Data Ecosystems &amp; Markets
<ul>
<li>Economic foundations of data assets, markets and data crowd sourcing </li>
<li>Business and governance models for data commerce </li>
<li>Production principles and measures of data creation, curation and utilization </li>
<li>Business models and economic impacts of Linked (Enterprise) Data and/or large scale semantic systems </li>
<li>Case studies for sector-specific (Linked) Data strategies </li>
</ul>
</li>
</ul></div>
</p></div>
<div id="outline-container-5" class="outline-2">
<h2 id="sec-5">I-SEMANTICS Submission Information</h2>
<div class="outline-text-2" id="text-5">
<p>  All accepted full papers and short papers of I-SEMANTICS 2012 will be published in the digital library of the ACM ICP series. Please note: Poster and Demo papers are planned to be published within CEUR-WS. Optional publication of PhD papers in CEUR-WS is under discussion. </p>
</p></div>
<div id="outline-container-5-1" class="outline-3">
<h3 id="sec-5-1">Research/Application Papers</h3>
<div class="outline-text-3" id="text-5-1">
<p>   Submissions must be original and must not have been submitted for publication elsewhere. Papers should follow <a href="http://www.acm.org/sigs/publications/proceedings-templates">the ACM ICPS guidelines for formatting</a> and must be submitted via the online submission system available at the conference website as PDF documents (other formats will not be accepted). For the camera-ready version, we will also need the source files (Latex, Word Perfect, Word). The publication will be available under the following ISBN: 978-1-4503-1112-0 </p>
<p>    Research papers report on novel research and/or applications relevant to the topics of the conference. Submissions must be original and must not have been submitted for publication elsewhere. The number of pages of research papers is limited to 8 pages for full papers and 4 pages for short papers including references and an optional appendix. </p>
</div></div>
<div id="outline-container-5-2" class="outline-3">
<h3 id="sec-5-2">Important Dates (Research &amp; Application Papers)</h3>
<div class="outline-text-3" id="text-5-2">
<ul>
<li>Abstract Submission Deadline (strict): April 2, 2012 </li>
<li>Paper Submission Deadline : April 13, 2012 </li>
<li>Notification of Acceptance: May 7, 2012 </li>
<li>Camera-Ready Paper: June 4, 2012 </li>
</ul></div>
</p></div>
<div id="outline-container-5-3" class="outline-3">
<h3 id="sec-5-3">Posters, Demos &amp; PhD Track</h3>
<div class="outline-text-3" id="text-5-3">
<p>   The conference also particularly welcomes the submission of posters, demonstrations, and PhD track submissions. The Posters, Demonstrations &amp; PhD Track complements the Research Paper Track and offers an opportunity for presenting late-breaking research results, ongoing research projects, and speculative or innovative work in progress. </p>
<p>    The informal setting of the Posters, Demonstrations &amp; PhD Track encourages presenters and participants to engage in discussions about the presented work. Such discussions can be invaluable inputs for the future work of the presenters, while offering participants an effective way to broaden their knowledge of the emerging research trends and to network with other researchers. Poster and demo submissions should consist of a 2-3 page description that allows us to judge the quality of the presentation. Submissions to this track must be in the <a href="http://www.springer.com/computer/lncs/lncs+authors">Springer LNCS format</a>, i.e. please do NOT use the ACM template here. </p>
<p>    The objective of the PhD Track is to provide doctoral students with a forum to present and discuss their research projects with experienced researchers (“mentors”) and fellow students. It addresses PhD students at an early stage of their doctoral studies, who want to receive feedback from internationally recognized researchers. Ideally, participants will have a well-defined problem statement and precise questions to discuss with their mentor. Applicants should be PhD students (from any country), conducting ongoing research in the areas of semantic technologies, Linked Data and the Semantic Web.  </p>
<p>    PhD track submissions should consist of up to 1.500 words and contain the following:  </p>
<ul>
<li>Name, affiliation and contact details of the PhD student  </li>
<li>Name(s) of the supervisor(s) </li>
<li>Summary of the research project, including:  </li>
<li>Problem: research questions, motivation, state of the art and relevance for the fields of Semantic Technologies, Linked Data and Semantic Web </li>
<li>Approach: (planned) approach and methodology </li>
<li>Current status / timeline: current status of the work and any results that have already been reached plus outlook to future work </li>
<li>A list of key questions that the PhD student wants to discuss during the event </li>
</ul>
<p>    Submissions will be reviewed by experienced researchers; each submission will receive detailed feedback. The program committee will select participants who will give a short presentation about their research project during the I-SEMANTICS PhD Track. Main focus of the event is on discussion, support and solving open questions. </p>
</div></div>
<div id="outline-container-5-4" class="outline-3">
<h3 id="sec-5-4">Important Dates (Posters &amp; Demo Papers &amp; PhD Track)</h3>
<div class="outline-text-3" id="text-5-4">
<ul>
<li>Submission Deadline: May 21, 2012 </li>
<li>Notification of Acceptance: June 18, 2012 </li>
</ul></div>
</p></div>
</p></div>
<div id="outline-container-6" class="outline-2">
<h2 id="sec-6">I-CHALLENGE</h2>
<div class="outline-text-2" id="text-6">
<p>  For the first time in 2012 we will bring to you the I-CHALLENGE, consisting of the Best Research/Application Paper Award, the Best Poster Award, the Best PhD Paper Award and the Linked Data Cup. While the best paper will be selected by the program committee, the Best Poster will be voted by the conference audience via online voting. (Please expect more details for all the Best Paper Awards in the weeks to come.) Information on the Linked Data Cup can be found below: </p>
</p></div>
<div id="outline-container-6-1" class="outline-3">
<h3 id="sec-6-1">Linked Data Cup 2012</h3>
<div class="outline-text-3" id="text-6-1">
<p>   The yearly organised Linked Data Cup (formerly Triplification Challenge) awards prizes to the most promising innovation involving linked data. Four different technological topics are addressed: triplification, interlinking, cleansing, and application mash-ups. The Linked Data Cup invites scientists and practitioners to submit novel and innovative (5 star) linked data sets and applications built on linked data technology. </p>
<p>   Although more and more data is triplified and published as RDF and linked data, the question arises how to evaluate the usefulness of such approaches. The Linked Data Cup therefore requires all submissions to include a concrete use case and problem statement alongside a solution (triplified data set, interlinking/cleansing approach, linked data application) that showcases the usefulness of linked data. Submissions that can provide measurable benefits of employing linked data over traditional methods are preferred. </p>
<p>   Note that the call is not limited to any domain or target group. We accept submissions ranging from value-added business intelligence use cases to scientific networks to the longest tail of information domains. The only strict requirement is that the employment of linked data is very well motivated and also justified (i.e. we rank approaches higher that provide solutions, which could not have been realised without linked data, even if they lack technical or scientific brilliance). </p>
</p></div>
</p></div>
<div id="outline-container-6-2" class="outline-3">
<h3 id="sec-6-2">Evaluation Criteria</h3>
<div class="outline-text-3" id="text-6-2">
<p>   The submissions will be initially evaluated with a well-known five star ranking system. Furthermore, entries will be assessed according to the extent to which they </p>
<ol>
<li>motivate the relevancy of their use case for their respective domain; </li>
<li>justify the adequacy of linked data technologies for their solution; </li>
<li>demonstrate that all alternatives to linked data would have resulted in an inferior solution; </li>
<li>provide an evaluation that can measure the benefits of linked data </li>
</ol></div>
</p></div>
<div id="outline-container-6-3" class="outline-3">
<h3 id="sec-6-3">Topics</h3>
<div class="outline-text-3" id="text-6-3">
<p>   Ideas for topics include (but are not limited to): </p>
<ul>
<li>Improving traditional approaches with help of linked data </li>
<li>Linked data use in science and education </li>
<li>Linked data supported multimedia applications </li>
<li>Linked data in the open source context </li>
<li>Web annotation </li>
<li>Generic applications </li>
<li>Internationalization of linked data  </li>
<li>Visualization of linked data </li>
<li>Linked government data </li>
<li>Business models based on linked data </li>
<li>Recommender systems supported by linked data </li>
<li>Integrating microposts with linked data </li>
<li>Distributed social web based on linked data </li>
<li>Linked data sensor networks </li>
</ul></div>
</p></div>
<div id="outline-container-6-4" class="outline-3">
<h3 id="sec-6-4">Submission and Reviewing</h3>
<div class="outline-text-3" id="text-6-4">
<p>   Submissions to the Linked Data Cup will be reviewed by members of the Linked Data Cup Board and invited experts from the Linked Data community.    Submissions should consist of 4 pages and must be original and must not have been submitted for publication elsewhere. Papers should follow the ACM ICPS guidelines for formatting as accepted submissions will be published in the I-SEMANTICS 2012 proceedings in the digital library of the ACM ICP series. Please read the submission page for detailed information on how to submit. </p>
</p></div>
</p></div>
<div id="outline-container-6-5" class="outline-3">
<h3 id="sec-6-5">Important Dates (Linked Data Cup)</h3>
<div class="outline-text-3" id="text-6-5">
<ul>
<li>Paper Submission Deadline : April 13, 2012 </li>
<li>Notification of Acceptance: May 7, 2012 </li>
<li>Camera-Ready Paper: June 4, 2012 </li>
</ul></div>
</p></div>
</p></div>
<div id="outline-container-7" class="outline-2">
<h2 id="sec-7">I-SEMANTICS Committee</h2>
<div class="outline-text-2" id="text-7">
<ul>
<li>Scientific Chair
<ul>
<li>Harald Sack (Hasso-Plattner-Institute for IT Systems Engineering) </li>
</ul>
</li>
<li>Program Chairs
<ul>
<li>H. Sofia Pinto (Technical University of Lisbon) </li>
<li>Valentina Presutti (Institute for Cognitive Science and Technology, Rome)  </li>
</ul>
</li>
<li>Track Chairs
<ul>
<li>I-CHALLENGE:
<ul>
<li>Sebastian Hellmann (University of Leipzig)  </li>
<li>Jörg Waitelonis (Hasso-Plattner-Institute for IT Systems Engineering) </li>
</ul>
</li>
<li>PhD Track Chair: Katrin Weller (Heinrich Heine University Düsseldorf)  </li>
<li>Poster Chair: Steffen Lohmann (University of Stuttgart) </li>
</ul>
</li>
<li>Industry Chair
<ul>
<li>Andreas Blumauer (Semantic Web Company) </li>
</ul>
</li>
<li>Conference Chair
<ul>
<li>Tassilo Pellegrini (Semantic Web Company / University of Applied Sciences St. Pölten, Austria) </li>
</ul>
</li>
<li><a href="http://i-semantics.tugraz.at/scientific-track/program-committee/">Program Committee</a> </li>
</ul></div>
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2012/01/30/i-semantics-conference-and-linked-data-cup/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SePublica@ESWC Workshop (May 27 or 28, Crete)</title>
		<link>http://kwarc.info/blog/2012/01/30/sepublicaeswc-workshop-may-27-or-28-crete/</link>
		<comments>http://kwarc.info/blog/2012/01/30/sepublicaeswc-workshop-may-27-or-28-crete/#comments</comments>
		<pubDate>Sun, 29 Jan 2012 23:00:00 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[cfp]]></category>
		<category><![CDATA[eswc2012]]></category>
		<category><![CDATA[sepublica]]></category>
		<category><![CDATA[workshop]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=1000</guid>
		<description><![CDATA[an ESWC 2012 Workshop. May 27 or 28, Hersonissos, Greece. http://sepublica.mywikipaper.org/ At SePublica we want to explore the future of scholarly communication and scientific publishing. As we are going through a transition between print media and Web media, SePublica aims to provide researchers with a venue in which this future can be shaped. Important Dates [...]]]></description>
			<content:encoded><![CDATA[<p>an <a href="http://www.eswc2012.org">ESWC 2012</a> Workshop.  May 27 or 28, Hersonissos, Greece. </p>
<p> <a href="http://sepublica.mywikipaper.org/">http://sepublica.mywikipaper.org/</a> </p>
<p> At SePublica we want to explore the <b>future of scholarly communication and scientific publishing</b>. As we are going through a transition between print media and Web media, SePublica aims to provide researchers with a venue in which this future can be shaped.  </p>
<div id="outline-container-1" class="outline-2">
<h2 id="sec-1">Important Dates</h2>
<div class="outline-text-2" id="text-1">
<ul>
<li>submission deadline: Feb 29 </li>
<li>acceptance notification: April 1 </li>
<li>camera ready: April 15 </li>
</ul>
<p>     <span id="more-1000"></span>  </div>
</p></div>
<div id="outline-container-2" class="outline-2">
<h2 id="sec-2">The Future of Scholarly Communication and Scientific Publishing</h2>
<div class="outline-text-2" id="text-2">
<p>  Consider research publications: Data sets and code are essential elements of data intensive research, but these are absent when the research is recorded and preserved by way of a scholarly journal article. Or consider news reports: Governments increasingly make public sector information available on the Web, and reporters use it, but news reports very rarely contain fine-grained links to such data sources.  At SePublica we will discuss and present new ways of publishing, sharing, linking, and analyzing such scientific resources as well as reasoning over the data to discover new links  and scientific insights.  </p>
</p></div>
</p></div>
<div id="outline-container-3" class="outline-2">
<h2 id="sec-3">Workshop Format</h2>
<div class="outline-text-2" id="text-3">
<p>  We are planning to have a full day workshop with two main sessions. During the first part of the workshop accepted papers will be presented; the second part of the workshop will address by means of focus groups two main questions, namely “what do we want the future of scholarly communication to be?”  and “how could data be preserved and delivered in an interactive manner over scholarly communications?”. These focus groups will be followed by a panel discussion. As an outcome of these activities we will have a communique that will be the editorial for the workshop proceedings,  </p>
</p></div>
</p></div>
<div id="outline-container-4" class="outline-2">
<h2 id="sec-4">Issues to be addressed</h2>
<div class="outline-text-2" id="text-4">
<ul>
<li>Representation:
<ul>
<li>Formal representations of scientific data; ontologies for scientific information </li>
<li>What ontologies do we need for representing structural elements in a document? </li>
<li>How can we capture the semantics of rhetorical structures in scholarly communication, and of  hypotheses and scientific evidence? </li>
<li>Integration of quantitative and qualitative scientific information </li>
<li>How could RDF(a) and ontologies be used to represent the knowledge encoded in scientific documents and in general-interest media publications? </li>
<li>Connecting scientific publications with underlying research data sets </li>
</ul>
</li>
<li>Technological Foundations:
<ul>
<li>Ontology-based visualization of scientific data </li>
<li> </li>
<li>Provenance, quality, privacy and trust of scientific information </li>
<li>Linked Data for dissemination and archiving of research results, for collaboration and research networks, and for research assessment </li>
<li> </li>
<li>How could we realize a paper with an API?  How could we have a paper as a database, as a knowledge base? </li>
<li>How is the paper an interface, gateway, to the web of data? How could such and interface be delivered in a contextual manner? </li>
</ul>
</li>
<li>Applications and Use Cases:
<ul>
<li>Case studies on linked science, i.e., astronomy, biology, environmental and socio-economic impacts of global warming, statistics, environmental monitoring, cultural heritage, etc. </li>
<li>Barriers to the acceptance of linked science solutions and strategies to address these </li>
<li>Legal, ethical and economic aspects of Linked Data in science </li>
</ul>
</li>
</ul></div>
</p></div>
<div id="outline-container-5" class="outline-2">
<h2 id="sec-5">Submission</h2>
<div class="outline-text-2" id="text-5">
<p>  <a href="https://www.easychair.org/conferences/?conf=sepublica2012">submission system</a> </p>
<ul>
<li>Research papers are limited to 12 pages and position papers to 5 pages. </li>
<li>For system/demo descriptions, a paper of minimum 2 pages, maximum 5 pages should be submitted. </li>
<li>Late-breaking news should be one page maximum. </li>
</ul>
<p>  All papers and system descriptions should be formatted according to the LNCS format.  For submissions that are not in the LNCS PDF format, 400 words count as one page. Submissions that exceed the page limit will be rejected without review. </p>
<p>   Depending on the number and quality of submissions, authors might be invited to present their papers during a poster session. The author list does not need to be anonymized, as we do not have a double-blind review process in place. </p>
<p>   Submissions will be peer reviewed by three independent reviewers; late-breaking news get a light review w.r.t. their relevance by two reviewers. Accepted papers have to be presented at the workshop (requires registering for the ESWC conference and the workshop). </p>
</p></div>
</p></div>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2012/01/30/sepublicaeswc-workshop-may-27-or-28-crete/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SePublica@ESWC Workshop on Semantic Publication (May 30, Crete), LNCS Post-proceedings, Best Paper Award by Elsevier</title>
		<link>http://kwarc.info/blog/2011/01/16/sepublica-workshop-on-semantic-publication-may-29-or-30-crete/</link>
		<comments>http://kwarc.info/blog/2011/01/16/sepublica-workshop-on-semantic-publication-may-29-or-30-crete/#comments</comments>
		<pubDate>Sun, 16 Jan 2011 12:58:48 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[ESWC 2011]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=962</guid>
		<description><![CDATA[I am a chair of the following workshop (and Michael is on the PC), which is closely related to KWARC&#8217;s research interests (specifically KWARC-relevant topics highlighted below): 1st International Workshop on Semantic Publication (SePublica 2011) at the 8th Extended Semantic Web Conference (ESWC 2011) May 30th, Hersonissos, Crete, Greece Keynote by Steve Pettifer, Manchester University, [...]]]></description>
			<content:encoded><![CDATA[<p>I am a chair of the following workshop (and Michael is on the PC), which is closely related to KWARC&#8217;s research interests (specifically KWARC-relevant topics highlighted below):</p>
<p><a href="http://sepublica.mywikipaper.org">1st International Workshop on Semantic Publication (SePublica 2011)<br />
</a>at the <a href="http://www.eswc2011.org">8th Extended Semantic Web Conference (ESWC 2011)</a><br />
May 30th, Hersonissos, Crete, Greece</p>
<p>Keynote by Steve Pettifer, Manchester University, UK: “Utopia Documents and The Semantic Biochemical Journal experiment”</p>
<p><strong>SUBMISSION DEADLINE (extended) March 4</strong></p>
<p><strong>Highlights:</strong></p>
<ul>
<li> <a href="http://sepublica.mywikipaper.org/drupal/18">Best Paper Award sponsored by Elsevier</a>: US$ 750+250 for the most innovative and feasible proposal concerning semantic publishing</li>
<li> <a href="http://sepublica.mywikipaper.org/drupal/3#lncs">Springer LNCS post-proceedings</a>: A selection of revised versions of the best submissions will be published in the “ESWC 2011 Workshop Highlights” LNCS volume.</li>
</ul>
<p>The MISSION of the SePublica workshop is to bring together researchers and practitioners dealing with different aspects of Semantic Technologies in the Publishing Industry. How is the Semantic Web impacting the publishing industry? How is our experience of publications changing because of Semantic Web technologies being applied to the publishing industry?</p>
<p><span id="more-962"></span></p>
<p>The CHALLENGE of the Semantic Web is to allow the Web to move from a dissemination platform to an interactive platform for networked information. The Semantic Web promises to “fundamentally change our experience of the Web”. In spite of improvements in the distribution, accessibility and retrieval of information, little has changed in the publishing industry so far. The Web has succeeded as a <em>dissemination platform for scientific</em> and non-scientific <em>papers</em>, news, and <em>communication </em>in general; however, most of that information remains locked up in discrete documents, which are poorly interconnected to one another and to the Web. <em>The connectivity tissues provided by RDF technology and the Social Web have barely made an impact on scientific communication</em> <em>nor on ebook publishing, neither on the format of publications, nor on repositories and digital libraries. The worst problem is in accessing and reusing the computable data which the literature represents and describes.</em></p>
<ul>
<li><em>Consider research publications: Data sets and code are essential elements of data intensive research, but these are absent when the research is recorded and preserved in perpetuity by way of a scholarly journal article.</em></li>
<li>Or consider news reports: Governments increasingly make public sector information available on the Web, and reporters use it, but news reports very rarely contain fine-grained links to such data sources.</li>
</ul>
<h2>QUESTIONS AND TOPICS OF INTEREST</h2>
<ul>
<li><em>What does a network of truly interconnected papers look like? How could interoperability across documents be enabled?</em></li>
<li><em>How could concept-centric social networks emerge?</em></li>
<li><em>Are blogs and wikis new means for scholarly communication?</em></li>
<li>What lessons can be learned from humanities and social science publishers (i.e. going beyond scientific publishing towards scholarly publishing)?</li>
<li><em>How could we move beyond the PDF? </em>How can we embed and link semantics in EPUB and other e-book formats?</li>
<li><em>How are digital libraries related to semantic e-science? What is the relationship between a paper and its digital library?</em></li>
<li><em>How could we realize a paper with an API? How could we have a paper as a database, as a knowledge base?</em></li>
<li><em>How is the paper an interface, gateway, to the web of data? How could such and interface be delivered in a contextual manner?</em></li>
<li><em>How could RDF(a) and ontologies be used to represent the knowledge encoded in scientific documents</em> and in general-interest media publications?</li>
<li><em>What ontologies do we need for representing structural elements in a document?</em></li>
<li><em>How can we capture the semantics of rhetorical structures in scholarly communication, and of hypotheses and scientific evidence?</em></li>
</ul>
<h2>AUDIENCE</h2>
<p>&nbsp;</p>
<ul>
<li>researchers from diverse backgrounds such as argumentative structures, scholarly communication, multi-modality in publications, digital libraries, semantics in publications, and ontology engineers.</li>
<li>practitioners active in the publishing industry, repositories of experimental information and document standards.</li>
</ul>
<h2>IMPORTANT DATES</h2>
<p>Paper/Demo Submission Deadline (extended): March 4, 23:59 Hawaii Time<br />
Acceptance Notification: April 1<br />
Camera Ready Version: April 15<br />
SePublica Workshop: May 30</p>
<h2>SUBMISSION AND PROCEEDINGS</h2>
<p>Research papers are limited to 12 pages and position papers to 5 pages. For system descriptions, a 5 page paper should be submitted. All papers and system descriptions should be formatted according to <a href="http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0">the LNCS format</a>.</p>
<p><em>We encourage the submission of semantic documents. LaTeX documents in the LNCS format can, e.g., be annotated using</em> <a href="http://salt.semanticauthoring.org">SALT</a> or <em><a href="http://trac.kwarc.info/sTeX/">sTeX</a></em>. <em>We also invite submissions in XHTML+RDFa</em> or in the format or YOUR semantic publishing tool.</p>
<p>However, to ensure a fair review procedure, authors must additionally export them to PDF.  For submissions that are not in the LNCS PDF format, 400 words count as one page. Submissions that exceed the page limit will be rejected without review.</p>
<p>Depending on the number and quality of submissions, authors might be invited to present their papers during a poster session.</p>
<p>Please <a href="http://www.easychair.org/conferences/?conf=sepublica2011">submit your paper via EasyChair</a></p>
<p>The author list does not need to be anonymized, as we do not have a double-blind review process in place.</p>
<p>Submissions will be peer reviewed by three independent reviewers. Accepted papers have to be presented at the workshop (requires registering for <a href="http://www.eswc2011.org">the ESWC conference</a> and the workshop) and will be included in the workshop proceedings that are published online at <a href="http://ceur-ws.org">CEUR-WS</a>.</p>
<h2>PROGRAM COMMITTEE</h2>
<ul>
<li><a href="http://www.unbsj.ca/sase/csas/facultyPage.php?member=Dr.%20Christopher%20Baker">Christopher Baker</a>, University of New Brunswick, Saint John, Canada</li>
<li><a href="http://www.hcklab.org/people/pc/">Paolo Ciccarese</a>, Harvard Medical School, USA (SWAN)</li>
<li><a href="http://www.mghmind.org/Faculty/clark_tim.html">Tim Clark</a>, Harvard Medical School, USA (SWAN)</li>
<li><a href="http://www.dia.fi.upm.es/index.php?page=oscar-corcho&amp;hl=en_US">Oscar Corcho</a>, Politecnica de Madrid, Spain</li>
<li><em><a href="http://metameso.org/~joe/">Joe Corneli</a>, Open University, UK (PlanetMath)</em></li>
<li><em><a href="http://openspring.net/">Stéphane Corlosquet</a>, Massachusetts General Hospital, USA (Drupal)</em></li>
<li><em><a href="http://www.ida.liu.se/~her/">Henrik Eriksson</a>, Linköping University, Sweden</em></li>
<li><a href="http://i9606.blogspot.com/">Benjamin Good</a>, Genomic Institute, Novartis, USA</li>
<li><em><a href="http://kwarc.info/kohlhase/">Michael Kohlhase</a>, Jacobs University, Germany</em></li>
<li><a href="http://knowledgehives.com/lang-en/team">Sebastian Kruk</a>, knowledgehives.com, Poland</li>
<li><a href="http://www.salzburgresearch.at/person/kurz-thomas/">Thomas Kurz</a>, Salzburg Research, Austria</li>
<li><a href="http://aig.cs.man.ac.uk/people/srp/">Steve Pettifer</a>, Manchester University, UK</li>
<li><a href="http://samwald.info/">Matthias Samwald</a>, Information Retrieval Facility, Austria</li>
<li><a href="http://jodischneider.com/">Jodi Schneider</a>, DERI, NUI Galway, Ireland</li>
<li><a href="http://www.dsoergel.com/">Dagobert Soergel</a>, University of Maryland, USA</li>
<li><a href="http://www.cs.man.ac.uk/~stevensr/">Robert Stevens</a>, Manchester University, UK</li>
</ul>
<h2>ORGANIZING COMMITTEE</h2>
<ul>
<li><em><a href="http://www.alexandergarcia.name/">Alexander García Castro</a>, University of Bremen, Germany</em></li>
<li><em><a href="http://kwarc.info/clange/">Christoph Lange</a>, Jacobs University Bremen, Germany</em></li>
<li><em><a href="http://elsatglabs.com/labs/anita/">Anita de Waard</a>, Elsevier, USA/Netherlands</em></li>
<li><a href="http://evansandhaus.com/">Evan Sandhaus</a>, New York Times, USA</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2011/01/16/sepublica-workshop-on-semantic-publication-may-29-or-30-crete/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TEI Guidelines mention MathML, OpenMath, and OMDoc</title>
		<link>http://kwarc.info/blog/2010/07/31/tei-guidelines-mention-mathml-openmath-and-omdoc/</link>
		<comments>http://kwarc.info/blog/2010/07/31/tei-guidelines-mention-mathml-openmath-and-omdoc/#comments</comments>
		<pubDate>Sat, 31 Jul 2010 13:06:35 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[mathml]]></category>
		<category><![CDATA[OMDoc]]></category>
		<category><![CDATA[OpenMath]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[formulae]]></category>
		<category><![CDATA[formulæ]]></category>
		<category><![CDATA[markup]]></category>
		<category><![CDATA[tei]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=959</guid>
		<description><![CDATA[Someone in the humanities must be interested in OMDoc. I was really surprised to find a reference to OMDoc in the section “Formulæ and Mathematical Expressions” guidelines (a.k.a. specification) for TEI. TEI (Text Encoding Initiative) is the standard semantic markup language for humanities, social sciences and linguistics, much like DocBook for technical manuals. All that [...]]]></description>
			<content:encoded><![CDATA[<p>Someone in the humanities must be interested in <a href="http://www.omdoc.org">OMDoc</a>. I was really surprised to find a reference to OMDoc in <a href="http://www.tei-c.org/release/doc/tei-p5-doc/en/html/FT.html#FTFOR">the section “Formulæ and Mathematical Expressions”</a> guidelines (a.k.a. specification) for TEI. <a href="http://www.tei-c.org">TEI (Text Encoding Initiative)</a> is the standard semantic markup language for humanities, social sciences and linguistics, much like <a href="http://www.docbook.org">DocBook</a> for technical manuals. All that TEI itself has is an element <em>&lt;formula notation=&#8221;&#8230;&#8221;/&gt;</em>, where <em>notation </em>refers to the language in which the formula is represented. But the guidelines refer to some mathematical markup languages, from which the document author is asked to “make an informed choice”:</p>
<ul>
<li>TeX – the obvious candidate, also used in some examples</li>
<li>MathML – the obvious candidate when XML is desired.  They give one Presentation MathML example but also mention Content MathML.</li>
<li>OpenMath – much less expected. Nice to see that here. Oh the other hand, the links to the OpenMath standard are outdated. I should probably report that.</li>
<li>OMDoc – I didn&#8217;t expect that at all.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2010/07/31/tei-guidelines-mention-mathml-openmath-and-omdoc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DITA/OMDoc Compatibility (or topic-based writing in OMDoc)</title>
		<link>http://kwarc.info/blog/2010/04/09/ditaomdoc-compatibility-or-topic-based-writing-in-omdoc/</link>
		<comments>http://kwarc.info/blog/2010/04/09/ditaomdoc-compatibility-or-topic-based-writing-in-omdoc/#comments</comments>
		<pubDate>Fri, 09 Apr 2010 13:42:33 +0000</pubDate>
		<dc:creator>kohlhase</dc:creator>
				<category><![CDATA[kohlhase]]></category>
		<category><![CDATA[MiKo's blog]]></category>
		<category><![CDATA[OMDoc]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[Writing]]></category>
		<category><![CDATA[DITA]]></category>
		<category><![CDATA[ontology]]></category>
		<category><![CDATA[rdfa]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=943</guid>
		<description><![CDATA[When I was at the WritersUA conference before easter, the compatibility (and transformation) between DITA (as a topic-centered format) and DocBook (as a narrative one) was one of the topics with wider interest. In OMDoc we have always maintained that we can follow both the topic-centered approach (which is quite natural for mathematical texts and [...]]]></description>
			<content:encoded><![CDATA[<p>When I was at the WritersUA conference before easter, the compatibility (and transformation) between DITA (as a topic-centered format) and DocBook (as a narrative one) was one of the topics with wider interest. In OMDoc we have always maintained that we can follow both the topic-centered approach (which is quite natural for mathematical texts and indeed for wiki-based approaches like the one in SWiM) as well as the narrative one. So I got thinking how we would really do the topic-centered approach in OMDoc.</p>
<p>When I was reading Christine Müller&#8217;s Ph.D. thesis that looked a the integration of topic-based and narrative writing styles, I noticed that she says that OMDoc does not have support for topic-style writing. I think that this is wrong. Taking her example (slightly simplified)</p>
<pre>&lt;concept id="A.dita"&gt;
 &lt;title&gt;Natural Numbers&lt;/title&gt;
 &lt;conbody&gt;
 &lt;p&gt;The set of &lt;term&gt;natural numbers&lt;/term&gt;
 defined &lt;cite&gt;here&lt;/cite&gt; or in &lt;xref href="nat.dita#nat1"/&gt;.
 &lt;/p&gt;
 &lt;para conref="topic/p2"/&gt;
 &lt;/conbody&gt;
 &lt;related-link&gt;http://example.com/nats.html&lt;/related-link&gt;
&lt;/concept&gt;</pre>
<p>it is obviously directly  expressible in OMDoc as</p>
<pre>&lt;omdoc&gt;
 &lt;omgroup type="concept" xml:id="A.dita"&gt;
 &lt;metadata&gt;
 &lt;dc:title&gt;Natural Numbers&lt;/dc:title&gt;
 &lt;link rel="dita:related-link" resource="http://example.com/nats.html"/&gt;
 &lt;/metadata&gt;
 &lt;omgroup type="conbody"&gt;
 &lt;omtext&amp;gt
 &lt;CMP&gt;
 &lt;p&gt;The set of &lt;phrase role="term"&gt;natural numbers&lt;/phrase&gt;
 defined &lt;cite&gt;here&lt;/cite&gt; or in &lt;ref type="cite" href="nat.dita#nat1"/&gt;.
 &lt;/p&gt;
 &lt;/CMP&gt;
 &lt;/omtext&gt;
 &lt;ref href="topic/p2" type="include"/&gt;
 &lt;/omgroup&gt;
 &lt;/omgroup&gt;
&lt;/omdoc&gt;</pre>
<p>(again slightly simplified; I am leaving out the relevant namespace declarations). It should be directly obvious that we can define an OMDoc sublanguage that is isomorphic to DITA. Indeed I think that this is an exercise that would be worth doing. After all, there was a message from Bryce Nordgren  about opening oup a Math domain in DITA (see http://openmath.org/pipermail/om/2009-February/001203.html for details), which could use this isomorphism as a guiding light.</p>
<p>Of course DITA not only has topics, but also topic maps, let me again use an example from Christine&#8217;s thesis.</p>
<pre>&lt;map title="title"&gt;
 &lt;topichead navtitle="navi-title" audience="math"/&gt;
 &lt;topicref href="A.dita" collection-type="sequence"&gt;
 &lt;topicref href="A1.dita"/&gt;
 &lt;topicref href="A2.dita"/&gt;
 &lt;/topicref&gt;
 &lt;reltable&gt;
 &lt;relrow&gt;
 &lt;relcell&gt;A.dita&lt;/relcell&gt;
 &lt;relcell&gt;B.dita&lt;/relcell&gt;
 &lt;/relrow&gt;
 &lt;/reltable&gt;
&lt;/map&gt;</pre>
<p>The first part of this map is just what we have always thought of as a narrative structure in our NarCon approach in OMDoc. So we can directly represent it as something like</p>
<pre>&lt;omdoc&gt;
 &lt;metadata&gt;
 &lt;dc:title&gt;title&lt;/dc:title&gt;
 &lt;link rel="dita:audience" resource="something:math"/&gt;
 &lt;link rel="dita:navtitle" resource="navi-title"/&gt;
 &lt;/metadata&gt;
 &lt;omgroup xml:id="A.narrative" type="sequence"&gt;
 &lt;ref type="include" href="A1.omdoc"/&gt;
 &lt;ref type="include" href="A2.omdoc"/&gt;
 &lt;/omgroup&gt;
&lt;/omdoc&gt;</pre>
<p>I must confess that I do not really understand what the href on the top-level topicref means, so I have left it out. Note that I am only interested in the general compatibility of the formats and not the details of the translation, which will have to be worked out. That leaves us with the reltable, which (as far as I can understand it a way to specify cross-references that is a better alternative to &lt;related-links&gt;, since it is more portable and attached to DITA maps (which we can think of as discourse-level presentation of the content structure given by the graph of DITA topics). So I would just add the following metadata section to the &lt;omgroup&gt; element:</p>
<pre>&lt;metadata&gt;
 &lt;link rel="dita:related-link" resource="http://example.com/nats.html"/&gt;
&lt;/metadata&gt;</pre>
<p>OK, that ends our little comparison exercise. There are a couple of conclusions I would like to draw from this:</p>
<ol>
<li> OMDoc can do topic-oriented writing quite nicely</li>
<li> the OMDoc1.3-style metadata help significantly</li>
<li>rather than develop a DITA ontology (hinted at with the dita: namespace prefixes) we should develop ontologies that describe the various aspects of topic-based writing in generality and find the respective markup primitives. For instance dita:audience seems weird, there must be an ontology in the eLearning realm that already formalizes this.</li>
<li>The OMDoc-1.6 idea of leaving out the &lt;metadata&gt; element and freely intermixing the metadata &lt;link&gt;, &lt;resource&gt; and &lt;meta&gt; with the OMDoc content will make the translation much simpler and direct, e.g. for the &lt;reltable&gt; and &lt;related-link&gt; elements from DITA which are situated at the end in the original.</li>
</ol>
<p>OK, that is all I have to say at the moment, please give me feedback.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2010/04/09/ditaomdoc-compatibility-or-topic-based-writing-in-omdoc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microdata vs. RDFa – What does it mean to us?</title>
		<link>http://kwarc.info/blog/2009/10/28/microdata-vs-rdfa/</link>
		<comments>http://kwarc.info/blog/2009/10/28/microdata-vs-rdfa/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 14:58:48 +0000</pubDate>
		<dc:creator>Christoph</dc:creator>
				<category><![CDATA[clange]]></category>
		<category><![CDATA[JOMDoc]]></category>
		<category><![CDATA[Krextor]]></category>
		<category><![CDATA[OMDoc]]></category>
		<category><![CDATA[semantic documents]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://kwarc.info/blog/?p=919</guid>
		<description><![CDATA[Only today I became aware of microdata, the proposed way of embedding semantic annotations into HTML5. (Yes, they adopted the syntax that Michael also prefers for OMDoc, and which I personally hate, but I will get used to it.) Microdata are not to be confused with microformats, a poor man&#8217;s way of annotation that (ab)uses [...]]]></description>
			<content:encoded><![CDATA[<p>Only today I became aware of <a href="http://www.whatwg.org/specs/web-apps/current-work/multipage/microdata.html">microdata</a>, the proposed way of embedding semantic annotations into HTML5. (<a href="http://blog.whatwg.org/spelling-html5">Yes, they adopted the syntax</a> that Michael also prefers for OMDoc, and which I personally hate, but I will get used to it.) Microdata are not to be confused with <a href="http://microformats.org">microformats</a>, a poor man&#8217;s way of annotation that (ab)uses CSS classes and thus is compatible with HTML 4. Microdata are something like RDFa but</p>
<ol>
<li>are slightly easier to use for people who don&#8217;t understand XML namespaces
<ul>
<li>granted, RDFa&#8217;s excessive reliance on XML namespaces makes it hard to parse, and makes it unbearably complex to copy/paste a fragment, which is an important use case for HTML5</li>
</ul>
</li>
<li>allow for ad hoc pseudo-semantic markup when you do not use an ontology
<ul>
<li>What&#8217;s the point in annotating at all, then?</li>
</ul>
</li>
<li>compatible with the non-XML syntax of HTML5 (which should have been ditched IMHO, but, well, in the interest of reactionary users and software, <a href="http://www.smashingmagazine.com/2009/07/29/misunderstanding-markup-xhtml-2-comic-strip/">they decided differently</a>)</li>
</ol>
<p>The fight for the future of RDFa in HTML is going on, but what does that mean to KWARC? We have incorporated RDFa into <a href="http://omdoc.org">OMDoc</a> as <a href="https://svn.omdoc.org/repos/omdoc/trunk/doc/blue/foaf/note.pdf">a means of extending the metadata vocabularies</a>. RDFa, originally designed for XHTML, is prepared for being integrated into any XML language, including OMDoc. HTML5 microdata are an integral part of the HTML5 specification and would not work in other XML languages. OK, but we present OMDoc documents as HTML to make them human-readable. In this output, we want to preserve the semantics of the OMDoc markup, and for that we had always been thinking about using RDFa. (<a href="http://jomdoc.omdoc.org/ticket/266">We know exactly how to do it</a>, but just have not yet implemented that step, though.) We could use HTML5 microdata instead, but:</p>
<ol>
<li>RDFa has little software support so far, but microdata have none (beyond proofs of concept)</li>
<li>We generate XML-compliant HTML. <a href="http://www.w3.org/TR/html5-diff/#mathml-svg">The non-XML syntax of HTML5 supports embedded MathML</a>, but I doubt that it will support parallel <a href="http://www.openmath.org">OpenMath</a> markup, where elements from yet another namespace are embedded into the MathML formulae.</li>
<li>We <em>generate</em> HTML. The embedded annotations need not be authored manually, so they do not have to be easy to author.</li>
<li>We are interested in using well-defined ontologies to express semantics, so we don&#8217;t need ad hoc “semantic” markup.</li>
</ol>
<p>What do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2009/10/28/microdata-vs-rdfa/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Submitting content to OMBase and logging</title>
		<link>http://kwarc.info/blog/2008/03/24/submitting-content-to-ombase/</link>
		<comments>http://kwarc.info/blog/2008/03/24/submitting-content-to-ombase/#comments</comments>
		<pubDate>Mon, 24 Mar 2008 05:43:00 +0000</pubDate>
		<dc:creator>kohlhase</dc:creator>
				<category><![CDATA[kohlhase]]></category>
		<category><![CDATA[MKM]]></category>
		<category><![CDATA[OMBase]]></category>
		<category><![CDATA[OMDoc]]></category>
		<category><![CDATA[semantic documents]]></category>

		<guid isPermaLink="false">http://kwarc.info/kohlhase/blog/?p=20</guid>
		<description><![CDATA[While I was reading up on the REST papers in my last post, I stunbled upon the following best practice for making sure that material is only submitted once to a RESTful application. This is something we should adopt in OMBase as well, just to be safe. Another thing that we should think of in [...]]]></description>
			<content:encoded><![CDATA[<p>While I was reading up on the <a href="http://www.infoq.com/articles/tilkov-rest-doubts">REST papers</a> in my last post, I stunbled upon the following <a href="http://bitworking.org/news/201/RESTify-DayTrader">best practice</a> for making sure that material is only submitted once to a RESTful application. This is something we should adopt in <a href="http://kwarc.info/projects/ombase/">OMBase </a>as well, just to be safe.</p>
<p>Another thing that we should think of in this  arena is to enable some form of RESTful logging facility, so that users can find out what happened to the content. The technology that seems best suited for that seems to be <a href="http://cyber.law.harvard.edu/rss/rss.html">RSS</a> or <a href="http://www.ietf.org/rfc/rfc4287.txt">Atom Syndication</a> (probably the latter). The nice thing is that we could log all the changes to any URI we use in the system. I am not sure under which URL we would address the log, one idea is to just make use of the the mime type <code>application/atom+xml</code> just as for the <code>xhtml</code> presentation as suggested in my <a href="http://kwarc.info/kohlhase/blog/?p=19">last post</a> that would at least alleviate the choice of URL.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2008/03/24/submitting-content-to-ombase/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Integrating Presentation into OMBase</title>
		<link>http://kwarc.info/blog/2008/03/24/integrating-presentation-into-ombase/</link>
		<comments>http://kwarc.info/blog/2008/03/24/integrating-presentation-into-ombase/#comments</comments>
		<pubDate>Mon, 24 Mar 2008 05:24:51 +0000</pubDate>
		<dc:creator>kohlhase</dc:creator>
				<category><![CDATA[kohlhase]]></category>
		<category><![CDATA[mathml]]></category>
		<category><![CDATA[OMBase]]></category>
		<category><![CDATA[OMDoc]]></category>
		<category><![CDATA[OpenMath]]></category>
		<category><![CDATA[semantic documents]]></category>

		<guid isPermaLink="false">http://kwarc.info/kohlhase/blog/?p=19</guid>
		<description><![CDATA[I have just been reading up on REST again, since I found a very palatable pair of articles (REST intro, and  practices). This got me thinking about the state of OMBase, and the integration of our presentation pipeline into the OMBase interface. It is RESTful, since we have MMT addressing via URIs implemented. You just [...]]]></description>
			<content:encoded><![CDATA[<p>I have just been reading up on REST again, since I found a very palatable pair of articles<a href="http://www.infoq.com/articles/rest-introduction"> (REST intro</a>, and  <a href="http://www.infoq.com/articles/tilkov-rest-doubts">practices</a>). This got me thinking about the state of <a href="http://kwarc.info/projects/ombase/">OMBase</a>, and the integration of our <a href="http://kwarc.info/projects/mmlkit/">presentation pipeline</a> into the OMBase interface. It is RESTful, since we have MMT addressing via URIs implemented. You just use a GET to retrieve them.</p>
<p>What I have talked with <a href="http://kwarc.info/frabe">Florian</a> about, but maybe not with the OMBase team, is how to integrate presentation. That should be very simple from the interface point of view: we just take the same URLs, but a different HTTP header.</p>
<p><code>GET /arith1/lcm<br />
Host: cds.omdoc.org<br />
Accept: application/omdoc+xml</code></p>
<p>gives you the OMDoc file and</p>
<p><code>GET /arith1/lcm<br />
Host: cds.omdoc.org<br />
Accept: application/xhtml+xml</code></p>
<p>gives you the presented version (with the standard options). Now, we have written a paper about presentation and submitted it to MKM and <a href="http://kwarc.info/cmueller">Christine</a> has spent a lot of ingenuity on defining user options to the presentation process.This should be easy to integrate with the URI query interface:</p>
<p><code>GET /arith1/lcm?ext=foo.ntn∫=lang:ntn;style:physics<br />
Host: cds.omdoc.org<br />
Accept: application/xhtml+xml</code></p>
<p>That should do the trick.</p>
<p><code></code></p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2008/03/24/integrating-presentation-into-ombase/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Ontology repair in Physics</title>
		<link>http://kwarc.info/blog/2008/02/21/ontology-repair-in-physics/</link>
		<comments>http://kwarc.info/blog/2008/02/21/ontology-repair-in-physics/#comments</comments>
		<pubDate>Thu, 21 Feb 2008 10:46:04 +0000</pubDate>
		<dc:creator>kohlhase</dc:creator>
				<category><![CDATA[kohlhase]]></category>
		<category><![CDATA[MKM]]></category>
		<category><![CDATA[semantic documents]]></category>

		<guid isPermaLink="false">http://kwarc.info/kohlhase/blog/?p=17</guid>
		<description><![CDATA[I am just sitting the CIAO workshop and Alan Bundy and Michael Chan are talking about a very nice topic: the evolution of ontologies in Physics. They are applying this to historical examples like the latent heat problem and the MOND theory that is hot in Physics at the moment. The idea is that when [...]]]></description>
			<content:encoded><![CDATA[<p>I am just sitting the CIAO workshop and Alan Bundy and Michael Chan are talking about a very nice topic: the evolution of ontologies in Physics. They are applying this to historical examples like the latent heat problem and the MOND theory that is hot in Physics at the moment. The idea is that when experiments contradict theory, there is a clash between the theory ontology Ot and the sensory Ontology Os, which they solve by renaming apart selected concepts between the ontologies to resolve the contradiction. So they change the ontologies by renaming. The nice thing is that they can interpret the operation of renaming as a conservative theory extension which gives a nice interpretation of minimal theory change/repair.</p>
<p>You can find the details <a href="http://dream.inf.ed.ac.uk/projects/ontoevol/">here.</a></p>
<p>Even though I totally buy into their observations, I think that  it would be better to keep the theories as they are and interpret the repair operations as theory morphims. That would be a non-desctructive operation, and the operations would become very natural theory morphisms.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2008/02/21/ontology-repair-in-physics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Success Rates in the arXMLiv project</title>
		<link>http://kwarc.info/blog/2007/12/09/success-rates-in-the-arxmliv-project/</link>
		<comments>http://kwarc.info/blog/2007/12/09/success-rates-in-the-arxmliv-project/#comments</comments>
		<pubDate>Sun, 09 Dec 2007 06:50:19 +0000</pubDate>
		<dc:creator>kohlhase</dc:creator>
				<category><![CDATA[kohlhase]]></category>
		<category><![CDATA[mathml]]></category>
		<category><![CDATA[MathSearch]]></category>
		<category><![CDATA[semantic documents]]></category>

		<guid isPermaLink="false">http://kwarc.info/kohlhase/blog/?p=16</guid>
		<description><![CDATA[I have been silent for a long time, since the semester and various papers have kept me busy. But the semester is over, now&#8230; We have been making some progress on the conversion of the arXiv  collection from LaTeX to XHTML+MathML (see the arXMLiv project at KWARC), and I have announced that we have over [...]]]></description>
			<content:encoded><![CDATA[<p>I have been silent for a long time, since the semester and various papers have kept me busy. But the semester is over, now&#8230;</p>
<p>We have been making some progress on the conversion of the <strong><a href="http://www.arxiv.org" title="arXiv" target="_blank">arXiv</a>  </strong>collection from LaTeX to XHTML+MathML (see the <a href="http://kwarc.info/projects/arxmliv" title="arXMLiv Project" target="_blank">arXMLiv</a> project at KWARC), and I have announced that we have over 50% &#8220;success rate&#8221;. I have been asked by Aaron Krowne what success rate means and when we are going to reach 100%.  Here is the story.</p>
<p>First I would like to briefly talk about what we are doing. We are using Bruce Miller&#8217;s<a href="http://dlxf.nist.gov/LaTeXML" title="LaTeXML" target="_blank"> LaTeXML</a> converter over the ca 370 000 documents contained in the arXiv. Heinrich Stamerjohanns has build a test harness for LaTeXML that parses the log files and makes the statistics available <a href="http://arxmliv.kwarc.info" title="arXMLiv Statistics" target="_blank">on the web</a>. This is a very powerful way of doing things, it has exposed a lot of problems in LaTeXML and has allowed Bruce to make the program much more stable (see e.g. <a href="http://arxmliv.kwarc.info/history.php" title="LaTeXML error development" target="_blank">the fatal error development</a>). At 370000 LaTeX documents from all over the world over 15 years, there is almost no error you will not encounter. The other result is that we are sitting on what is probably the largest collection of documents with MathML in them worldwide.</p>
<p>The main technical task of the arXMLiv project is to supply LaTeXML bindings for       the (thousands of) LaTeX classes and packages used in the arXiv collections. A group of Jacobs University Undergrads are helping with this. Since we are still in a development mode, we do only download last year&#8217;s collection of articles after newyear (about 80000+ new ones in a couple of weeks).</p>
<p align="left">Now, let&#8217;s come back to the questions: Technically success means that the LaTeXML program does not throw any, errors, i.e. that all macros are known. Whether the transform is mathematically correct, is another matter, this needs human testers, and organizing that is an interesting problem in itself. We have first ideas in this direction and will try to make progress on this front in the next months.</p>
<p align="left">&nbsp;</p>
<p align="left">And now to the percetages:  I am not sure whether we will hit 90% at all. The problem is that this about the number of files that can still be  successfully  by LaTeX, since arXiv does not have a viable package management system.</p>
<p>Furthermore, arXiv papers use about 7000 packages and classes, of which I guess three quarters are used by less than five papers. So we<br />
are only going to bother about giving LaTeXML bindings for the more important ones (following the 80/20 rule). Moreover, the older the papers get, the less likely they are are to be successful, so I guess conversion success rates will go up automatically when we add<br />
the 2007 papers (ca 80000+). Finally, the success rates vary considerably over the different categories of the arXiv. The success rate actually dropped by 10% by to 50% by starting a big new category (we had been at about 60% before).</p>
<p>My personal suspicion is that we will reach 70% in the next three months, then the going will become slower, and I am not sure how much we<br />
will go beyond 80% realistically with the resources (a couple of undergrads) any time soon. To reach this, we would have to take the project global, which I would not mind, but which I am not necessarily seeing as one of my priorities. But you are of course invited to join our little project, so just contact me if you are interested.</p>
]]></content:encoded>
			<wfw:commentRss>http://kwarc.info/blog/2007/12/09/success-rates-in-the-arxmliv-project/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

