2009年2月19日星期四

20090218-维基相关调研

Harvesting Wiki Consensus-Using Wikipedia Entries as Ontology Elements

only 3% of the sample have turned into a disambiguation page during its lifespan. we can estimate that each month, about 2,465,000 change operations are made by Wikipedia users, but only 5% of concepts change in a major sense during their lifespan. We think this is a fundamental argument in favor of community-centric ontology building.

On the basis of a quantitative analysis of current Wikipedia entries and their properties we have provided substantial evidence that the URIs of Wikipedia entries are surprisingly reliable identifiers for ontology concepts. In addition, we have demonstrated how the more than one million entries in Wikipedia can be used as ontology elements, opening this enormous source of named classes for making the Semantic Web a reality.

---------------------

Collaborative Ontology Building with WikiOnt - A multi-agent based ontology building environment


describe collaborative ontology building in analogy to Wikis, but (1) do notborrow more from the Wiki community than the pure name, (2) take a very richontology meta-model as the starting point, (3) do not elaborate on the communityfocus of ontology building, and (4) do not address the advantage of addingmultimedia elements in the informal descriptions of concepts.

D. Collaborative Knowledge Base Construction
Some collaborative knowledge base construction projects, although not focused on
ontology building, address similar problems.

4) WikiPedia: WikiPedia is a wiki-based open-content encyclopedia that is
available in several languages. There are 315,000 articles in English alone as of
July, 2004. It is an open encyclopedia that is editable by participants.
WikiPedia works and assumes that that most of people in the community behave in a
manner that benefits the community. Articles in WikiPedia are written in natural
language, and the relation between items is not formal. Nevertheless, articles
can be seen as concepts and links between them seen as properties among them, in
a informal sense.

----------------------

OntoWiki Community-driven Ontology Engineering and Ontology Usage based on Wikis(2005)

Standard Wiki technology can be easily used as an ontology development environment without modification, reducing entry barriers for the participation of users in the creation and maintenance of lightweight ontologies.

----------------------

数据采集

nekohtml:它的作用是将半结构化的HTML代码转化成严格的结构化的XML文档,具有自动补齐
甚至修正HTML错误的健壮性功能;xercesImpl:它的作用是对XML文档建DOM树的功能,它是应用Xpath的一个前提条件;xalan:它的作用是充当一个“Xpath引擎”,即将用户输入的XPath处理后得到与其对应的
DOM树结点。xstream:实现XML和Java对象间的相互转换。comms-codec、commons-httpclient.jar:下载网页。

----------------------
本体(Ontology)作为一种能在语义和知识层次上描述信息系统的概念模型的形式化规范说明,提供了一套对信息和知识进行规范化描述和建模的方法

----------------------

没有评论:

发表评论