Topic Map conversion of YSO

From WandoraWiki
Revision as of 17:01, 19 August 2008 by Akivela (Talk | contribs)

Jump to: navigation, search

YSO is the Finnish General Upper Ontology based on the Finnish General Thesaurus maintained by The National Library of Finland. The Finnish General Thesaurus was converted to YSO by Semantic Computing Research Group (SeCo) during FinnONTO project. SeCo also hosts YSO in their National Finnish Ontology Service ONKI. Detailed description of YSO is available at Tietolinja magazine 1/2007 (in Finnish). YSO is an acronym of words Yleinen Suomalainen Ontologia. YSO contains ca. 20 000 concepts.

Topic Map conversion of YSO is based on RDF dump kindly provided by the SeCo team. Topic Map YSO was created using Wandora's RDF import feature. Machine translated YSO was processed manually to fix topic names and associations. Topic Map conversion is not identical to RDF version. Differences are discussed below.

Contents

Download Topic Map conversion of YSO

Topic Map conversion of YSO is available as Wandora project file and XTM dump.

  • Wandora project file (2.4 MB) is targeted to Wandora users.
  • XTM dump (zipped 2.3 MB, uncompressed 58.8 MB) can be used in any XTM capable Topic Map application.

History

  • 2008-08-14 Initial release.

Metrics

Metrics were measured from YSO layer of Wandora project file.

  • Number of term topics: 20490
  • Number of domain topics: 61


  • Number of topics: 20747
  • Number of associations: 103397
  • Number of topic base names: 20738
  • Number of subject identifiers: 39894
  • Number of subject locators: 0
  • Number of occurrences: 8439
  • Number of distinct topic classes: 8
  • Number of distinct types of associations: 7
  • Number of distinct roles in associations: 10
  • Number of distinct players in associations: 20708
  • Average coefficient for layer YSO is 0.5076


Yso connections.gif

Conversion details

Below is a screenshot of Wandora with Topic Map conversion of YSO. Wandora's topic panel has todellisuus topic (reality in Finnish) open. Topic has variant name in Swedish and English. Topic plays also a role in Associative Relation and Related-Term associations. Term's domain and superclass have also been specified with equivalent associations.


Yso screenshot.gif


In general each YSO term topic is an instance of topic term (yso). YSO term has a base name of equivalent Finnish word and contains Finnish, Swedish, and English variant names. In some cases English variant is missing. Some terms also contain alternative labels as altLabel (yso) occurrences. Term may also contain short description as a comment (rdfs) occurrence.

Each term topic has two subject identifiers. First identifier refers to term's YSO id. Second refers to equivalent YSA id i.e. id of term in Finnish General Thesaurus. For example, previous screenshot had a topic with subject identifiers

 http://www.yso.fi/onto/ysa/Y5462
 http://www.yso.fi/onto/yso/p5016

Term topic may contain associations:

  • Superclass-Subclass associations specify standard sub-superclass relations between terms.Root node of Superclass-Subclass associations is topic yso-käsitteet. Subclasses of root node are muuttuva, pysyvä, and abstrakti. Graph below views Superclass-Subclass associations of topic ilmiöt (phenomena in English). Dark blue arrows and texts were added in PhotoShop.


Yso graph example.gif


  • Associative Relation associations specify a general relation between two terms. It looks like most Associative Relation associations have a symmetric duplicate where players have been switched. This duplicates the amount of Associative Relation associations.
  • Broader-Narrower associations express the fact that a term is some way more general than another.
  • Homonym associations link terms with identical names but different meaning. Identical base names merge in topic maps and base names contain additional number to distinguish different term topics. For example osakeyhtiöt has a homonym with base name osakeyhtiöt_2.
  • Meronym associations specify part-whole relations between term topics.
  • Related-Term associations specify a general relation between two terms. It looks like these associations are similar to Associative Relation associations. However, association groups are not identical and they have been left separate.
  • Term-Domain associations specify term's domain. YSO contains 61 different domains. Domains are instances of domain (yso) topic.

Limitations

  • Original YSO RDF(S) contains separate RDF resources for YSA terms and YSO concepts. These two resources have been merged in Topic Map conversion of YSO. YSO Topic Map contains single topic for one YSA term and YSO concept pair. As a consequence each term topic has two subject identifiers, one referring the term and one referring the concept.

License of The Topic Map Conversion of YSO

Copyright (c) 2007-2008, FinnONTO Consortium
All rights reserved.

YSA contributed by The National Library of Finland, 2007.
TOPIC MAP CONVERSION OF YSO contributed by Wandora Team, 2008.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Personal tools