HSOpen Apurahat as a topic map

From WandoraWiki
Revision as of 19:57, 3 June 2011 by Akivela (Talk | contribs)


Helsingin Sanomat, a major Finnish newspaper and media company, organized an open data hackathon titled HS Open #2 - Follow the Money on 23 May 2011. HS released two data sets for the event. The first data set contained the financial support reports of members of the Finnish parliament. The second data set covered grants received by Finnish artists between 2000 and 2010. The Wandora Team participated in the hackathon and would like to thank the organizers. As our contribution to the event, we created a special importer for the Wandora application that converts the artists' grants data set to a topic map format. As a result, the data set can be imported into any topic map application, such as Wandora, for further analysis. Having created the importer, we are now ready to publish the topic map conversion of the original data set.

HSOpen Apurahat topic map and its license

The Topic Maps conversion of the HSOpen Apurahat data set is distributed as an XTM 2.0 serialization and as a Wandora project file.

The conversion date was 3 June 2011. The original data set is available here. To open the XTM 2.0 serialization in Wandora, see How to import existing topic map to Wandora. To open the Wandora project file, see How to save and load project.
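For readers who want to inspect the XTM 2.0 serialization outside Wandora, the following is a minimal sketch of how the topic and association counts of an XTM 2.0 document could be checked with the Python standard library. The embedded sample document and the function name are illustrative assumptions and are not taken from the actual conversion; only the XTM 2.0 namespace and element names come from the XTM 2.0 format itself.

```python
import xml.etree.ElementTree as ET

# Namespace used by XTM 2.0 documents.
XTM_NS = "http://www.topicmaps.org/xtm/"

# A tiny hand-written XTM 2.0 document (hypothetical, NOT the real data set).
SAMPLE_XTM = """
<topicMap xmlns="http://www.topicmaps.org/xtm/" version="2.0">
  <topic id="person-1"><name><value>Example Artist</value></name></topic>
  <topic id="category-1"><name><value>kirjallisuus</value></name></topic>
  <topic id="grant-decision"><name><value>Päätös</value></name></topic>
  <topic id="role-person"><name><value>Henkilö</value></name></topic>
  <topic id="role-category"><name><value>Hakemusluokka</value></name></topic>
  <association>
    <type><topicRef href="#grant-decision"/></type>
    <role>
      <type><topicRef href="#role-person"/></type>
      <topicRef href="#person-1"/>
    </role>
    <role>
      <type><topicRef href="#role-category"/></type>
      <topicRef href="#category-1"/>
    </role>
  </association>
</topicMap>
"""


def count_topics_and_associations(root):
    """Return (topic_count, association_count) for an XTM 2.0 root element."""
    # In XTM 2.0, topics and associations are direct children of <topicMap>.
    topics = root.findall(f"{{{XTM_NS}}}topic")
    associations = root.findall(f"{{{XTM_NS}}}association")
    return len(topics), len(associations)


sample_root = ET.fromstring(SAMPLE_XTM)
sample_counts = count_topics_and_associations(sample_root)
print(sample_counts)

# For a downloaded file, the root could be loaded instead with
# ET.parse("path/to/file.xtm").getroot()  (the path here is a placeholder).
```

Run against the published XTM 2.0 file, a check like this would be expected to agree with the topic and association counts reported in the conversion details, provided the file stores all topics and associations as direct children of the root element.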

The Topic Maps conversion of HSOpen Apurahat is licensed under Creative Commons Attribution. The source of the original data set is Helsingin Sanomat. The Topic Maps conversion of HSOpen Apurahat was created by Aki Kivelä, Wandora Team.

Topic map conversion details

  • The language of the topic map is Finnish.
  • The conversion contains 11993 topics and 38952 associations. The number of person (Henkilö) topics is 5238, and the number of grant associations (Päätös) is 16594. Further statistics on the topic map are shown below.


Apurahat 01.gif


  • The original data set contains each artist's day, month and year of birth. The Topic Maps conversion contains only the birth year; the month and day have been dropped.
  • The original data set contains errors. These errors carry over into the Topic Maps conversion.
  • The connection distribution of the topic map is shown below.


Apurahat 02.gif
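A connection distribution of this kind can be sketched in a few lines of Python. The sketch assumes that "connections" means the number of associations in which a topic plays a role, which is an interpretation and may differ from how the chart above was produced. The association list is made-up toy data.

```python
from collections import Counter


def connection_distribution(associations):
    """Map 'associations per topic' to 'number of topics with that count'.

    `associations` is an iterable of tuples, each listing the topics that
    play a role in one association. Topics that appear in no association
    are not represented in the result.
    """
    degree = Counter()
    for players in associations:
        # Count each topic at most once per association.
        for topic in set(players):
            degree[topic] += 1
    return Counter(degree.values())


# Toy associations (hypothetical identifiers, not from the real data set).
toy_associations = [
    ("artist-a", "kirjallisuus"),
    ("artist-a", "kuvataide"),
    ("artist-b", "kirjallisuus"),
    ("artist-c", "elokuvataide"),
]

toy_distribution = connection_distribution(toy_associations)
for connections, topic_count in sorted(toy_distribution.items()):
    print(connections, topic_count)
```

In the toy data, four topics take part in one association and two topics take part in two.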


Screen captures and some visualizations

The first screen capture shows the HSOpen Apurahat topic map open in the Wandora application. The user is looking at grant categories and has opened the grants for critics and reviewers (arvostelijat in Finnish).


Apurahat 04.gif


Next, the Wandora user looks at grants for cinematography (elokuvataide in Finnish) and opens one specific artist. Wandora shows all grants received by the selected artist, along with the artist's language, home town, sex and birth year.


Apurahat 05.gif


The Wandora user clicks the home town (Kotipaikka in Finnish) topic in the left column and selects the town MIKKELI. Wandora shows all artists who live in MIKKELI.


Apurahat 06.gif


The Wandora user switches from the table view to the graph view and opens the grant categories once again. Wandora shows a simple star graph in which the grant category topic (Hakemusluokka in Finnish) is in the middle and the individual category topics are the branches. The length of a branch represents the weight of the category: the more grants (and artists) a category has, the more weight it has. The category labels and light grey arrows were added in Adobe Photoshop.


Apurahat 08.gif
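The idea of a category weight can be illustrated with a short Python sketch that counts grants per category. This is only an approximation of the description above: how Wandora actually derives the weight and maps it to a branch length is not stated here, and the grant records below are made-up toy data.

```python
from collections import Counter

# Toy (artist, category) grant records; hypothetical, not the real data set.
toy_grants = [
    ("artist-a", "kirjallisuus"),
    ("artist-b", "kirjallisuus"),
    ("artist-a", "kuvataide"),
    ("artist-c", "kirjallisuus"),
    ("artist-d", "elokuvataide"),
]


def category_weights(grants):
    """Count the number of grants in each category."""
    return Counter(category for _artist, category in grants)


toy_weights = category_weights(toy_grants)

# Categories ordered from heaviest to lightest.
toy_ranking = toy_weights.most_common()
print(toy_ranking[0])
```

A count of distinct artists per category could be used as the weight instead by collecting the artists of each category into a set before counting.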


Now the Wandora user expands all category nodes, and Wandora shows all persons who have received a grant in each category. Magenta circles represent graph nodes. The darker edges are relationships between artists and grant categories. The interesting part is the edges that run between categories. A single artist may have received grants from two different categories, for example literature (kirjallisuus) and visual arts (kuvataide). These multitalented artists appear near one category, but an edge also connects them to another category.


Apurahat 09.gif
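The artists who bridge two categories can also be found without a graph view. The following Python sketch groups categories by artist and keeps the artists with more than one category. The records and function names are illustrative assumptions, not part of Wandora or the original data set.

```python
from collections import defaultdict
from itertools import combinations

# Toy (artist, category) grant records; hypothetical, not the real data set.
sample_grants = [
    ("artist-a", "kirjallisuus"),
    ("artist-a", "kuvataide"),
    ("artist-b", "kirjallisuus"),
    ("artist-b", "kirjallisuus"),
    ("artist-c", "elokuvataide"),
]


def bridging_artists(grants):
    """Return {artist: sorted categories} for artists with 2+ categories."""
    categories_by_artist = defaultdict(set)
    for artist, category in grants:
        categories_by_artist[artist].add(category)
    return {
        artist: sorted(categories)
        for artist, categories in categories_by_artist.items()
        if len(categories) > 1
    }


def bridged_category_pairs(bridges):
    """Return {(category_1, category_2): [artists]} for each bridged pair."""
    pairs = defaultdict(list)
    for artist, categories in bridges.items():
        # Every pair of categories shared by one artist is a bridge.
        for pair in combinations(categories, 2):
            pairs[pair].append(artist)
    return dict(pairs)


sample_bridges = bridging_artists(sample_grants)
sample_pairs = bridged_category_pairs(sample_bridges)
print(sample_bridges)
print(sample_pairs)
```

In the toy data only artist-a bridges two categories. An artist with several grants in the same category, such as artist-b, is not counted as a bridge.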


Wandora starts to animate the nodes and edges. After a while, the graph has opened up and looks very dense. Here again, the artists who have received grants in several categories connect the different category "balloons". The graph also clearly shows the relative sizes of the categories. Literature (kirjallisuus) appears to be the art form that receives the most grants in Finland.


Apurahat 11.gif


The Wandora user now closes the literature (kirjallisuus) and visual arts (kuvataide) categories and tries to find details in the messy center of the graph.


Apurahat 12b.gif


The Wandora user zooms in on the black frame shown in the image above. When the user is close enough, node labels appear.


Apurahat 13.gif