HSOpen Apurahat as a topic map
Helsingin Sanomat, a major Finnish newspaper and media company, arranged an open-data-hackathon with a title HS Open #2 - Follow the Money 23rd of May 2011. HS released two data sets for the event. First data set included money support reports from members of Finnish parliament. Second data set was about grants received by Finnish artists between 2000-2010. Wandora Team participated the hackathon and would like to thank the organizers. To contribute the event we created a special importer for Wandora application that converts the data set of artist' grants to a topic map format. As a result, the data set can imported to any topic map application, such as Wandora, for further analysis. After creating the importer, we are ready to publish the topic map conversion of the original data set.
HSOpen Apurahat topic map and it's license
Topic Maps conversion of HSOpen Apurahat data set is distributed as
- XTM 2.0 topic map serialization.
- XTM 2.0 topic map serialization (including Wandora's base ontology).
- Wandora project file.
Conversion date was 3rd of June 2011. Original data set is available here. To open XTM 2.0 serialization in Wandora see How to import existing topic map to Wandora. To open Wandora project file see How to save and load project.
License of the Topic Maps conversion of HSOpen Apurahat is Creative Commons Attribution 1.0 Finland. Source of the original data set is Helsingin Sanomat. Topic Maps conversion of HSOpen Apurahat has been created by Aki Kivelä, Wandora Team.
Topic map conversion details
- The language of topic map is Finnish.
- Number of topics in conversion is 11993. Number of associations is 38952. Number of person (henkilö) topics is 5238. Number of grant associations (päätös) is 16594. More about topic maps related in is below
- Original data set contains artist's birth day, month and year. Topic Maps conversion contains only artist's birth year. Month and day information have been dropped away.
- Original data set contains errors. These errors are in Topic Maps conversion also.
- Connection distribution of topic map is
Screen captures and some visualizations
First screen capture views the HSOpen Apurahat topic map open in Wandora application. User is looking at grant categories and has opened grants of critics and reviewers (arvostelijat in Finnish).
Next Wandora user looks at grants of cinematography (elokuvataide in Finnish) and opens one specific artist. Wandora views all grants received by selected artist, language, home town, sex and birth time of the artist.
Wandora user mouse clicks the home-town (Kotipaikka in Finnish) topic in left column and selects the MIKKELI town. Wandora views all artists living in MIKKELI.
Wandora user switches the table view to graph view and opens grant categories once again. Wandora views a simple star graph where grant-category (Hakemusluokka in Finnish) is in the middle and category topics are branches. Length of the branch represents the weight of the category. The more grants (and artists) in the category the more weight category has. Category labels and light gray arrows have been added in Adobe Photoshop.
Now Wandora user expands all category nodes and Wandora views all persons that have received a grant in given category. Magenta circles represent graph nodes. Darker edges are relationships between artists and grant categories. What is interesting, are the edges between categories. It appears that single artist may have received a grant from two different categories, for example from literature (kirjallisuus) and visual arts (kuvataide). These multi talented artists appear near one category but then there is an edge that connects the artist to another category also.
Wandora starts to animate nodes and edges. After a while, the graph has opened. Again, here the artists who have received grants of several categories appear to connect different category "balloons". Graph also views clearly the relative size of categories. Literature (kirjallisuus) appears to be the most granted art form in Finland.
Wandora user now closes categories of literature (kirjallisuus) and visual arts (kuvataide) and tries to find details in the messy center of the graph.
Wandora user zooms in the black frame viewed in image above. When user is close enough, node labels appear.