Beyond the book

Measure the number of contributions to Wikipedia entries from individuals from different counties as a measure of cultural bias.

3
mentions
1
contributor
43 commits | Last update: September 27, 2018

Cite this software

Choose a version:
DOI:
[[ releases.length > 0 ? releases[selectedIndex].doi : conceptDOI ]]
Copy to clipboard
Choose a citation style:
Download file

What Beyond the book can do for you

  • Provides humanities scholars with a tool to mine Wikipedia.
  • Mine Wikipedia revision history.
  • Ease of use for humanities scholars
  • Can be used as a proxy to measure the cultural bias from a country towards a specific topic

The book translation market is a topic of interest in literary studies, but the reasons why a book is selected for translation are not well understood. The "Beyond the Book" project investigates whether web resources like Wikipedia can be used to establish the level of cultural bias. This work describes the eScience tools used to estimate the cultural appeal of a book: semantic linking is used to identify key words in the text of the book, and afterwards the revision information from corresponding Wikipedia articles is examined to identify countries that generated a more than average amount of contributions to those articles. Comparison between the number of contributions from two countries on the same set of articles may show with which knowledge the contributors are familiar. We assume a lack of contributions from a country may indicate a gap in the knowledge of readers from that country. We assume that a book dealing with that concept could be more exotic and therefore more appealing for certain readers, while others are therefore less interested in the book. An indication of the 'level of exoticness' thus could help a reader/publisher to decide to read/translate the book or not. Experimental results are presented for four selected books from a set of 564 books written in Dutch or translated into Dutch, assessing their potential appeal for a Canadian audience. A qualitative assessment of quantitative results provides insight into named entities that may indicate a high/low cultural bias towards a book.

Read more
Tags
  • Text analysis & natural language processing
Programming Language
  • Python
License
  • Apache-2.0
Source code

Participating organizations

Contributors

  • Carlos Martinez-Ortiz
    Netherlands eScience Center
Contact person
Carlos Martinez-Ortiz
Netherlands eScience Center