Fetching Automatic Authority Data in ILS from Wikidata via OpenRefine


  • DLIS, University of Kalyani, Kalyani, Nadia − 741235, West Bengal
  • DLIS, University of Kalyani, Kalyani, Nadia − 741235, West Bengal




Authority Data, Data Wrangling, Koha, OpenRefine, SPARQL, Wikidata


Authority data is vital for effective library and information services. It plays a major role in realizing the collocation function of library catalogues and indexes. However, authority control has been neglected in library catalogues and other bibliographic databases in India. This paper demonstrates how authority data can be fetched automatically from Wikidata, a sibling project of Wikipedia. For this purpose, the query language SPARQL is used to retrieve the names of persons of Indian origin, along with their date of birth and place, from Wikidata. The collected datasets are processed and implemented as MARC21-based authority data in Koha, an open-source library management system. The paper also shows how the library and information science community can use these free, open-source platforms to gather, organize and share data, and how they enhance retrieval efficiency.
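The retrieval step described above can be illustrated with a minimal Python sketch. It is not the OpenRefine workflow used in the paper. It queries the public Wikidata SPARQL endpoint directly and maps each result row to a few MARC21 authority fields. Several choices here are assumptions made for illustration only: "persons of Indian origin" is approximated as humans (P31 = Q5) with country of citizenship India (P27 = Q668); the function names, the User-Agent string and the tuple-based record layout are hypothetical; and the field mapping (024 for the Wikidata identifier, 100 for the name heading, 370 for place of birth) is one plausible reading of MARC21, not the mapping reported by the authors.

```python
import json
import urllib.parse
import urllib.request

# Public Wikidata Query Service endpoint.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def build_query(limit=10):
    """Build a SPARQL query for people with Indian citizenship (an assumed
    stand-in for 'Indian origin'), their date of birth and place of birth."""
    return f"""
SELECT ?person ?personLabel ?dob ?placeLabel WHERE {{
  ?person wdt:P31 wd:Q5 ;      # instance of: human
          wdt:P27 wd:Q668 ;    # country of citizenship: India
          wdt:P569 ?dob .      # date of birth
  OPTIONAL {{ ?person wdt:P19 ?place . }}   # place of birth, if recorded
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en". }}
}}
LIMIT {int(limit)}
"""


def run_query(query):
    """Send the query to Wikidata and return the list of result bindings."""
    url = WDQS_ENDPOINT + "?" + urllib.parse.urlencode(
        {"query": query, "format": "json"}
    )
    # Hypothetical User-Agent; replace with one that identifies your tool.
    request = urllib.request.Request(
        url, headers={"User-Agent": "authority-sketch/0.1 (example)"}
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)["results"]["bindings"]


def to_authority_fields(binding):
    """Map one result row to (tag, subfields) pairs.

    Simplifications: the English label is used as the heading in direct
    order, only the birth year is kept, and indicators are omitted.
    """
    qid = binding["person"]["value"].rsplit("/", 1)[-1]
    name = binding["personLabel"]["value"]
    birth_year = binding["dob"]["value"].lstrip("+")[:4]
    fields = [
        ("024", [("a", qid), ("2", "wikidata")]),
        ("100", [("a", name), ("d", birth_year + "-")]),
    ]
    place = binding.get("placeLabel", {}).get("value")
    if place:
        fields.append(("370", [("a", place)]))
    return fields


if __name__ == "__main__":
    for row in run_query(build_query(limit=5)):
        print(to_authority_fields(row))
```

Running the script needs network access to the Wikidata endpoint. The resulting field lists would still have to be serialized as MARC21 records (for example MARCXML) before being imported into Koha; that step is left out of the sketch.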







How to Cite

Pal, A., & Mukhopadhyay, P. (2022). Fetching Automatic Authority Data in ILS from Wikidata via OpenRefine. Journal of Information and Knowledge, 59(6), 353–362. https://doi.org/10.17821/srels/2022/v59i6/170677
