Library Carpentry: Towards a New Professional Dimension (Part II – Automatic Authority Control to Enhance Retrieval)
Keywords:Authority Data, Data Reconciliation, Geodetic Search, Geographic Name Authority, Geospatial Data, Linked Open Data (LOD), Openrefine
AbstractAuthority control for bibliographic data management in Indian libraries is generally a neglected area and thereby library OPACs of the country (including OPAC of the National Library) supports only the finding function of a catalogue and not the collocating function. In this context, the part II of the three part series on library carpentry (part I has been published in April issue) is an attempt to apply library carpentry methods in building authority datasets from scratch. It deals with the methodologies for developing authority datasets by applying data wrangling techniques and subsequent transformations of these datasets into ready-to-import MARC 21 format (for authority data). Like the previous part of the series, this research study is also represented through a case study. The case study narrates development of geographic name authority datasets for - states and union territories (level I), districts of India (level II), sub-districts (level III) and community development blocks of India (level IV). It also demonstrates that how the merged geographic name authority file for India can be implemented in an open source ILS and can become instrumental in enhancing retrieval efficiency through geodetic search feature in an open source library discovery system. It concludes that the proposed mechanisms and methodology (supported with proofs of the concept) may lead to a new era of authority-controlled cataloguing in Indian libraries.
Allison-Cassin, S. and Scott, D. (2018). Wikidata: A platform for your library’s linked open data. The Code4Lib Journal, 40. https://journal.code4lib.org/articles/13424.
Anderson, B. (1992). Authority control: Whose pain is it? Library Acquisitions: Practice & Theory, 16(1), 63-66. https://doi.org/10.1016/0364-6408(92)90111-5.
Bensmann, F., Zapilko, B. and Mayr, P. (2017). Interlinking large-scale library data with authority records. Frontiers in Digital Humanities, 4. https://doi.org/10.3389/fdigh.2017. 00005.
Carlson, S. and Seely, A. (2017). Using Openrefine’s reconciliation to validate local authority headings. https://doi.org/10.1080/01639374.2016.1245693.
Diaz-Valenzuela, I., Martín-Bautista, M. J. and Vila, M. A. (2010). An automatic data mining authority control system: A first approach. 2010 10th International Conference on Intelligent Systems Design and Applications, p. 569-574. https://doi.org/10.1109/ISDA.2010.5687205.
Diaz-Valenzuela, I., Martin-Bautista, M. J., Vila, M.-A. and Campaña, J. R. (2013). An automatic system for identifying authorities in digital libraries. Expert Systems with Applications, 40(10), 3994-4002. https://doi.org/10.1016/j.eswa.2013.01.010.
Dillon, M., Knight, R. C., Lospinuso, M. F. and Ulmschneider, J. (1981). The use of automatic indexing for authority control. Journal of Library Automation, 14, 268-277. https://doi.org/10.6017/ital.v14i4.5305.
Dryden, J. (2008). From authority control to context control. Journal of Archival Organization, 5(1-2), 1-13. https://doi.org/10.1300/J201v05n01_01.
Harlow, C. (2015). Data munging tools in preparation for RDF: Catmandu and LODRefine. The Code4Lib Journal, 30. https://journal.code4lib.org/articles/11013.
Jørgensen, P. H. (2001). Practical application of FRBR and RDF for cataloguing, authority control and searching of relationships among electronic resources [English abridged text of the version presented at the International Conference] (L. Sardo, S. Gambari, & M. Guerrini, Eds.). Editrice Biblio grafica. http://eprints.rclis.org/4114/1/jorgensen_eng.pdf.
Leiva-Mederos, A., A. Senso, J., Domínguez-Velasco, S. and Hípola, P. (2013). AUTHORIS: A tool for authority control in the semantic web. Library Hi Tech, 31(3), 536- 553. https://doi.org/10.1108/LHT-12-20112-0135.
Library of Congress (2017). Frequently asked questions on establishing geographic names for NACO - name authority cooperative program of the PCC - program for cooperative cataloging (Library of Congress). NACO - Name Authority Cooperative Program. https://www.loc.gov/aba/pcc/naco/ geogfaq.html.
Library of Congress, Network Development and MARC Standards Office (NDMSO). (2020). Marc 21 format for authority data. MARC 21 format for Authority Data: Table of Contents (Network Development and MARC Standards Office, Library of Congress). https://www.loc.gov/marc/ authority/.
Manghi, P. and Mikulicic, M. (2011). PACE: A General-Purpose Tool for Authority Control. In E. García- Barriocanal, Z. Cebeci, M. C. Okur, & A. Öztürk (Eds.), Metadata and Semantic Research, p. 80-92. Springer. https://doi.org/10.1007/978-3-642-24731-6_8.
Martín, L. T., Peña, F. O. F., Mederos, A. A. L., Goovaerts, M., Reyes, D. C. and Álvarez, W. A. R. (2015). Software applications ecosystem for authority control. In: E. Garoufallou, R. J. Hartley, & P. Gaitanou (Eds.), Metadata and Semantics Research, Vol. 544, Springer International Publishing; p. 214-224. https://doi.org/10.1007/978-3-319- 24129-6_19.
Mbaakanyi, D. M., Ubogu, F. N. and Lumande, E. (1993). Subject authority control in a computerised system: Use of CDMARC Subjects in an academic library. The Electronic Library, 11(4/5), 311-316. https://doi.org/10.1108/eb045253.
McCutcheon, S. (2011). Basic, fuller, fullest: Treatment options for electronic theses and dissertations. Library Collections, Acquisitions, and Technical Services, 35(2), 64-68. https://doi.org/10.1016/j.lcats.2011.03.019.
McKay, B. and Geer, B. (1995). Series authority records in an integrated system: Enhancing catalog effectiveness. Serials Review, 21(4), 23-34. http://dx.doi.org/10.1080/009 87913.1995.10764274.
Mukhopadhyay, P. (2019). Place vs. Space in authority data: Enhancing library retrieval through geodetic search. NDLI-UNESCO International Symposium on Knowledge Engineering for Digital Library Design-2019, New Delhi. http://kedl2019.ndl.gov.in/index.php/parthasarathimukhopadhyay/.
Mukhopadhyay, P. and Mukhopadhyay, M. (2018). Search enhancements in library discovery: Towards a new dimension of GIS based information retrieval. LIS Education in India: Current Scenario and Future Strategies - Proceedings and Papers, 57, 280-291. http://www.iaslic1955.org.in/fckeditor/userfiles/file/IASLIC%20Special%20 Publication%20_2018_%20Vol%2057.pdf.
NDL India. (2020, January 28). Parthasarathi Mukhopadhyay: Enhancing library retrieval through geodetic search [Video]. YouTube. https://www.youtube. com/watch?v=VoQlHzLLd-4.
O’Neill, E. T. (2002). FRBR: Functional Requirements for Bibliographic Records. Library Resources & Technical Services, 46(4), 150-159. https://doi.org/10.5860/lrts.46n4.150.
Park, J. and Lu, C. (2009). Application of semi-automatic metadata generation in libraries: Types, tools, and techniques. Library and Information Science Research, 31(4), 225-231. https://doi.org/10.1016/j.lisr.2009.05.002.
Ryan, C., Grant, R., Carragáin, E. Ó., Collins, S., Decker, S. and Lopes, N. (2015). Linked data authority records for Irish place names. International Journal on Digital Libraries, 15(2), 73-85. https://doi.org/10.1007/s00799-014-0129-8.
Siegfried, S. L. and Bernstein, J. (1991). Synoname: The Getty’s new approach to pattern matching for personal names. Computers and the Humanities, 25(4), 211-226. https://doi.org/10.1007/BF00116076.
Smith-Yoshimura, K. (2016, September 28). Metadata reconciliation. Hanging Together. https://hangingtogether. org/?p=5710.
Smith-Yoshimura, K. (2020). Transitioning to the Next Generation of Metadata. OCLC Research. https://doi. org/10.25333/rqgd-b343.
Tillett, B. and Taylor, A. G. (2004). Authority control in organizing and accessing information. Routledge. https://doi.org/10.4324/9780203051092.
Tsui, S. L. and Hinders, C. F. (1999). Cost-Effectiveness and Benefits of Outsourcing Authority Control. Cataloging and Classification Quarterly, 26(4), 43-61. https://doi.org/10.1300/J104v26n04_04.
Vellucci, S. L. (2004). Commercial Services for Providing Authority Control: Outsourcing the Process. Cataloging and Classification Quarterly, 39(1-2), 443-456. https://doi.org/10.1300/J104v39n01_12.
Wiederhold, R. A. and Reeve, G. F. (2021). Authority Control Today: Principles, Practices, and Trends. Cataloging and Classification Quarterly, 59(2-3), 129-158. https://doi.org/10.1080/01639374.2021.1881009.
Zhu, L. and Seggern, M. V. (2005). Vendor-supplied authority control: Some realistic expectations. Technical Services Quarterly, 23(2), 49-65. https://doi.org/10.1300/ J124v23n02_04.
How to Cite
All the articles published in Journal of Information and Knowledge are held by the Publisher. Sarada Ranganathan Endowment for Library Science (SRELS), as a publisher requires its authors to transfer the copyright prior to publication. This will permit SRELS to reproduce, publish, distribute and archive the article in print and electronic form and also to defend against any improper use of the article.