Thesis and dissertation subject metadata in repositories

an exploratory study on vocabulary control

Authors

DOI:

https://doi.org/10.19132/1808-5245284.123710

Keywords:

subject metadata, institutional repositories, dissertations, theses, indexing policy

Abstract

The metadata of theses and dissertations are entered into repositories automatically or manually in Dublin Core format, which differs from the format used by libraries and leads to duplicated work. The objective is to compare the thematic treatment procedures applied to theses and dissertations in the online catalogue and in the repository through the analysis of subject metadata. An exploratory study was carried out, with a case study on the use of vocabulary control in thesis and dissertation metadata, covering the institutional repository of São Paulo State University and the Athena online catalogue of the same institution. The study had two stages: a documentary study of the historical trajectory of the treatment of theses and dissertations at the university, and an analysis of the thematic treatment procedures applied to theses and dissertations in the databases by authors and cataloguers. The analysis of thematic treatment procedures relied on observation of how the systems function and of routines and patterns, in order to evaluate vocabulary control in subject metadata. The results revealed that vocabulary control is applied neither in the search interfaces for information retrieval nor in the alphabetical list of keywords, which means that authors receive no guidance on the assignment of descriptors. The analyses showed two thematic treatments, which result in different subject representations: natural language in the repository and controlled language in the catalogue. Two proposals were developed for the thematic treatment of subject metadata of theses and dissertations in repositories and library catalogues. It is concluded that the proposals may be supported by an indexing policy for the repository that provides for sharing data from the self-archiving of theses and dissertations with the catalogue and that allows the assignment of controlled vocabulary descriptors in addition to keywords.

Author Biographies

Mariângela Spotti Lopes Fujita, Universidade Estadual Paulista Júlio de Mesquita Filho

Retired professor at UNESP, Marília Campus; Permanent Faculty Member of the Graduate Program in Information Science. Area: Information Science; Subarea: Librarianship; Specialty: Indexing.

Rosane Ribas, São Paulo State University - UNESP

Specialist and Bachelor in Librarianship. She is the librarian responsible for the Technical Information and Documentation Group of the Rectory of the Universidade Estadual Paulista (Unesp), where she works in the legal area, researching University standards, writing and formatting administrative acts, cataloging and indexing. She collaborated with the Unesp Library Network Indexing Policy and coordinates the Unesp Thesaurus Permanent Commission.

Milena Rodrigues, São Paulo State University - UNESP

Bachelor in Librarianship with an MBA in Information Unit Management. Librarian in the Technical Section for Acquisition and Treatment of Information at the Faculty of Sciences and Letters, Araraquara Campus, Universidade Estadual Paulista (UNESP). She works with courses in Human Sciences and with classification, cataloging and indexing in the MARC21 format (bibliographic and authority). She is a member of the Unesp Language Group of the Unesp Library Network, now the Permanent Commission of the Unesp Thesaurus, where she collaborates on authority and term records.

Telma Silveira, São Paulo State University - UNESP

Bachelor of Librarianship. She is currently director of the Library of the Faculty of Philosophy and Sciences of the Universidade Estadual Paulista (Unesp) Campus de Marília, where she worked for ten years as supervisor of the Technical Section for Acquisition and Treatment of Information. She has experience in classification, cataloging in MARC21 format (bibliographic and authority) and indexing. She collaborated with the UNESP Library Network Indexing Policy.

Published

2022-09-27

How to Cite

FUJITA, M. S. L.; RIBAS, R.; RODRIGUES, M.; SILVEIRA, T. Thesis and dissertation subject metadata in repositories: an exploratory study on vocabulary control. Em Questão, Porto Alegre, v. 28, n. 4, p. 123710, 2022. DOI: 10.19132/1808-5245284.123710. Available at: https://seer.ufrgs.br/index.php/EmQuestao/article/view/123710. Accessed: 24 Jun. 2025.

Section

Article
