A method for the identification of collaboration in large scientific databases
DOI:
https://doi.org/10.19132/1808-5245212.140-161Palabras clave:
Extraction and data integration. Information retrieval. Identification of collaboration.Resumen
The analysis of scientific collaboration networks has contributed significantly to improve the understanding of the collaboration process between researchers. Additionally, it has helped to understand how scientific productions by researchers and research groups evolve. However, the identification of collaborations in large scientific databases is not a trivial task, given the high computational cost of the prevalent methods. This paper proposes a method for identifying collaborations in large scientific databases, namely, ISColl – Identification of Scientific Collaboration. Unlike methods that use techniques such as exhaustive comparisons of publication pairs, the proposed method produces satisfactory results with a low computational cost, thus providing an interesting alternative for the modelling and characterization of large scientific collaboration networks. To demonstrate the potential of the proposed technique, tests were conducted using scientific publications data registered in the Lattes Platform of CNPq, with the obtained results yielding excellent accuracy during the identification of scientific collaborations.
Descargas
Citas
ALVES, A. D.; YANASSE, H. H.; SOMA, N. Y. LattesMiner: a multilingual DSL for information extraction from lattes platform. In: CONFERENCE ON SYSTEMS, PROGRAMMING, AND APPLICATIONS: SOFTWARE FOR HUMANITY, 2011, Portland. Proceedings… New York: ACM 2011. p. 85-92.
BAEZA-YATES, R. A.; RIBEIRO-NETO, B. A. Recuperação de informação: conceitos e tecnologia das máquinas de busca. 2. ed. Porto Alegre: Bookman, 2013.
CAÑIBANO, C.; BOZEMAN, B. Curriculum vitae method in science policy and research evaluation: the state-of-the-art. Research Evaluation, Oxford, v. 18, n. 2, p. 86-94, 2009.
DIAS, T. M. R. et al. Modelagem e caracterização de redes científicas: um estudo sobre a Plataforma Lattes. In: BRAZILIAN WORKSHOP ON SOCIAL NETWORK ANALYSIS AND MINING, 2., 2013, Maceió. Anais... [S.l.]: UFMG, UFRJ, 2013.
DIGIAMPIETRI, L. A.; et al. BraX-Ray: an x-ray of the brazilian computer science graduate programs. PLoS One, San Francisco, v. 9, p. e94541, 2014.
DIGIAMPIETRI, L.; MUGNAINI, R.; ALVES, C. Analysis of participation in supervised production of advisors: a case study in computer science. In: BRAZILIAN WORKSHOP ON SOCIAL NETWORK ANALYSIS AND MINING, 2., 2013, Maceió. Anais... [S.l.]: UFMG, UFRJ, 2013.
DIGIAMPIETRI, L. et al. Dinâmica das relações de coautoria nos programas de Pós-Graduação em Computação no Brasil. In: BRAZILIAN WORKSHOP ON SOCIAL NETWORK ANALYSIS AND MINING, 2012, Curitiba. Anais... Curitiba: UFPR, 2012.
DING, Y. Scientific collaboration and endorsement: network analysis of coauthorship and citation networks. Journal of Informetrics, Amsterdam, v. 5, n. 1, p. 187-203, 2011.
FERNANDES, G. O.; SAMPAIO, J. O.; SOUZA, J. M. XMLattes: a tool for importing and exporting curricula data. In: WORLD CONGRESS IN COMPUTER SCIENCE, COMPUTER ENGINEERING, AND APPLIED COMPUTING, 2011, Las Vegas. Proceedings... Las Vegas: WORLDCOMP, 2011.
GAYEN, A.; CHANDRA, J. Role of trust in evolution of scientific collaboration networks. In: INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING, 7., 2014, Beijing. Proceedings... New York: ACM, 2014.
LAENDER, A. et al. Ciência Brasil - the brazilian portal of science and technology. In: SEMINÁRIO INTEGRADO DE SOFTWARE E HARDWARE, 38., 2011, Natal. Anais eletrônicos... Natal, 2011.
LEE, D. et al. Complete trails of coauthorship network evolution. Physical Review E, New York, v. 82, 026112, 2010.
LOPES, G. R. Avaliação e recomendação de colaborações em redes sociais acadêmicas. 2012. Tese (Doutorado em Ciência da Computação) – Curso de Pós-Graduação em Computação, Universidade Federal do Rio Grande do Sul, Porto Alegre, 2012.
LOPES, G. R. et al. Ranking strategy for graduate programs evaluation. In: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS , 7., 2011, Sydney. Proceedings... Sydney: ICITA, 2011. p. 59-64,
MENA-CHALCO, J. P.; CESAR-JUNIOR, R. M. ScriptLattes: an open-source knowledge extraction system from the Lattes platform. Journal of the Brazilian Computer Society, Porto Alegre, v. 15, n. 4, p. 31-39, 2009.
MENA-CHALCO, J. P.; DIGIAMPIETRI, L. A.; CESAR-JUNIOR, R. M.. Caracterizando as redes de coautoria de currículos Lattes. In: BRAZILIAN WORKSHOP ON SOCIAL NETWORK ANALYSIS AND MINING, 2012, Curitiba. Anais... Curitiba: UFPR, 2012.
NEWMAN, M. E. J. The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences, Washington, v. 98, n. 2, p. 404-409, 2001a.
NEWMAN, M. E. J. Scientific collaboration networks. I. Network construction and fundamental results. Physical Review E, New York, v. 64, n. 1, p. 016131_1-016131_8, 2001b.
NEWMAN, M. E. J. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E, New York, v. 64, n. 1, p. 016132_1-016132_7, 2001c.
NEWMAN, M. E. J. Coauthorship networks and patterns of scientific collaboration. Proceedings of the National Academy of Sciences, Washington, v. 101, suppl. 1, p. 5200-5205, 2004.
PETERSEN, A. M. et al. Persistence and uncertainty in the academic career. Proceedings of the National Academy of Sciences, Washington, v. 109, n. 14, p. 5213-5218, 2012.
PROCOPIO, S. P., LAENDER, A. H. F., MORO, M. M. Analysis of network co-authoring the brazilian symposium on databases. In: SIMPÓSIO BRASILEIRO DE BANCO DE DADOS, 26., 2011, Florianópolis. Anais... Porto Alegre: SBC, 2011.
REVOREDO, K. et al. Mining scientific literature for analysis of collaboration in research communities. In: BRAZILIAN WORKSHOP ON SOCIAL NETWORK ANALYSIS AND MINING, 2012, Curitiba. Anais... Curitiba: UFPR, 2012.
Descargas
Publicado
Cómo citar
Número
Sección
Licencia
Derechos de autor 2015 Thiago Magela Rodrigues Dias, Gray Farias Moita

Esta obra está bajo una licencia internacional Creative Commons Atribución 4.0.
Autores que publican en esta revista están de acuerdo con los siguientes términos:
Los autores mantienen los derechos autorales y ceden a la Revista el derecho de la primera publicación, con el trabajo licenciado bajo la Licencia Creative Commons Attribution (CC BY 4.0), que admite compartir el trabajo con reconocimiento de la autoria.
Los autores tienen autorización para asumir contratos adicionales en forma separada, para la distribución no exclusiva de la versión del trabajo publicada en esta Revista, como para publicar en repositorio institucional, con reconocimiento de autoria y publicación inicial en esta Revista.
Los artículos son de acceso abierto y gratuitos. Según la licencia, usted debe dar crédito de manera adecuada, brindar un enlace a la licencia, e indicar si se han realizado cambios. No puede aplicar términos legales ni medidas tecnológicas que restrinjan legalmente a otras a hacer cualquier uso permitido por la licencia.