Automating Big Data Cleaning: An Example Using Local Bibliometric Data

dc.contributor.authorCarson, Jana
dc.contributor.authorGordon, Shannon
dc.date.accessioned2017-09-08T13:51:20Z
dc.date.available2017-09-08T13:51:20Z
dc.date.issued2017-04-06
dc.description.abstractThe University of Waterloo recognizes bibliometric data as an important piece of evidence-based research assessment, and recommends bibliometric data as one measure, among many, for capturing research productivity trends, and elements of research impact. Even when working from a basket of measures, bibliometric data remains complex and requires significant cleaning due to issues of name ambiguity. This session will explore an innovative collaboration between the Library and Institutional Analysis and Planning (IAP) to support the integrity of local, discipline-level bibliometric data by automating key data processes of an internal project. This session will introduce how bibliometric data is relevant to the University, the process used to gather and vet local bibliometric data, and the ways in which key data processes have been successfully automated using Python and a database to support efficient reporting. Given known challenges presented by name ambiguity, this collaborative framework makes it possible to support the integrity of local bibliometric data—a key step in supporting this and similar in-demand analyses at the University.en
dc.identifier.urihttp://hdl.handle.net/10012/12333
dc.language.isoenen
dc.subjectBibliometricsen
dc.subjectResearch Productivityen
dc.subjectBig Dataen
dc.subjectData Cleanupen
dc.subjectCollaborationen
dc.subjectUniversity Partnershipsen
dc.titleAutomating Big Data Cleaning: An Example Using Local Bibliometric Dataen
dc.typeConference Presentationen
dcterms.bibliographicCitationCarson, J., & Gordon, S. (2016). Automating Big Data Cleaning: An Example Using Local Bibliometric Data. Conference presentation as presented at WatITIs, Waterloo, On.en
uws.contributor.affiliation1Waterloo Libraryen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelLibrarianen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Waterloo 2017 Staff Conf.pdf
Size:
1.52 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.46 KB
Format:
Plain Text
Description: