BCC2010/MD/1

Report from the ALCTS Metadata Interest Group

and other metadata-related meetings

Jenn Riley, Chair, Metadata Subcommittee, MLA BCC

ALCTS Metadata Interest Group (Formerly Networked Resources and Metadata Interest Group) Annual Meeting, January 17, 2010, 8:00-10:00 AM

The ALCTS Networked Resources and Metadata Interest Group has become the Metadata Interest Group (MIG), effective at ALA Annual 2009. The current Chair is well-known to MLA members: Brad Eden, Associate University Librarian for Technical Services and Scholarly Communication, University of California, Santa Barbara. The MIG meeting at ALA Midwinter 2010 featured two presentations from leaders in the metadata community, followed by a brief business meeting.

In the first presentation, Jennifer Bowen of the University of Rochester  eXtensible Catalog (XC) project reported on the metadata management component of this project. One software package developed by the project is known as the Metadata Services Toolkit. This component allows project staff to set up periodic harvests of metadata from various sources (including library catalogs) via OAI-PMH, provides tools for cleaning up and normalizing this metadata, then re-exposes the enhanced metadata for use in the XC discovery system or any other discovery layer desired. The Metadata Services Toolkit is made up of several components: a normalization service, transformation service, aggregation service, and (planned) authority control service. This software has been released under an open source license and is available for others to implement at their own institutions. Slides from Bowen’s presentation to the MIG may be found at http://www.presentations.ala.org/index.php?title=Image:Bowen_ALCTS_Metadata_Interest_Group_Midwinter_2010.pdf. Bowen also gave presentations on other aspects of the XC project at other ALA Midwinter 2010 events, including to the FRBR Interest Group and the Authority Control Interest Group (ACIG). The eXtensible Catalog project home page, which includes links to download the open source XC toolkit, is http://www.extensiblecatalog.org.

In the second presentation, Roy Tennant of OCLC Research reported on informal work he has performed towards a “Massive Metadata Mashup.” OCLC’s Cloud Library project is investigating the intersection of materials in offsite storage and digitized content to see if there are operational efficiencies to be found for libraries. Tennant’s work involved downloading the freely-available Hathi Trust metadata records, using the OCLC XID service to enhance these records with OCLC numbers, then merge these enhanced records with WorldCat records and data on holdings in storage facilities to gain an overall picture of the overlap. While the project lessons overall are interesting, to the MIG audience, the mechanisms to perform this work are perhaps more important. Tennant presented three lessons learned related to metadata issues: identifiers are essential when dealing with large metadata sets, sometimes you need to go outside of standards to move forward with a project, and when dealing with large data processing activities, small incorrect assumptions or mistakes can have large consequences. Slides from Tennant’s presentation to the MIG may be found at http://presentations.ala.org/index.php?title=Image:RoyTenant_metadata_compressed.pdf.

The Metadata Interest Group business meeting was brief, and primarily focused on reports from liaisons. Jenn Riley, MLA liaison to MIG, reported on MLA activities in the business meeting, including the work of the MLA BCC Metadata Subcommittee in partnership with the ALA Association for Library Collections and Technical Services (ALCTS) Preservation and Reformatting Section (PARS) Audio Metadata Task Force to create an online resource explaining the use of a variety of metadata standards related to digital audio, and metadata-related sessions to be presented at the upcoming MLA 2010 Annual Meeting in San Diego. Among the business discussed of interest to the MLA community was an ALCTS preconference to ALA Annual 2010 on June 25, 2010, covering the XSLT technology: http://www.ala.org/ala/mgrps/divs/alcts/confevents/upcoming/ala/ac10/sxlt.cfm, and a session the MIG will host at ALA Annual 2010 on projects spanning the divisions between libraries, archives, and museums.

The Metadata Interest Group maintains a blog at which summaries and links to presentation slides from other ALA Midwinter 2010 meetings and presentations relevant to metadata issues are listed. This blog may be found at http://blogs.ala.org/nrmig.php.

ALCTS Preservation and Reformatting Section (PARS), Audio Metadata Task Force

The BCC Metadata Subcommittee is partnering with the ALCTS PARS Audio Metadata Task Force to create an online resource helping implementers to select appropriate metadata standards for digital audio projects. The Task Force did not have a formal meeting at Midwinter, but several representatives did meet informally to plan a strategy for completing the project by February 2010. This informal meeting was also attended by Charles Kolb, a program officer at the National Endowment for the Humanities (NEH), who expressed interest in endorsing the resource developed for use by its grantees.

Other ALA Midwinter 2010 Sessions of Interest to MLA Members in the Area of Metadata

The FRBR Interest Group met Friday, January 15, from 10:30 AM – Noon. Discussions from this session of interest to the MLA metadata community include:

1)      The eXtensible Catalog (XC) project’s early decisions to display search results always as Manifestations, with data elements related to search terms from other entities, rather than displaying the entity with the term that matched the user’s query; and their planning for “hourglass” displays (grouped around a Manifestation) and “pyramid” displays (grouped around a Work).

2)      IFLA plans to embark on work to harmonize the FRBR, FRAD, and FRSAD models once the final version of FRSAD is published.

3)      The European Library (TEL) project has posted online a prototype FRBRized system of literature written by Nobel Prize winners. The system may be found at http://digmap2.ist.utl.pt:8080/lgteFrbr/indexFrbrClustering.jsp.

At the Cataloging and Classification Section (CCS) Forum, two presentations are potentially of interest to the MLA metadata community. First is the presentation “Could we Create a Semantic Web Data Model for Subject Cataloging?” by Martha Yee, Cataloging Supervisor at the UCLA Film & Television Archive. Yee’s research is currently focusing on how library metadata might be represented in the context of the Semantic Web, demonstrating both what she has been able to model in this way, and potential barriers as well. Powerpoint slides for this presentation may be found at http://myee.bol.ucla.edu/Boston%20powerpoint.ppt. The second relevant presentation was by Tina Gross, Daniel N. Joudrey, and Arlene G. Taylor, on “The Effect of Controlled Vocabulary on Keyword Searching.” This project is a follow-up to a 2005 study published in College & Research Libraries assessing what proportion of searches performed would fail to match if controlled subject headings were not applied to records. The new study was performed on a catalog that had been enhanced with table of contents data since the first study was performed, which is commonly cited as one method to improve search results. The new study has resulted in a preliminary conclusion that for English-language materials nearly 25% of searches match only data in controlled subject headings, down from just over 35% in the first study.

One additional relevant presentation at ALA Midwinter was hosted by the Preservation and Reformatting Section (PARS) Intellectual Access to Preservation Metadata Interest Group. Lisa Schmidt, Electronic Records Archivist at Michigan State University, discussed digital curation planning at MSU. Work done so far includes a survey of units on campus that manage digital content, to understand their current metadata and data curation practices, which will push the development of specifications for campus-wide curation services. Presentation slides may be found at http://msudcp.archives.msu.edu/wp-content/uploads/2010/01/ALA-Midwinter-2010.ppt3.pdf. 

Submitted by Jenn Riley


Return to the 2010 Documents Menu
Return to BCC Historical Documents Menu
Return to BCC Home Page

Last updated March 11, 2010