Project to control headings in WorldCat

Posted August 22, 2008 by Lois Reibach
Categories: Glenn Patton, OCLC, Worldcat

Back in the spring OCLC began a project to control headings in Worldcat.  In his blog Outgoing Thom Hickey described this project here.  Among the reasons behind the project:

Getting more of WorldCat’s headings linked to authority records has a number of benefits.  It gives us a chance to merge some variant forms of headings and makes it easier to update the database when names change.  This has become a substantial problem for us since LC changed their policy on adding death dates to headings.  Right now we are working our way through the a set of fairly easy 26 million headings, personal names that match an authority record on multiple subfields.  If this works, we will look at controlling names that are harder to match.

The project was on hiatus for part of the summer, but as Glenn Patton reports to OCLC-CAT in a post on Aug 22, 2008 [permission to quote]

On August 4, 2008, OCLC restarted the project to control headings in WorldCat.  As of yesterday, almost 1.3 million headings have been controlled since the project was restarted, bringing the total number of newly controlled headings to just under 17 million.

I think this is an excellent use of resources, and am glad to see that the project has resumed.

UPDATE:  On September 5, OCLC completed the first phase of the project to control more headings in WorldCat.  A total of about 25.5 million new personal name headings were controlled during this first phase. – from post by Glenn Patton to OCLC-CAT on Sept. 9, 2008.

Subject headings derived from authority records

Posted August 21, 2008 by Lois Reibach
Categories: authority data, subject access

On Everybody’s Libraries John Mark Ockerbloom describes a project to add subject headings to the records for an online book collection.  The collection had brief records that lacked subject access, but did have call numbers.  Recognizing that subject authority records frequently include call number ranges, he created a program that used the call number to search authority records and return subject headings.

After filtering out some bad data in a few authority records, and suppressing some terms to break ties, I ran the program, and instantly got subject terms for tens of thousands of books. Some of them were pretty generic (thousands of books were simply labeled “English literature”, for instance), but many were quite specific, and I’d say over 90% of the time were useful descriptions of the book. The maps I built largely based on these assigned subjects worked pretty well from day one.

I really like the way he leveraged the work of the catalogers that created the authority data and the existing call numbers to create subject access for a collection.

IFLA discussion on FRSAR

Posted August 12, 2008 by Lois Reibach
Categories: FRAD, FRSAR, Glenn Patton

Over on The FRBR Blog, William Denton reports on the Working Group on FRSAR meeting. The section most relevant to this blog was:

Glenn Patton of OCLC was at the table to discuss FRAD, he chairing the WG on that, and FRAD took up the first hour of the meeting. He said they’d made some changes to the latest draft based on comments, and part of it will be removed and made available separately.

There was some involved discussion of what is an entity, why name and controlled access point are two different entities and not the same, etc. Patton said they wanted to separate the attributes of a person, the names by which they are known, and the access points by which they are found. It seemed easiest to do this by having different entities. Person is related to name is related controlled access point. “Define your entities at least in part by what you want to do with them,” he said, which is why, for example, Family and Agency are different from Corporate Body though they’re really kinds of Corporate Body.

Since I’ve been trying to get a handle on attributes of a person and how this relates to controlled access point, I found these comments interesting.

International Society for Knowledge Organization 2008 conference

Posted August 9, 2008 by Lois Reibach
Categories: Names Project

Amanda Hill reports on ISKO 2008 here.  It was the first I had heard of the Names Project, which I see I will have to follow, given these comments:

The questions at the end of my talk included one from a librarian who took exception to the Names project’s aim of not having a preferred form of name for individuals. I had thought that this would be more acceptable now than it had been in the past – it’s a distinction between authority control and access control that has been fairly widely discussed in the library literature, but apparently it is still controversial. Interesting!

Karen Coyle’s fall 2007 post on name authority control

Posted August 4, 2008 by Lois Reibach
Categories: Karen Coyle, authority control

When I had bookmarked this post to come back to, I hadn’t noticed that it appeared last September.  However, an issue I just saw raised on AUTOCAT reminded me that I hadn’t come back and commented.  In her post, Coyle focuses on the following issues:

There are some problems with the current method used by libraries to realize authority control, not the least of which is that it is a difficult and expensive process and the number of authors is growing rapidly as we all become creators in this information age. I want to address here 3 aspects of name authority control that are especially non-functional: 1) the use of dates as distinguishing characteristics is not easy for the catalogers creating the authority record 2) the use of dates as distinguishing characteristics does not help the users 3) the name heading is not a legitimate identifier because it may change.

I don’t have a good answer to the problem, but I remember far too many times spent clicking on every heading from a given time period, trying to identify a common name from the religious world in the middle of the last millenium.  I don’t think eliminating the dates is the answer, as there is value in being able to limit to a time period, but it would be helpful to librarians and patrons alike if an identifying phrase could be added to the established heading on a regular basis.

Experimental Spanish version of LCSH

Posted July 25, 2008 by Lois Reibach
Categories: LCSH

An alert just brought me news that there is a new version of lcsh-es.org LC Subject Headings in Spanish can be searched here and is described as:

This an experimental Spanish-English/English-Spanish list of subject headings. Please use it with caution! The data is not completely current a much of it has not been checked thoroughly. For more information see the Sources and About pages.

I think this source would have been helpful to students in the hispanic theology program at my former institution that couldn’t always think of the correct term to search in English.

Fiscal Year 2008 Recap of OCLC Quality Control and CIP Upgrading Activities

Posted July 24, 2008 by Lois Reibach
Categories: Glenn Patton, OCLC, Worldcat

Over the next few days, I want  to go back and post on some things that occurred just as I was contemplating or launching this blog.  On July 9, Glenn Patton, Director, WorldCat Quality Management, reported to OCLC-CAT [permission to quote]

For the fiscal year just concluded (July 2007 through June 2008), OCLC staff have:
- corrected 2,137,903 bibliographic records
- performed 207,742 merges resulting in the deletion of 317,022 bibliographic records
- added 165 new authority records
- updated 395,817 authority records
- referred 765 authority records (name and subject) to the Library of Congress for resolution
Many of the changes resulted from the processing of 182,348 change requests, an increase of 21% over the same period of fiscal year 2007.
During the same time period, OCLC CIP Upgrade Unit staff have:
- upgraded 20,501 CIP records
- added 1,006 new authority records
- updated 857 authority records
These figures for authority records include 450 new series authority records and updates to 247 existing series authority records.  This activity with series authorities continues to grow as a result of OCLC’s adapting to the Library of Congress’s stopping series authority work.

I’m interested in the increase in change requests and wondering if it has to do with the increased use of Worldcat as a front end.  I know that as my institution taught more students to access our holdings through Worldcat for various reasons, I sent in more change requests for corrections I couldn’t make myself, particularly if they affected access.

LCSH 2008, weeks 26-28

Posted July 22, 2008 by Lois Reibach
Categories: LCSH

On lower east side librarian Jenna has pulled information from Library of Congress Subject Headings Weekly Lists and added her own commentary.  For example

150 Balloon sculpture [May Subd Geog] [sp 85011322]
* 450 UF Balloon modelling
* 450 UF Balloon twisting
* 450 UF Sculpture, Balloon
* 550 BT Sculpture CANCEL
If I were a balloon artist, I might be up in arms about my art no longer being considered a form of sculpture.

Makes for some useful and entertaining reading.

AALL: The Future of Subject Access in the 21st Century

Posted July 21, 2008 by Lois Reibach
Categories: Barbara Tillett, LCSH, Lois Mai Chan

Beverly Burmeister reports on a meeting that took place at AALL.  The speakers were Barbara Tillett and Lois Mai Chan.  From the middle of the report:

For the future, Dr. Tillett predicted that the global use of LCSH will expand and that the process will be improved to reduce costs and at the same time maintain the system.

In her presentation, Dr. Chan offered her suggestions for changes to LCSH which would be theoretically sound but at the same time be more useful to OPAC and web users. 

Interesting predictions on how LCSH can be maintained yet brought in line with current trends.

 

Literature on authority control

Posted July 21, 2008 by Lois Reibach
Categories: Robert E. Wolverton, authority control

The issue of Library Resources & Technical Services (vol. 52, no. 3, July 2008) that I recently received contains a review of the literature of cataloging and classification from 2005-06.  While I haven’t worked my way through everything listed, I wanted to point out a piece that is especially relevant to readers of this blog.  Robert E. Wolverton, “Becoming an Authority on Authority Control: An Annotated Bibliography of Resources,” Library Resources & Technical Services 50, no. 1 (2006):31-41.