Current Articles

DataONE to Deal with Data Deluge

Friday, November 20th, 2009 | Category: Digital Preservation

By Patricia Cruse, Director, University of California Curation Center

Researchers at the University of California have partnered with dozens of other universities and agencies to create DataONE (http://dataone.org), a global data access and preservation network for earth and environmental scientists that will support breakthroughs in environmental research. DataONE (Data Observation Network for Earth) is one of two $20 million awards made this year as part of the National Science Foundation’s (NSF) DataNet program. The collaboration of universities and government agencies coalesced to address the mounting need for organizing and serving up vast amounts of highly diverse and inter-related but often incompatible scientific data. Resulting studies will range from research that illuminates fundamental environmental processes to identifying environmental problems and potential solutions.

The National Center for Ecological Analysis and Synthesis (NCEAS) at UC Santa Barbara, the Department of Computer Science and Genome Center at UC Davis, and the California Digital Library at the UC Office of the President are integrally involved in the NSF DataONE initiative. Across these UC partners, the several million dollar award will drive advanced research and data acquisition, storage, mining, integration, and visualization for DataONE. The resulting computing and processing “cyberinfrastructure” will be made permanently available for use by the broader UC community and international science communities. DataONE is led by the University of New Mexico, and includes additional partner organizations across the United States as well as from Europe, Africa, South America, Asia, and Australia.

Read more at the UC Newsroom, which sent out a press release on 11-18-2009.

The press release can also be found http://www.cdlib.org/ and http://www.cdlib.org/news/index.html.

HathiTrust Large Scale Search

Friday, November 20th, 2009 | Category: Collection Development

By Heather Christenson, CDL Mass Digitization Project Manager

Effective November 18th, the HathiTrust Digital Library is now providing full-text searching capabilities across the entire library of 4.6 million volumes (1.6 billion pages) in the collection. Researchers can now search public domain and in-copyright works by keyword or phrase.

Based on open source Solr/Lucene technology, the service expands on an experimental search of public domain volumes introduced in November 2008. The CDL Discovery & Delivery team participated in testing the full-text search ahead of this release.

Full-text search will continue to be supported across the repository as it grows at a rate of hundreds of thousands of volumes every month. The UC Libraries currently have over 750,000 digital volumes in the HathiTrust, and the number continues to grow.

UC is a founding member of the HathiTrust, a collaborative enterprise of 25 leading research libraries. UC participation is coordinated by the California Digital Library (CDL), which brings its extensive experience in digital curation and shared online services to the HathiTrust.

The HathiTrust large scale search is available at: http://catalog.hathitrust.org.

For more information, please see the official press release: http://www.hathitrust.org/press.

Follow John Muir on Twitter and Facebook

Thursday, November 19th, 2009 | Category: Digital Special Collections

By Sherri Berger, Digital Special Collections Program Coordinator

This December, hear renowned California writer and naturalist John Muir (1838-1914) in his own words as he travels to California, encounters Yosemite for the first time, and works to preserve the open land he calls home.

To raise awareness of Muir’s newly digitized letters, Digital Special Collections will be quoting portions of them on Calisphere’s Twitter and Facebook pages.  Each installment or “tweet” will contain a segment of Muir’s stirring prose and a link to the original document and transcript.  The story will unfold over one week, starting December 1.

To hear Muir’s story, become a fan on Facebook (www.facebook.com/calisphere) or follow us on Twitter (www.twitter.com/calisphere).  Not a member of either network?  No problem—both accounts are open for viewing by all.

After the event, check back in on Calisphere’s social networking pages to stay up-to-date on new content and developments, as well as learn about related news, tools, and resources scouted on the Web.  We also welcome your questions and comments in these new forums.

This online event aims to engage students, educators, and the general public with the recent online publication of more than 6,500 of Muir’s letters—a collaborative achievement of CDL, The Bancroft Library at the University of California Berkeley, and the University of the Pacific Library (Learn more).

Meet Stephen Abrams

Thursday, November 19th, 2009 | Category: Staff News

By Ellen Meltzer, Information Services Manager; Photo by Craig Thompson, Web Producer

Stephen Abrams

How extraordinary to have an undergraduate senior thesis portend the themes throughout one’s career! That’s the case for Stephen Abrams, CDL’s Senior Manager for Digital Preservation Technology who arrived at CDL in February of 2008.  (Members of the University of California Curation Center, UC3 (previously known as the Digital Preservation Program),of which Stephen is a member, also include Patricia Cruse, Scott Fisher, Erik Hetzner, John Kunze, Margaret Low, David Loy, Mark Reyes, Tracy Seneca, Marisa Strong and Perry Willet.)

Stephen provides leadership in guiding the UC3 primarily in 3 areas:

First, the Digital Preservation Repository (DPR).  The DPR is the primary technical infrastructure that manages long term retention of digital objects.  The DPR is moving to a new generation of software; the earlier software was originally designed nearly 6 years ago.  In the intervening years, Stephen points out that we’ve have learned a great deal about the best way to provide preservation services and are at the beginning of a major project to re-conceive and re-implement the repository.  One of main goals we’re trying to accomplish is to ensure the new repository will be more responsive to needs of customers, especially as our customers are becoming more varied, both in the types of units that contribute to the repository and the types of contents we’re preserving.  Traditionally we have worked closely with campus libraries to preserve cultural heritage texts and images.  More recently, we’ve expanded our scope to include new campus constituencies interested in data sets in the social and experimental sciences.

Stephen states that we need to expand our capacity to deal with new content types and an increasingly diverse set of users while still continuing to support our traditional users.  One way to do this is by a new conceptualization of the repository.  Previously, we thought of the repository as a large monolithic system or place, managed centrally.  That concept breaks down when dealing with diverse sets of content with diverse sets of requirements.  CDL is now working on devolving our preservation functions into a set of independent, but interoperable micro-services.  Since each is small and self-contained, they are collectively easier to develop, maintain, and enhance.  Although each is narrow-scoped in function, complex behavior can nevertheless emerge through the strategic combination of the services.

Second, Stephen oversees the Web Archiving Service (WAS), keeping an eye on it to ensure that it remains consistent with our other initiatives. The Web Archiving Service, ably run by Web Archiving Coordinator Tracy Seneca, has been in operation for about a year; recently, we began providing public access to web resources (see http://cdlinfo.cdlib.org/blog/2009/07/08/public-access-to-web-archiving-service-goes-live/).

Third, Stephen serves as lead on the multi-year, multi-institutional, NDIIPP-funded JHOVE2 initiative.  In this project, the CDL is collaborating with Stanford and Portico to develop a next- generation open source format-aware characterization system.  (At this point, I needed to ask what that was.) 

Stephen explained that characterization is an automated process of determining the significant properties of digital objects.  Any digital object is a representation governed by rules of format that specify syntactic and semantic requirements.  During characterization we can examine an object and, by being cognizant of the underlying format rules, we can extract the significant properties.  In a digital document, for example, we want to know the fonts used to be able to ensure that we can properly continue to display the text in the future. For digital images, we need to understand the way in which color is represented to ensure accurate reproduction.

JHOVE1, which Stephen helped create, was widely used in the preservation community; now it’s 5-6 years old and has some inadequacies.  One of the goals of JHOVE2 is to remedy that, and to provide new features.

Characterization becomes important when operating a Preservation Repository.  Sometimes it’s clear what format you’re expecting to receive—depositors can tell you in great detail; other times you don’t know what you have until it arrives.  It’s useful, still, to verify what you did actually receive; people and systems make mistakes. Sometimes you get things you don’t expect.  Characterization also helps to categorize items in order to take advantage of efficiencies by automating processes.  This can only be done effectively if parallel workflows are properly classified.  Characterization is a way to decide which workflow something goes into.  Audio files are different from documents; color images are different from bi-tonal ones.  This is far more than you may want to know on these subjects, but Stephen is someone who is passionate about what he does and I felt he could have continued to speak rapturously about these subjects.

Immediately before arriving at CDL, Stephen served as Digital Library Program Manager at Harvard University Library. And prior to his work at Harvard, he spent 9 years at MIT working as a research engineer in the Department of Ocean Engineering where he worked on grant-funded software for the design and manufacture for naval vessels.  His expertise was on scientific and engineering visualization, where he turned numbers into pictures.  As the Cold War wound down in the late eighties, there were fewer funding sources for these projects.  He began working on information retrieval problems for the Department of Commerce and Interior.  The information retrieval problems lead Stephen to the world of digital libraries.

It was hard for me to imagine that even before this, Stephen spent 9 years at a small company in Pennsylvania: Swanson Analysis Systems—leading developers of finite element analysis used in structural analysis.  There he also worked on the development of engineering visualization solutions.

Now, back to where we began.  Stephen’s undergraduate thesis was on a problem in celestial mechanics — the Three-body problem (I encourage you to look this up in Wikipedia, or elsewhere). One aspect of his research was to develop a graphics display system, in which he had to program the math involved and program for visualization.  With an undergraduate degree in mathematics from Boston University and a Master’s Degree in art and architecture from Harvard, Stephen went looking for work on the scientific side of the two choices “It pays better”, he quipped.  The themes that interested him in his undergraduate thesis have followed him throughout his career.

Stephen was aware for some time of the interesting and innovative work going on at the CDL, the University of California, and partner institutions. Coming here provided Stephen with the opportunity to apply himself more deeply to the “incredibly important” problems in digital preservation.  Of course, transplanted easterners always are drawn by the weather, but there were many things professionally and personally that drew him here.

The challenges are real: There is more useful work that could be done than time to do it.  The main thing is trying to prioritize appropriately—you put together a multi-year road map so that we can be where we need to be at the end of the day; approaching larger problems through small incremental steps.   In addition, he finds there’s such a broad constituency at UC with people working on amazingly innovative things.  Attempting to come up with comprehensive and effective solutions for any one thing can be a great challenge–just trying to ensure our services remain responsive to users as their needs are known now and as they change is daunting.  We’re so glad Stephen is on board to help tackle these demanding issues.

Next Generation Melvyl Pilot Enhancements - November 8, 2009

Thursday, November 12th, 2009 | Category: Bibliographic Services

By Ellen Meltzer, CDL Information Services Manager

Several enhancements were made to Next Generation Melvyl with OCLC’s Sunday, November 8, install
The changes include:

  • Parenthetical (Boolean) support in search queries.  You can now use parentheses to create more precise searches.  A search on dog (walking OR feeding OR grooming) will return results for dog walking OR dog feeding OR dog grooming.
  • Additional custom web links.  With the approval of UC Heads of Public Services (HOPS), two additional links soon will be added for each campus in the dropdown menu under the library name (e.g., UCR Libraries).  These will link to the campus Article Database and E-journal links, among the most heavily used links in current Melvyl.
  • Improvements in treating some item types as a different item type by configuring certain tags, subfields and/or values contained in the data.
  • Improvements to “Browse similar items” in the “carousel” on the “Similar items” section of the detailed record.
  • Changes to Details section for remote database records.
  • Upon saving a search, users will now receive confirmation that their search has been saved and will see a link to their profile page to view their saved searches.

Please see the PDF for more details.

Mellon Planning Grant Awarded to UC Libraries for a Western Regional Storage Trust

Tuesday, November 3rd, 2009 | Category: Collection Development

Emily Stambaugh, CDL Manager of Shared Print

The Andrew W. Mellon Foundation has awarded the University of California Libraries a nine month planning grant to organize the “Western Regional Storage Trust (WEST)” — a shared print repository service, focused initially on retrospective journal archives. UC Libraries in collaboration with regional library partners will band together to prepare service models for consolidating print journal holdings in responsible ways to provide efficiencies throughout the libraries. WEST is envisioned as a robust partnership that will allow libraries to build and manage a cooperative regional archive at the network level.  The proposal calls for library leaders in the Western Region of the United States to convene in Oakland, CA to (1) design operating, governance, and business models to support cooperative print archives among diverse partners; (2) establish standards for low-level validation and disclosure; and (3) develop selection criteria incorporating risk-management principles to ensure persistence within the broader context of similarly intentioned national and international efforts.

Library leaders from UC, Orbis-Cascade, GWLA, SCELC, Stanford, CalTech, Occidental College and more will band together to design the Western Regional Storage Trust incorporating sustainable models for participation amongst a wide variety of partners.  We are pleased to announce that Lizanne Payne, Director of the Washington Research Library Consortium, will serve as the consultant for this process.

For more information, please contact Emily Stambaugh, CDL Shared Print Manager.

The Bibliographic Services Team has a new name

Tuesday, November 3rd, 2009 | Category: Bibliographic Services

By Patricia Martin, Director, Discovery & Delivery

Do you know what the name Bibliographic Services means? We couldn’t agree on it either, so we decided to change the name of our team from Bibliographic Services to a name that better describes the services we provide as a CDL team.  The team that brings you Melvyl®, Next Generation Melvyl®, UC-eLinks, and Request is now the Discovery & Delivery team.

How is this name more relevant to what we’re doing? We’ve seen a shift in the way scholars do research. Discovery and delivery are tightly aligned services — researchers expect access to publications at the same time as they find them. The core library services we provide extend beyond managing bibliographic data — we’re connecting people to what they want. UC-eLinks, for example, is a popular web application that provides UC faculty and students with a quick and reliable way to link directly to articles from the library catalog or other sites like PubMed or Google Scholar.

What can you expect from the Discovery & Delivery team looking forward? We’re building on our years of metadata expertise and expanding further into the delivery realm. We’re exploring new territory in collaborative initiatives like Hathi Trust, where members of our team recently implemented an open-source page turner for the mass digitized books on the Hathi Trust website.

Where will you see our team’s new name? You will see Discovery & Delivery on the CDL website early next year. Our name is 100% acronym free, but you can call us the D&D team for short. We have already garnered several nicknames, including “disco-tech” for our technical team.

John Muir Correspondence: On Calisphere, OAC and Web 2.0

Thursday, October 29th, 2009 | Category: General, Digital Special Collections

By Mary Elings, Archivist for Digital Collections at The Bancroft Library, UC Berkeley and Sherri Berger, Digital Special Collections Program Coordinator, CDL

Muir Letters Online

CDL’s Digital Special Collections, The Bancroft Library, and The University of the Pacific Library are pleased to announce the availability on the OAC and Calisphere of over 6,500 letters from the correspondence of John Muir, 1838-1914.

One of the most important historical figures in California history, Muir was a renowned California naturalist, explorer, writer, and conservationist.  Online access to his correspondence will provide users with new insight into Muir’s life, as well as topics such as California history, Yosemite National Park, the Sierra Club, and the American environmental conservation movement.

Previously, access to the thousands of letters written and received by Muir was limited to original copies scattered across the United States and a few microfilm versions in California.  Now the digital collection is available to everyone online.

The Bancroft Library partnered with The University of the Pacific Library to digitize and publish these important historical documents, with technical support from CDL.  The project was supported by the U.S. Institute of Museum and Library Services under the provisions of the Library Services and Technology Act, administered in California by the State Librarian.

Coming Soon: Follow Muir on Facebook and Twitter!

To celebrate the publication of the Muir letters and engage a broad audience with them, Digital Special Collections will be hosting a Web 2.0 “event” in early December, details forthcoming.  For a week, Muir will “speak” to the public, quoting portions of his correspondence through a series of chronological installments on Calisphere’s Facebook and Twitter accounts.  Hear Muir in his own words as he explores Yosemite and works to protect the vast American West.

To participate in the event and stay updated on Calisphere news and developments, become a fan on Facebook (www.facebook.com/calisphere) or follow us on Twitter (www.twitter.com/calisphere).

Emily Stambaugh in Print

Wednesday, October 28th, 2009 | Category: Staff News

By Jayne Dickson, CDLINFO Editor

The Association of Research Libraries (ARL) is initiating a new series of invited reports addressing emerging roles for research libraries.  The New Roles for New Times series will begin publication with five reports in 2010.  The reports will identify and delineate emerging roles for research library staff and present research on early experiences among ARL member libraries in developing the roles and delivering services.

Emily Stambaugh, CDL’s Manager of Shared Print, is writing the report on New roles in providing print collections: remote storage and collection consolidation.  Other reports being developed are:

  • Transforming liaison librarian work
    Karen Williams, University of Minnesota
  • Repository services
     Sarah Shreeves, University of Illinois
  • Digital curation and preservation
     Tyler Walters, Georgia Tech
  • Library roles in promoting graduate students’ development of research skills and understanding of scholarly communication
    Lucinda Covert-Vail and Scott Collard, NYU

Each report will describe the emerging role, articulating the audience affected by the new role and the benefits various constituencies experience as a result of the new role.  Reports will highlight existing work, report authors’ findings, and offer analysis of trends, best practices, and key issues. Reports will be freely available as PDF files on ARL’s New Roles for New Times Web site http://www.arl.org/rtl/nrnt/.

Complementing the report set, ARL will work with the New Roles authors to organize corresponding webcasts on each topic.  Webcasts will be scheduled to follow a report’s release.

CDL and CDLINFO are now on Twitter!

Tuesday, October 20th, 2009 | Category: General

By Joan Starr, Manager, CDL Strategic and Project Planning

CDL has joined many of our colleague and partner institutions, like UC Riverside Libraries, UCSF Libraries, UC Press, Internet Archive, Hathi Trust, OCLC, and many others in creating a Twitter account.  We have done this for several reasons, including:

  • as an additional way to get CDLINFO out into the world;
  • as a way to amplify the voice and message of the accounts the main CDL account will follow. “Following” simply means to receive the other Twitter account’s updates, or “tweets;”
  • and as a way to promote our visibility, in keeping with our values of openness and sharing.

The CDL account is called CalDigLib (http://www.twitter.com/caldiglib) and we encourage you to follow it if you are a Twitter user.  Even if you are not a Twitter user, you can view it by simply going to the URL. You will find this content:

  • CDLINFO articles: the headline  with a link to the full article
  • Tweets from accounts followed by CalDigLib — accounts that feature CDL Program and Service news, announcements, resources, etc.

If you currently receive CDLINFO via RSS and you are a Twitter user, you may wish to consider following the new CDL Twitter account and receiving your CDL news in this manner.  If you currently receive CDLINFO via email, and have been looking for a reason to try Twitter, this might be a good time to take the leap!

The first CDL Program we will be following is the brand-new eScholarship account.  As time goes by, more Programs and Services will build Twitter into their marketing and communication plans.  Why? Because, as our friend Roy Tennant recently blogged, "Twitter is the new RSS" (http://www.libraryjournal.com/blog/1090000309/post/290048229.html).  For some of our audiences, at least, this is increasingly the best way to connect.

We hope some of you will join us by following this new account.  Of course, we know some of you are already there! We look forward to a lively exchange as we all get to know this new channel for communication.

For any questions and more information, please contact Joan Starr (joan.starr@ucop.edu) or (@joan_starr on Twitter).

Next Page »

Powered by WordPress and CDL Web Production