<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/wordpress-mu-1.0" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>CDLINFO</title>
	<link>http://cdlinfo.cdlib.org</link>
	<description>California Digital Library News</description>
	<pubDate>Wed, 04 Nov 2009 21:42:37 +0000</pubDate>
	<generator>http://wordpress.org/?v=wordpress-mu-1.0</generator>
	<language>en</language>
			<item>
		<title>iPRES 2009 hosted by CDL</title>
		<link>http://cdlinfo.cdlib.org/blog/2009/10/15/ipres-2009-hosted-by-cdl/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2009/10/15/ipres-2009-hosted-by-cdl/#comments</comments>
		<pubDate>Thu, 15 Oct 2009 22:06:42 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>General</category>

		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2009/10/15/ipres-2009-hosted-by-cdl/</guid>
		<description><![CDATA[<p>On October 5-6 2009, over 300 people from 22 countries attended iPRES 2009 at the Mission Bay Conference Center on the UCSF Mission Bay campus. iPRES 2009 was the sixth in an annual series of conferences devoted to digital preservation, and with the 300 attendees, the largest ever.</p>  ]]></description>
			<content:encoded><![CDATA[     <link rel="alternate" type="application/atom+xml" title="CDLINFO Category: General" href="http://cdlinfo.cdlib.org/blog/category/general/feed/" />
     <link rel="alternate" type="application/atom+xml" title="CDLINFO Category: Digital Preservation" href="http://cdlinfo.cdlib.org/blog/category/digital-preservation/feed/" />
<p>By Perry  Willett, CDL Digital Preservation Services Manager</p>
<p>On October 5-6 2009, over 300 people from 22 countries attended <a href="http://www.cdlib.org/iPres/">iPRES 2009</a> at the Mission Bay Conference Center on the UCSF Mission Bay campus. iPRES 2009 was the sixth in an annual series of conferences devoted to digital preservation, and with the 300 attendees, the largest ever. &nbsp;This year&#8217;s conference was coordinated by the UC Curation Center (the new name of CDL&#8217;s Digital Preservation Program),  with the program committee chaired by Trisha Cruse. &nbsp;Perry Willett was the project manager for the conference; Beaumont Yung and Rondy Epting-Day provided significant administrative support, as did Megan Amaral,  CDL&rsquo;s student intern from SJSU.</p>
<p>The theme of this year&#8217;s conference was &quot;Moving into the mainstream. &nbsp;Enabling  our digital future.&quot; &nbsp;The program was packed with thoughtful and thought-provoking presentations on all aspects  of digital preservation. Some notable presentations included  keynote addresses by David Kirsch (University of Maryland) on public interest in corporations&#8217; business archives;  Micah Altman (Harvard University)  on public archives  for scientific  data; and a panel discussion by  members of the Blue Ribbon Task Force  on Sustainable Digital Preservation and Access.</p>
<p>The conference  was extremely successful, and we&#8217;ve received  much &quot;iPraise.&quot; In  addition to hosting the conference, this was an important opportunity for CDL to showcase our recent work on curation  micro-services  and web archiving, and to speak with  current and potential partners  interested in working with us. &nbsp;The paper by Stephen Abrams, John Kunze and David Loy (delivered by Stephen) on curation  micro-services  was particularly well-received, with several highly positive Twitter tweets  and blogposts during and after the conference.</p>
<p>CDL staff  participated in many ways, with Stephen Abrams, Trisha Cruse, John Kunze, Tracy Seneca,  and Perry Willett serving on the program committee. &nbsp;Stephen and Tricia were also presenters at the conference, and Tracy  and Heather Christenson gave poster sessions. &nbsp;Many people served as reviewers  for the program, including (in  addition to the CDL staff members already mentioned) Scott Fisher, Martin Haye, Erik Hetzner, John Ober and Lisa Schiff. &nbsp;Many people from UC campuses also served as reviewers and helped with the local arrangements. &nbsp;Thanks to all of them for their contributions.</p>
<p>In addition  to the scholarly program, we had a  full slate of social events including the conference reception at the California Academy of Sciences on Monday evening. &nbsp;During the conference,  Rick Prelinger presented &quot;Lost  Landscapes of San Francisco,&quot; a film that includes rare footage of  the city from newsreels, industrial  documentaries and amateur films. &nbsp;A special &quot;apr&egrave;s iPRES&quot; event was held at the Hi Dive in San Francisco  on Tuesday evening after the conference. &nbsp;The conference was part of what was informally called &quot;digital preservation week in San Francisco,&quot;  with additional events and meetings later in the week sponsored by the  International Internet Preservation Consortium, the JHOVE2 project, and Sun Microsystems.</p>
<p>The conference  website at <a href="http://www.cdlib.org/iPres/">http://www.cdlib.org/iPres</a> contains the complete program, along with an archive of Twitter tweets by conference attendees, photographs from the conference and social events on Flickr, and (eventually) video, PowerPoint and full  papers from the presentations. &nbsp;See the <a href="http://www.cdlib.org/iPres/networking.html">Amplified Conference</a> page  for photos, blogposts and tweets.</p>
<p>Special thanks go to the vendors who supported the conference:  Sun Microsystems, Isilon Systems,  ExLibris, Institute of Museum and Library  Services (IMLS), Tessella, the Joint  Information Systems Committee (JISC), NetApp, FileTek, Library of Congress and  the National Digital Information Infrastructure  and Preservation Program (NDIIPP), and DuraSpace. &nbsp;Their support went a long way toward underwriting the costs  of the conference.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2009/10/15/ipres-2009-hosted-by-cdl/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Public Access to Web Archiving Service Goes Live</title>
		<link>http://cdlinfo.cdlib.org/blog/2009/07/08/public-access-to-web-archiving-service-goes-live/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2009/07/08/public-access-to-web-archiving-service-goes-live/#comments</comments>
		<pubDate>Wed, 08 Jul 2009 20:15:11 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>General</category>

		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2009/07/08/public-access-to-web-archiving-service-goes-live/</guid>
		<description><![CDATA[<p>The California Digital Library is pleased to announce public access to the archives built with the Web Archiving Service. The tools that CDL provides to capture, curate and preserve websites now also enable librarians to publish the archives they’ve created for the benefit of researchers and the general public.

</p>]]></description>
			<content:encoded><![CDATA[<p>By Tracy Seneca, Web Archiving Coordinator</p>
<p>The California  Digital Library is pleased to announce public access to its Web Archives.&nbsp; CDL&rsquo;s Web Archives are built and published  using its Web Archiving Service  (WAS), which enables librarians to capture,  curate, and preserve websites for the benefit of researchers and the  general public.&nbsp; New archives  are continually being built and  published, and will appear along with the current archives available at <a href="http://webarchives.cdlib.org/">http://webarchives.cdlib.org/</a>.</p>
<p>This first set of archives includes  materials from the California state government agencies, and local government agencies from Orange County,  San Diego, Los  Angeles and more.&nbsp; Also included  are archives  of Middle Eastern political organizations, American left-wing  organizations, and web content related to events such as the  2007 Southern California Wildfires and the  2003 California Recall Election.</p>
<p>As government agencies and public policy  organizations increasingly turn to the web as a primary means of publication, libraries are challenged to provide lasting access to the  budgets, studies and reports that they have long collected  for the benefit of the research community.&nbsp; The WAS service  also allows libraries to expand their collecting  scope to more ephemeral materials such as press releases, local commission meeting minutes, public forum sites, and blogs.&nbsp; All of these provide a glimpse of history  in the making for future researchers.&nbsp; The value of this  service is described in a recent Chronicle of Higher Education article, <a href="http://chronicle.com/free/v55/i41/41a00103.htm">Scholars Race to Preserve  Guant&aacute;namo Records</a>, which focuses on an archive currently  being built by New York   University using CDL&rsquo;s  Web Archiving  Service.</p>
<p>The <a href="http://was.cdlib.org/">Web Archiving Service</a>&nbsp;also enables the University of California (UC) libraries to work collaboratively on a monumental task: archiving the web sites of the  State of California. &nbsp; The State of California web domain (.ca.gov) represents the  third largest subdomain of the U.S.  government web presence.&nbsp; The UC campuses have worked individually to capture and archive local information, and collectively to archive state publications.&nbsp; Together, these  archives  represent a major achievement and a series of rich resources  for California  researchers.&nbsp; These archives can  also provide lasting access to the individual state publications that are catalogued and made available via UC&#8217;s <a href="http://melvyl.cdlib.org/">Melvyl catalog</a>.</p>
<p>The archives represent the  culmination of the Web-at-Risk grant, funded by Library of Congress&rsquo;  National Digital Information Infrastructure  Preservation Program, and led by the  California Digital Library.&nbsp; With our grant partners at the  University of North Texas and New   York University, and  our curatorial partners at the UC campuses,  NYU and Stanford University, we are embarking on new era  in collection  building for libraries.</p>
<p>If you have any questions about the  archives  or about the Web Archiving  Service, please contact <a href="mailto:washelp@ucop.edu" title="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2009/07/08/public-access-to-web-archiving-service-goes-live/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CDLers in Print</title>
		<link>http://cdlinfo.cdlib.org/blog/2009/05/27/cdlers-in-print/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2009/05/27/cdlers-in-print/#comments</comments>
		<pubDate>Wed, 27 May 2009 18:17:20 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2009/05/27/cdlers-in-print/</guid>
		<description><![CDATA[<p>Patricia Cruse, director of the CDL Digital Preservation Program, and Beth Sandore, have edited a special issue of Library Trends.  Also included is an article by CDL’s Web Archiving Service Manager, Tracy Seneca, entitled, The Web-at-Risk at Three: Overview of an NDIIPP Web Archiving Initiative. </p>]]></description>
			<content:encoded><![CDATA[<p>By Ellen Meltzer, Manager, Information Services</p>
<p>Patricia Cruse, director of the CDL Digital Preservation Program, and Beth  Sandore, associate university librarian for information technology planning and associate dean of libraries at the University of Illinois Library at Urbana-Champaign (who completed a rotation to the CDL in 2007) have edited a <a href="http://muse.jhu.edu/journals/library_trends/v057/57.3.cruse.html">special issue</a> of <em>Library Trends.&nbsp; </em>The issue is comprised of sixteen articles that tell fascinating stories about the ground-breaking efforts of numerous partners within the Library of Congress National Digital Information Infrastructure and Preservation Program (NDIIPP). &nbsp;Also included is an article by CDL&rsquo;s Web Archiving Service Manager, Tracy Seneca, entitled,<em> The  Web-at-Risk at Three: Overview of an  NDIIPP Web Archiving  Initiative. </em></p>
<p>Since its inception in 2004, NDIIPP has grown from an experimental program into a true partnership of concerned  organizations working together to sustain access to digital information critical to scholarship and cultural heritage  nationwide. </p>
<p>Congratulations,  Trisha and Tracy! </p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2009/05/27/cdlers-in-print/feed/</wfw:commentRss>
		</item>
		<item>
		<title>End of Bush&#8217;s Term: Will It Disappear from the Web?</title>
		<link>http://cdlinfo.cdlib.org/blog/2008/09/05/end-of-bushs-term-will-it-disappear-from-the-web/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2008/09/05/end-of-bushs-term-will-it-disappear-from-the-web/#comments</comments>
		<pubDate>Fri, 05 Sep 2008 18:16:06 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>General</category>

		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2008/09/05/end-of-bushs-term-will-it-disappear-from-the-web/</guid>
		<description><![CDATA[<p>Will the Homeland Security and No Child Left Behind websites disappear on January 20th 2009?  To ensure that the historical record of the current administration is not lost, a partnership of government and nonprofit agencies has taken responsibility for its preservation. </p>]]></description>
			<content:encoded><![CDATA[<p>By Hunter Stern, CDL Technical Writer</p>
<p>Will the Homeland Security and No Child Left Behind websites disappear on January 20th  2009?&nbsp; The answer might surprise you.&nbsp; January 20th 2009 will mark the beginning of a new presidential administration and the coincident end of the current administration, putting much of the online material related to its  policies and initiatives at risk.&nbsp; According to the Washington Post, &ldquo;Many federal agency records exist only in digital form and are in danger of disappearing when the administration changes&rdquo;  (August 20, 2008).</p>
<p>The University of California community, not to mention  scholars the world over, require perpetual access to these online materials in  the normal conduct of research, teaching, and learning. Even without a change  in administration, government records stored in digital form are notoriously  volatile.&nbsp; Web pages on government sites have an average life span of only  44 days.</p>
<p>To ensure  that the historical record of the current administration is not lost, a  partnership of government and nonprofit agencies has taken responsibility for  its preservation.&nbsp; The University  of California &ndash; California Digital  Library (CDL), in partnership with the Library of Congress, the Government  Printing Office (GPO), the Internet Archive (IA), and the University of North  Texas Libraries (UNTL) are planning the harvest and archival storage of more  than 100 million US  government web pages from the second George W. Bush administration.&nbsp; This  effort will involve the comprehensive harvest of the .gov domain as well as  focused Web harvests of specific government agencies.&nbsp; The goal is to  conduct a broad capture of all Federal government Web sites, and a deep capture  of specific high-priority sites that have been chosen by the project&rsquo;s  curators.&nbsp; Each partner plays a critical role in the project.</p>
<p>The  California Digital Library, a recipient of a Library of Congress National  Digital Information Infrastructure and Preservation Program (NDIIPP) grant, leads the Web-at-Risk project, a goal of which is &ldquo;to develop tools that enable librarians and archivists to capture, curate, preserve, and provide access to  web-based government and political information.&rdquo;&nbsp; These tools will be put  to use doing deep crawls of specific government agencies ranked as priority  sites by the project&rsquo;s curators.&nbsp; In addition to CDL, UNTL will be  responsible for conducting deep crawls.</p>
<p>The broad crawl will be the responsibility of the Internet Archive, a non-profit group  providing universal and permanent access to digital information for educators,  researchers, and the general public.&nbsp; IA will use its advanced  Web-crawling software, called Heritrix, to capture the intended sites.</p>
<p>In order to  prioritize the vast list of URLs included in the scope of the crawl, the University of North Texas has designed a software tool  that allows curators to nominate URLs for harvest and tag them with numeric rankings.</p>
<p>The Library of Congress, which has preserved congressional Web sites since December 2003, will focus on developing the overall harvesting plan.&nbsp; The GPO and the  libraries in its Federal Depository Library Program will assist in the curation  process.</p>
<p>For more  information on the End of Term project contact Patricia Cruse (patricia.cruse@ucop.edu), Director, Digital Preservation Program.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2008/09/05/end-of-bushs-term-will-it-disappear-from-the-web/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CDL Staff at IFLA Conference</title>
		<link>http://cdlinfo.cdlib.org/blog/2008/09/04/cdl-staff-at-ifla-conference/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2008/09/04/cdl-staff-at-ifla-conference/#comments</comments>
		<pubDate>Thu, 04 Sep 2008 18:02:19 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2008/09/04/cdl-staff-at-ifla-conference/</guid>
		<description><![CDATA[<p>Margaret Low, software engineer in Digital Preservation, recently returned from the IFLA Annual Conference in Quebec where she presented a very well-received paper on the CDL preservation infrastructure.  </p>

]]></description>
			<content:encoded><![CDATA[<p>By Stephen Abrams, CDL Manager for Digital Preservation Technology</p>
<p>Margaret Low, software engineer in Digital Preservation, recently  returned from the IFLA (International Federation of Library Associations and  Institutions) Annual Conference in Quebec,  where she presented a very well-received paper on the CDL preservation  infrastructure.&nbsp; The paper is available  for download at: <a href="http://www.ifla.org/IV/ifla74/papers/084-Low-en.pdf" title="http://www.ifla.org/IV/ifla74/papers/084-Low-en.pdf">http://www.ifla.org/IV/ifla74/papers/084-Low-en.pdf</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2008/09/04/cdl-staff-at-ifla-conference/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CDL Staff in Print</title>
		<link>http://cdlinfo.cdlib.org/blog/2008/07/15/cdl-staff-in-print/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2008/07/15/cdl-staff-in-print/#comments</comments>
		<pubDate>Tue, 15 Jul 2008 18:27:38 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2008/07/15/cdl-staff-in-print/</guid>
		<description><![CDATA[<p>CDL's Preservation Program staff member Erik  Hetzner presented a paper at JCDL (Joint Conference on Digital Libraries) in Pittsburgh, Pennsylvania  last month.&#160; His paper is entitled,  &#34;A simple method for citation metadata extraction using hidden Markov  models&#34; and is now available online.</p>]]></description>
			<content:encoded><![CDATA[<p>By John Kunze, CDL Preservation  Technologies Architect</p>
<p>CDL&#8217;s Preservation Program staff  member Erik Hetzner presented a paper at JCDL (Joint Conference on Digital  Libraries) in Pittsburgh, Pennsylvania last month.&nbsp; His paper is entitled, &quot;A simple method  for citation metadata extraction using hidden Markov models&quot; and is  available at <a href="http://gales.cdlib.org/~egh/hmm-citation-extractor/sp181-hetzner.pdf">http://gales.cdlib.org/~egh/hmm-citation-extractor/sp181-hetzner.pdf</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2008/07/15/cdl-staff-in-print/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CDL recruiting for Digital Preservation Services Manager</title>
		<link>http://cdlinfo.cdlib.org/blog/2008/07/11/cdl-recruiting-for-digital-preservation-services-manager/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2008/07/11/cdl-recruiting-for-digital-preservation-services-manager/#comments</comments>
		<pubDate>Fri, 11 Jul 2008 20:19:15 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>General</category>

		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2008/07/11/cdl-recruiting-for-digital-preservation-services-manager/</guid>
		<description><![CDATA[<p>The California Digital Library (CDL) is recruiting for a Digital Preservation Services Manager.</p>]]></description>
			<content:encoded><![CDATA[<p>By Patricia Cruse, CDL  Director of the Digital Preservation Program</p>
<p>UNIVERSITY OF CALIFORNIA, CALIFORNIA DIGITAL LIBRARY</p>
<p>TITLE: Digital  Preservation Services Manager</p>
<p>CATEGORY: Full-Time</p>
<p>SALARY: Salary  commensurate with qualifications and experience. <br />
  Excellent benefits.</p>
<p>TO APPLY:&nbsp;  http://jobs.ucop.edu/applicants/Central?quickFind=52447</p>
<p>POSITION DESCRIPTION: </p>
<p>Want to be part of a  dynamic team that is working to preserve digital information for future  generations?&nbsp; At the California Digital Library (CDL), we&#8217;ve developed a  world-class program to preserve digital material that supports the University of California&#8217;s research, teaching, and  learning mission and you can be a part of it.&nbsp; A key member of the team is the  Digital Preservation Services Manager &#8212; reporting to the Director of the Digital Preservation Program the Manager is responsible for the day-to-day  management of digital preservation services (production and development)  through project management, the provision of support services (whether offered  in person or online), and liaison with digital preservation service providers  and support staff.&nbsp; In addition, the Services Manager will be responsible for translating experience of users&#8217; needs and perceptions of system capabilities  in a manner that informs further refinement and extension of the digital preservation technology and service infrastructure.</p>
<p>This is an ideal  opportunity for someone with solid people skills and a passion for working in a  collaborative and dynamic environment.</p>
<p>The California Digital  Library (CDL) supports the assembly and creative use of the world&#8217;s scholarship  and knowledge for the UC libraries and the communities they serve.&nbsp; In  partnership with the UC libraries, the California Digital Library established  the digital preservation program to ensure long-term access to the digital information that supports and results from research, teaching and learning at  UC.</p>
<p>JOB REQUIREMENTS: </p>
<p>Bachelor&#8217;s degree in the  social sciences, public administration, library and information science or a  related field and at least three years&#8217; relevant experience with development or  delivery of online information services in educational, digital preservation,  library, research, and/or cultural heritage settings or an equivalent  combination of education and experience.</p>
<p>Demonstrated experience  to plan, evaluate, budget for and manage complex projects from their inception  through to their final delivery.</p>
<p>Plans projects and  assignments and monitors performance according to priorities as demonstrated by  regularly meeting established deadlines in an environment of multiple projects  and changing priorities.</p>
<p>Strong logic and  quantitative reasoning skills as demonstrated by ability to review and assess a  range of variables to define key issues, evaluate reasonable alternatives and  translate findings into recommended changes, actions or strategies.</p>
<p>Proven experience with  and general understanding of the academic user community and the digital  library/scholarly information services domain.</p>
<p>Demonstrated experience  working with user community and technology/programming staff to build use  cases, functional requirements and user interface design.</p>
<p>Excellent written and  verbal communication skills as demonstrated by the ability to understand and  articulate technical ideas and issues at a conceptual level and explain them  clearly and concisely to non-technical staff.</p>
<p>Demonstrated ability to  operate under general direction, able to develop creative solutions to  problems, and tackle issues in a self-motivated manner in a service-oriented  geographically distributed team environment.</p>
<p>Demonstrated ability to  plan, evaluate, budget for and manage complex projects from their inception  through to their final delivery.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2008/07/11/cdl-recruiting-for-digital-preservation-services-manager/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Bagit: Transferring Digital Content</title>
		<link>http://cdlinfo.cdlib.org/blog/2008/07/02/bagit-transferring-digital-content/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2008/07/02/bagit-transferring-digital-content/#comments</comments>
		<pubDate>Wed, 02 Jul 2008 20:39:49 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2008/07/02/bagit-transferring-digital-content/</guid>
		<description><![CDATA[<p>The CDL Digital Preservation Group, under the leadership of John Kunze, has co-developed with the Library of Congress a format for transferring digital content.</p>]]></description>
			<content:encoded><![CDATA[<p>By Trisha  Cruse, Director of Digital Preservation</p>
<p>The CDL  Digital Preservation Group, under the leadership of John Kunze, has co-developed with the Library of Congress a format  for transferring digital content.&nbsp; &ldquo;The BagIt  format specification is based on the concept of &lsquo;bag it and tag it,&rsquo; where digital content is packaged (the bag) along with a small amount of  machine-readable text (the tag) to help automate the content&rsquo;s receipt, storage and retrieval.&nbsp; There is no software to install.&nbsp; BagIt is an attempt to simplify large-scale data transfers between cultural institutions.&rdquo;</p>
<p>Find at more from the Library of  Congress press release: <br />
<a href="http://www.digitalpreservation.gov/news/2008/20080602news_article_bagit.html"> http://www.digitalpreservation.gov/news/2008/20080602news_article_bagit.html</a>
   </p>
<p>The full BagIt specification is available at <a href="http://www.cdlib.org/inside/diglib/bagit/bagitspec.html">http://www.cdlib.org/inside/diglib/bagit/bagitspec.html</a></p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2008/07/02/bagit-transferring-digital-content/feed/</wfw:commentRss>
		</item>
		<item>
		<title>CDL Guidelines for Digital Objects, Version 2.0: Updated for METS File element</title>
		<link>http://cdlinfo.cdlib.org/blog/2007/11/15/cdl-guidelines-for-digital-objects-version-20-updated-for-mets-file-element/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2007/11/15/cdl-guidelines-for-digital-objects-version-20-updated-for-mets-file-element/#comments</comments>
		<pubDate>Thu, 15 Nov 2007 19:55:55 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<category>Technology</category>

		<category>Digital Special Collections</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2007/11/15/cdl-guidelines-for-digital-objects-version-20-updated-for-mets-file-element/</guid>
		<description><![CDATA[<p>The &#34;CDL Guidelines for Digital Objects,  Version 2.0&#34; (CDL GDO) has been updated to include specifications for use  of the METS File &#60;file&#62; element.&#160; </p>]]></description>
			<content:encoded><![CDATA[<p>By Adrian  Turner, CDL Data Acquisitions<strong></strong></p>
<p>The  &quot;CDL Guidelines for Digital Objects, Version 2.0&quot; (CDL GDO) has been  updated to include specifications for use of the METS File &lt;file&gt;  element.&nbsp; You can find the updated  version at <a href="http://www.cdlib.org/inside/diglib/guidelines/">http://www.cdlib.org/inside/diglib/guidelines/</a> .</p>
<p>The  revision applies to Sections 2.1, 2.2.2, 3.1, and 3.2.4 only:</p>
<ul>
<li>To  support the orderly transmission and ingest of digital objects, the CDL  recommends the inclusion of checksum (MD5, SHA-1, or CRC32) and byte size  values in the METS File &lt;file&gt; element.&nbsp;  Note that this information is preferred, but not required.</li>
<li>The  subheadings within Sections 2.1 and 3.1 have been relabeled, and are now  consistently based on METS element names.
  </ul>
<p>Please  contact the CDL at oacops@ucop.edu if you have any questions.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2007/11/15/cdl-guidelines-for-digital-objects-version-20-updated-for-mets-file-element/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Digital Preservation News</title>
		<link>http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/#comments</comments>
		<pubDate>Wed, 17 Oct 2007 21:33:56 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<category>Digital Publishing</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/</guid>
		<description><![CDATA[<p>The CDL Digital Preservation Group has been busy with a variety of exciting activities.</p>]]></description>
			<content:encoded><![CDATA[<p>By Trisha  Cruse, CDL Director of Digital Preservation</p>
<p>The CDL  Digital Preservation Group has been busy with a variety of exciting activities,  reported below. </p>
<p><strong>Release 4 of the Web Archiving Service</strong><br />
  On September 18th the Web Archiving Group released a new version of the  Web Archiving Service &ndash; special thanks to Tracy   Seneca, Scott Fisher,  Margaret Low, Erik Hetzner, Mark Reyes, and Mike Wooldridge for getting this  release out the door.&nbsp; So far the group has received very positive feedback  from users on the service&rsquo;s functionality and the user interface.&nbsp; We are  also extremely pleased with the performance; we are up to 500 captures with  relatively few hiccups.</p>
<p>We have also put together an overview of the service that is available on YouTube &lt;http://tinyurl.com/2tdrwq<strong>&gt;</strong>.&nbsp; This brief overview explains why the content targeted for this project is at  risk, how we plan to address this in the Web Archiving Service, and provides an explanation of the collections our curators are working on. Warning: the  YouTube video quality is a bit sketchy so we have also made this presentation  available in a high-quality video format; contact tracy.seneca at ucop dot edu  for further information. </p>
<p><strong>A kinder  and gentler ARK  page</strong><br />
 Thanks to  Kirsten Neilsen and John Kunze there is now a kinder, gentler introduction to ARK identifiers on Inside CDL &lt;<a href="http://www.cdlib.org/inside/diglib/ark/" title="http://www.cdlib.org/inside/diglib/ark/">http://www.cdlib.org/inside/diglib/ark/</a>&gt;.&nbsp; Don&rsquo;t know what that is?&nbsp; Then definitely take a look.&nbsp; Our hope is that this will help others  recognize and appreciate the true beauty and splendor of ARKs. &nbsp;The new  page has already been re-purposed in a German &quot;technology watch&quot;  newsletter, &lt;<a href="http://www.kim-forum.org/techwatch/kim-dini-technology-watch-report1_2007.pdf" title="http://www.kim-forum.org/techwatch/kim-dini-technology-watch-report1_2007.pdf">http://www.kim-forum.org/techwatch/kim-dini-technology-watch-report1_2007.pdf</a>&gt;  which is the very first edition of a bi-annual publication from the  Interoperable Metadata Center for Excellence and the German Networked  Information Initiative.</p>
<p><strong>Tidal  wave of web data knocking on our door</strong><br />
 For the  past several years the Digital Preservation group has been working with Andreas  Paepcke and Hector Garcia-Molina at Stanford   University on web  crawling activities.&nbsp; Their research group has a wealth of experience  collecting web data and while CDL&rsquo;s Digital Preservation group was getting their  &ldquo;web crawling sea legs&rdquo; they asked Stanford&rsquo;s group to collect data on our  behalf.&nbsp; Over the years Stanford has collected over 100 TB of data ranging  from dot.gov sites, election data, Katrina,   Virginia Tech tragedy, etc.&nbsp;  However, they have been using a different crawler than the Web Archiving  Service (WAS) crawler (Heritrix).&nbsp; As a consequence their crawler output  is incompatible with most web archiving services, including ours.&nbsp; However, there is good news &#8212; they have recently created a tool that will turn  the output of their crawler data into something that CDL&#8217;s service can  understand.&nbsp; Erik Hetzner, Mike Wooldridge, and Scott Fisher are just  beginning to play around with this, but we are hoping for a positive outcome.</p>
<p><strong>Contributing  to the community by documenting Heritrix</strong><br />
 As  mentioned above, our Web Archiving Service uses Heritrix, the Internet  Archive&#8217;s (IA) open-source, extensible, web-scale, archival-quality web crawler  project.&nbsp; &quot;Heritrix&quot; (often misspelled heretrix, heratrix,  heritix, etc.) is an archaic word for &quot;heiress&quot;, which the IA chose because the project seeks to collect and preserve the digital artifacts of our  culture for the benefit of future researchers and generations.&nbsp; One of the  challenges of using Heritrix is that there is a dearth of documentation.&nbsp; Over the next several months Hunter Stern, CDL&#8217;s technical writer, will be working with Heritrix programmers at CDL and IA to better document the crawler.&nbsp; This collaboration will help us tremendously and benefit the  crawler community as well.</p>
<p><strong>Moving  big data: Mass Transit Project</strong><br />
Over the  past couple of years the Digital Preservation Group has been working with the  campuses to move large chunks of content into the Digital Preservation  Repository (DPR).&nbsp; In the process we have encountered a few speed bumps along the way. The issues are two-fold but related: the files are large and the  network transfer rates have been unaccountably slow.&nbsp; Though we have worked  towards resolving this, we have more work to do in understanding the best  transfer tools and in monitoring our networks to make sure there are no log  jams and that they are ready to be used to their full potential  bandwidth.&nbsp; The goal is to make sure we&#8217;re making the best use of our Internet2 pathways to/from the campuses and the data centers for the benefit of  all CDL projects.</p>
<p>The Digital  Preservation group has embarked on two efforts to speed up movement of large  files into the DPR. &nbsp;First, they are collaborating with San Diego  Supercomputer Center (SDSC) to understand how to transfer data across the  network more quickly and efficiently.&nbsp; Second, they are implementing (on a trial  basis) a method of pulling in large numbers of external data objects into a  kind of preservation holding tank in order to reduce the impact of network  speed and latency on the overall DPR ingest process.&nbsp; They are very excited about the collaboration  with SDSC and Kirsten Neilsen will be leading the project for CDL &ndash; we&rsquo;re  calling the project &ldquo;Mass Transit&rdquo; and there is a project Wiki &lt;http://masstransit.sdsc.edu/&gt;. </p>
<p>If you want  any additional information on any of these projects please contact Trisha Cruse  (patricia.cruse@ucop.edu). </p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
