<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/wordpress-mu-1.0" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>CDLINFO</title>
	<link>http://cdlinfo.cdlib.org</link>
	<description>California Digital Library News</description>
	<pubDate>Fri, 02 May 2008 21:27:20 +0000</pubDate>
	<generator>http://wordpress.org/?v=wordpress-mu-1.0</generator>
	<language>en</language>
			<item>
		<title>University of California eScholarship&#174; Repository Exceeds 5 Million Full-Text Downloads; 20,000 Papers</title>
		<link>http://cdlinfo.cdlib.org/blog/2008/01/15/university-of-california-escholarship-repository-exceeds-5-million-full-text-downloads-20000-papers/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2008/01/15/university-of-california-escholarship-repository-exceeds-5-million-full-text-downloads-20000-papers/#comments</comments>
		<pubDate>Tue, 15 Jan 2008 19:36:32 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Publishing</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2008/01/15/university-of-california-escholarship-repository-exceeds-5-million-full-text-downloads-20000-papers/</guid>
		<description><![CDATA[<p>The University of California announced this week that its widely used eScholarship® Repository has surpassed the 5 million mark for full-text downloads of its open access scholarly content.</p>]]></description>
			<content:encoded><![CDATA[
<p>By Catherine Mitchell, CDL Acting Director of Publishing Services</p>
<p>The University of California announced this week that its widely used eScholarship&reg; Repository has surpassed the 5 million mark for full-text downloads of its open access scholarly content.&nbsp; This major milestone reflects the impressive adoption and usage rate the Repository has enjoyed since its inception in 2002, with University of California academic units and departments from its 10 campuses publishing or depositing over 20,000 papers and works.</p>
<p>The eScholarship Repository, a service of the California Digital Library, provides a robust full-spectrum, open access publishing platform for pre-prints, post-prints, peer-reviewed articles, edited volumes and peer-reviewed journals.&nbsp; The Repository houses a broad range of scholarly content from disciplines across the Humanities, Social Sciences, Mathematics and Sciences.</p>
<p>The rate of usage of these materials has grown exponentially in the past 5 years, now often exceeding 55,000 full-text  downloads per week.</p>
<p>As evidenced by this rate of activity, the eScholarship Repository represents one of the University of California&rsquo;s most  successful and sustained efforts to improve and provide innovative alternatives  to the troubled scholarly publishing system &ndash; a system that increasingly  struggles to serve the needs and requirements of the academic community.</p>
<p>&ldquo;We&rsquo;re very excited about the uptake and use of the eScholarship Repository at the University of California,&rdquo; says Catherine Candee,  Executive Director, Strategic Publishing and Broadcast Services at UC&rsquo;s Office of the President.&nbsp; &ldquo;Our open access publishing platform represents a critical component of UC&rsquo;s broader effort to strengthen  university-based publishing services and integrate them into the research, teaching and public service mission of the University.&rdquo; </p>
<p>Part of a suite of innovative publishing services developed  by the CDL in recent years, the eScholarship Repository serves the scholarly publishing needs of individual faculty and academic departments, laboratories  and research units across the University of California system.&nbsp; It is also a central mechanism in the collaborative publishing efforts between the CDL and the University of California Press.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2008/01/15/university-of-california-escholarship-repository-exceeds-5-million-full-text-downloads-20000-papers/feed/</wfw:commentRss>
		</item>
		<item>
		<title>University of California launches Mark Twain Project Online</title>
		<link>http://cdlinfo.cdlib.org/blog/2007/11/08/university-of-california-launches-mark-twain-project-online/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2007/11/08/university-of-california-launches-mark-twain-project-online/#comments</comments>
		<pubDate>Thu, 08 Nov 2007 19:14:45 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>General</category>

		<category>Digital Publishing</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2007/11/08/university-of-california-launches-mark-twain-project-online/</guid>
		<description><![CDATA[<p>The University of California is pleased to announce the launch of the beta version of the Mark Twain Project Online, a digital critical edition of the writings of Mark Twain.</p>]]></description>
			<content:encoded><![CDATA[<p><em>Access to texts, notes, and facsimiles available online at no charge to institutions or individuals</em></p>
<p>The University of California is pleased to announce the launch of the beta version of the Mark Twain Project Online (<a href="http://www.marktwainproject.org/">www.marktwainproject.org</a>), a digital critical edition of the writings of Mark Twain.</p>
<p>The Mark Twain Project Online (MTPO) applies innovative technology to more than four decades of archival research by expert editors at the Mark Twain Project.&nbsp; It offers unfettered, intuitive access to reliable texts, accurate and exhaustive  notes, and the most recently discovered letters and documents.</p>
<p>  MTPO is a  joint undertaking of the Mark Twain Papers and Project, the California Digital  Library, and University of California Press.&nbsp; It is funded in part by a generous grant  from the National Endowment for the Humanities to the Mark Twain Project, and is supported by a number of institutions and individuals.&nbsp; The Mark Twain  Foundation, a perpetual charitable trust that possesses the publication rights  to all of Mark Twain&rsquo;s writings, has given UC Press and the Mark Twain Project Online exclusive rights to publish copyright-protected writings by Mark Twain, both in print and electronically. </p>
<p>At beta launch, the site will include more than twenty-three hundred letters written between 1853 and 1880, including nearly 100 facsimiles of originals.&nbsp; Users will also be able to search for information about Mark Twain&#8217;s complete correspondence across his entire life, including letters to him and his family. In future years, the site will release more of the nearly ten thousand known letters, including many never before published; electronic editions of many of Mark Twain&rsquo;s most famous literary works; the most complete catalog of Mark Twain&#8217;s writings currently available; and, in 2010, <em>Mark Twain&rsquo;s Autobiography</em>, never before published in its complete form.</p>
<p>&quot;The Mark Twain Project Online is an extraordinary resource for scholars, teachers, and ordinary readers.&nbsp; Materials that previously could be examined only by scholars fortunate enough to be able to visit the Mark Twain Project in The Bancroft Library at UC Berkeley will now be available worldwide to anyone with an interest in Mark Twain&mdash;and that&#8217;s a cause for celebration,&quot; Shelley Fisher Fishkin, author of <em>Lighting Out for the Territory: Reflections on Mark Twain and American Culture</em>, said.</p>
<p>The customizable interface provides a powerful reading and research experience.&nbsp; The site offers users unprecedented access to authoritative transcriptions of Mark Twain&rsquo;s writings and the ability to compare those transcriptions side by side with facsimiles when available.  Researchers can gather and store digital citations and links to selected documents, images, and other resources.&nbsp; These features are supported, in large part, by the California Digital Library&rsquo;s eXtensible Text Framework (XTF) and the ongoing work of the Text Encoding Initiative (TEI).</p>
<p>The Mark Twain Project Online demonstrates the great advantages of digital presentation and will be a model for future digital scholarly work.&nbsp; &ldquo;The Mark Twain Project Online is an exciting initiative that will make a fundamental literary and biographical archive available to scholars and students.&nbsp; MTPO offers easy access through a sophisticated web interface that is growing and comprehensive scope.&nbsp; This project has the potential to become a model for Web accessibility to foundational scholarly resources,&rdquo; Richard Terdiman, author of <em>Body and Story: The Ethics and Practice of Theoretical Conflict</em>, said.</p>
<p>To view the Mark Twain Project Online and read about the making of this landmark online publication, visit <a href="http://www.marktwainproject.org/">http://www.marktwainproject.org</a>.&nbsp; For additional information, you can also contact Catherine Mitchell (<a href="mailto:Catherine.Mitchell@ucop.edu">Catherine.Mitchell@ucop.edu</a>; 510.587.6132), Acting Director of CDL&rsquo;s eScholarship Publishing Group.</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2007/11/08/university-of-california-launches-mark-twain-project-online/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Digital Preservation News</title>
		<link>http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/</link>
		<comments>http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/#comments</comments>
		<pubDate>Wed, 17 Oct 2007 21:33:56 +0000</pubDate>
		<dc:creator>raw</dc:creator>
		
		<category>Digital Preservation</category>

		<category>Digital Publishing</category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/</guid>
		<description><![CDATA[<p>The CDL Digital Preservation Group has been busy with a variety of exciting activities.</p>]]></description>
			<content:encoded><![CDATA[<p>By Trisha  Cruse, CDL Director of Digital Preservation</p>
<p>The CDL  Digital Preservation Group has been busy with a variety of exciting activities,  reported below. </p>
<p><strong>Release 4 of the Web Archiving Service</strong><br />
  On September 18th the Web Archiving Group released a new version of the  Web Archiving Service &ndash; special thanks to Tracy   Seneca, Scott Fisher,  Margaret Low, Erik Hetzner, Mark Reyes, and Mike Wooldridge for getting this  release out the door.&nbsp; So far the group has received very positive feedback  from users on the service&rsquo;s functionality and the user interface.&nbsp; We are  also extremely pleased with the performance; we are up to 500 captures with  relatively few hiccups.</p>
<p>We have also put together an overview of the service that is available on YouTube &lt;http://tinyurl.com/2tdrwq&gt;.&nbsp; This brief overview explains why the content targeted for this project is at risk, how we plan to address this in the Web Archiving Service, and which collections our curators are working on. Warning: the YouTube video quality is a bit sketchy, so we have also made this presentation available in a high-quality video format; contact tracy.seneca at ucop dot edu for further information.</p>
<p><strong>A kinder  and gentler ARK  page</strong><br />
 Thanks to Kirsten Neilsen and John Kunze, there is now a kinder, gentler introduction to ARK identifiers on Inside CDL &lt;<a href="http://www.cdlib.org/inside/diglib/ark/" title="http://www.cdlib.org/inside/diglib/ark/">http://www.cdlib.org/inside/diglib/ark/</a>&gt;.&nbsp; Don&rsquo;t know what an ARK is?&nbsp; Then definitely take a look.&nbsp; Our hope is that this will help others recognize and appreciate the true beauty and splendor of ARKs.&nbsp; The new page has already been re-purposed in a German &quot;technology watch&quot; newsletter &lt;<a href="http://www.kim-forum.org/techwatch/kim-dini-technology-watch-report1_2007.pdf" title="http://www.kim-forum.org/techwatch/kim-dini-technology-watch-report1_2007.pdf">http://www.kim-forum.org/techwatch/kim-dini-technology-watch-report1_2007.pdf</a>&gt;, which is the very first edition of a bi-annual publication from the Interoperable Metadata Center for Excellence and the German Networked Information Initiative.</p>
<p><strong>Tidal  wave of web data knocking on our door</strong><br />
 For the past several years the Digital Preservation group has been working with Andreas Paepcke and Hector Garcia-Molina at Stanford University on web crawling activities.&nbsp; Their research group has a wealth of experience collecting web data, and while CDL&rsquo;s Digital Preservation group was getting its &ldquo;web crawling sea legs&rdquo; we asked Stanford&rsquo;s group to collect data on our behalf.&nbsp; Over the years Stanford has collected over 100 TB of data covering .gov sites, election data, Hurricane Katrina, the Virginia Tech tragedy, and more.&nbsp; However, they have been using a different crawler than the Web Archiving Service (WAS) crawler (Heritrix).&nbsp; As a consequence, their crawler output is incompatible with most web archiving services, including ours.&nbsp; The good news is that they have recently created a tool that will turn the output of their crawler into something that CDL&#8217;s service can understand.&nbsp; Erik Hetzner, Mike Wooldridge, and Scott Fisher are just beginning to play around with this, but we are hoping for a positive outcome.</p>
<p><strong>Contributing  to the community by documenting Heritrix</strong><br />
 As  mentioned above, our Web Archiving Service uses Heritrix, the Internet  Archive&#8217;s (IA) open-source, extensible, web-scale, archival-quality web crawler  project.&nbsp; &quot;Heritrix&quot; (often misspelled heretrix, heratrix,  heritix, etc.) is an archaic word for &quot;heiress&quot;, which the IA chose because the project seeks to collect and preserve the digital artifacts of our  culture for the benefit of future researchers and generations.&nbsp; One of the  challenges of using Heritrix is that there is a dearth of documentation.&nbsp; Over the next several months Hunter Stern, CDL&#8217;s technical writer, will be working with Heritrix programmers at CDL and IA to better document the crawler.&nbsp; This collaboration will help us tremendously and benefit the  crawler community as well.</p>
<p><strong>Moving  big data: Mass Transit Project</strong><br />
Over the past couple of years the Digital Preservation Group has been working with the campuses to move large chunks of content into the Digital Preservation Repository (DPR).&nbsp; Along the way we have encountered a few speed bumps. The issues are two-fold but related: the files are large and the network transfer rates have been unaccountably slow.&nbsp; Though we have worked towards resolving this, we have more work to do in understanding the best transfer tools and in monitoring our networks to make sure there are no logjams and that they can be used to their full potential bandwidth.&nbsp; The goal is to make sure we&#8217;re making the best use of our Internet2 pathways to and from the campuses and the data centers for the benefit of all CDL projects.</p>
<p>The Digital Preservation group has embarked on two efforts to speed up movement of large files into the DPR.&nbsp; First, we are collaborating with the San Diego Supercomputer Center (SDSC) to understand how to transfer data across the network more quickly and efficiently.&nbsp; Second, we are implementing (on a trial basis) a method of pulling large numbers of external data objects into a kind of preservation holding tank in order to reduce the impact of network speed and latency on the overall DPR ingest process.&nbsp; We are very excited about the collaboration with SDSC, and Kirsten Neilsen will be leading the project for CDL. We&rsquo;re calling the project &ldquo;Mass Transit&rdquo; and there is a project wiki &lt;http://masstransit.sdsc.edu/&gt;.</p>
<p>If you want additional information on any of these projects, please contact Trisha Cruse (patricia.cruse@ucop.edu).</p>
]]></content:encoded>
			<wfw:commentRss>http://cdlinfo.cdlib.org/blog/2007/10/17/digital-preservation-news/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
