<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0">
<channel>
<title>John Mark Ockerbloom</title>
<copyright>Copyright (c) 2012  All rights reserved.</copyright>
<link>http://works.bepress.com/john_mark_ockerbloom</link>
<description>Recent documents in John Mark Ockerbloom</description>
<language>en-us</language>
<lastBuildDate>Sat, 24 Nov 2012 21:26:42 PST</lastBuildDate>
<ttl>3600</ttl>








<item>
<title>Building a global open network of  discovery: Improving subject browsing  with linked authority data</title>
<link>http://works.bepress.com/john_mark_ockerbloom/14</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/14</guid>
<pubDate>Fri, 11 Feb 2011 13:50:30 PST</pubDate>
<description>
	<![CDATA[
	<p>This is a static version of the visuals of a presentation I gave to the ALA Subject  on linked open subject authority data.  I described how linked data from the Library of Congress (http://id.loc.gov/) can be used to improve subject cataloging and browsing. I describe how the data is structured (including a brief overview of RDF and SKOS), and explain how I have used it to improve The Online Books Page (http://onlinebooks.library.upenn.edu/) and the Penn Libraries' catalog. I also show how I augmented and enhanced the authority data from LC, and discuss how widespread use of linked bibliographic and authority data may change how we build and use catalogs in the future.</p>
<p>This presentation is an expanded revision of a presentation made at the Digital Library Federation fall forum in November 2010.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

</item>






<item>
<title>The Metadata Challenge: Promoting Discovery, Access, and Usability for Online Books</title>
<link>http://works.bepress.com/john_mark_ockerbloom/13</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/13</guid>
<pubDate>Fri, 29 Oct 2010 14:21:42 PDT</pubDate>
<description>
	<![CDATA[
	<p>With millions of books, serials, and other documents now  digitized, rich troves of information and culture can now  be made available to anyone in the world with an Internet  connection. But these riches are worthless if they cannot be  found, accessed, and eﬀectively used by the readers who need them. The key to unlock these treasures is metadata. Networked computing enables techniques for making metadata more eﬀective than ever; yet in practice, online collections all too often either do not have or do not take full advantage of the best metadata they could use.</p>
<p>There is much ongoing work harnessing metadata to improve online book discovery, access, and usability. Online  book discovery is being enhanced with concept-oriented catalogs of various kinds, including browsable maps relating  millions of subjects and associated books. Copyright metadata is starting to open access to many books that had been needlessly withheld from the public, while also reducing the risk of inadvertent infringement. Structural and relational metadata and annotations are making complex works much more usable than they were when they were represented as a mishmash of volumes.</p>
<p>Using metadata eﬀectively in multi-million-volume collections poses special problems of scale. Solving these problems requires considered application of both library science and computer science. It also requires harnessing the collective intelligence of readers, writers, librarians, and publishers. Wise metadata management policies, including open data sharing, can promote the eﬀective aggregation of human and machine intelligence at the scale we will need. This talk will demonstrate and describe ways in which we can meet the metadata challenges of large-scale online libraries both now and in the future.</p>
<p>[This record of the presentation consists of the slides, and the notes for the talk.  The notes only partially reflect the talk as delivered.]</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Copyright</category>

<category>Information discovery</category>

<category>Preservation</category>

</item>






<item>
<title>Architectural Mismatch: Why Reuse is Still So Hard</title>
<link>http://works.bepress.com/john_mark_ockerbloom/12</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/12</guid>
<pubDate>Wed, 22 Jul 2009 08:50:40 PDT</pubDate>
<description>
	<![CDATA[
	<p>In this article, David Garlan, Robert Allen, and John Ockerbloom reflect on the state of architectural mismatch, a term they coined in their 1995 IEEE Software article, "Architectural Mismatch: Why Reuse Is So Hard." Although the nature of software systems has changed dramatically since the earlier article was published, the challenge of architectural mismatch remains an important concern for the software engineering field.</p>

	]]>
</description>

<author>David Garlan et al.</author>


<category>Systems architecture</category>

</item>






<item>
<title>Open discovery of library resources: The Digital Library Federation&apos;s ILS-Discovery interface recommendations</title>
<link>http://works.bepress.com/john_mark_ockerbloom/11</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/11</guid>
<pubDate>Wed, 10 Jun 2009 17:37:17 PDT</pubDate>
<description>
	<![CDATA[
	<p>A NISO webinar presentation on the ILS-DI work, its design, its implementation, and its future.  Based in part on my ALA talk in January, but reworked and updated.  The PDF file includes both slides and my speaker's script.   (My actual remarks in the live presentation differed slightly in places.)</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

<category>Systems architecture</category>

</item>






<item>
<title>Open records, open possibilities</title>
<link>http://works.bepress.com/john_mark_ockerbloom/10</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/10</guid>
<pubDate>Wed, 28 Jan 2009 10:07:48 PST</pubDate>
<description>
	<![CDATA[
	<p>Slides and prepared remarks for an ALA panel discussion on sharing bibliographic records and OCLC's proposed WorldCat policy.</p>
<p>I give some examples of useful innovations that can advance the mission of libraries when records are openly shared, and argue that more restrictive policies like OCLC's proposal can cost the library community dearly in lost opportunities.  I show how the open source community has alternative ways to license work openly, and to cover costs.  Finally, I argue that, even if WorldCat is not prepared to open access to all its bibliographic records, the members of the cooperative should be empowered to use and distribute public domain metadata, deposit their own contributions into WorldCat with more liberal licensing, and collectively ratify or reject significant changes to WorldCat policies.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Copyright</category>

<category>Information discovery</category>

</item>






<item>
<title>Opening the ILS for Discovery: The Digital Library Federation’s ILS-Discovery Interface recommendations</title>
<link>http://works.bepress.com/john_mark_ockerbloom/9</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/9</guid>
<pubDate>Wed, 28 Jan 2009 07:06:18 PST</pubDate>
<description>
	<![CDATA[
	<p>Slides for my ALA talk, giving an overview of the DLF's ILD-discovery interface recommendations, and how they are and can be used to enable a richer environment of information discovery applications across a wide variety of ILS's and other library information bases.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

<category>Systems architecture</category>

</item>






<item>
<title>Watching Our Backs: Community Verification of Digital Preservation Systems</title>
<link>http://works.bepress.com/john_mark_ockerbloom/8</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/8</guid>
<pubDate>Fri, 14 Nov 2008 08:55:08 PST</pubDate>
<description>
	<![CDATA[
	<p>Librarians and faculty agree that information preservation is one of the essential roles of libraries. Yet, as the information we manage increasingly becomes digital, we have to rely on new methods of preserving this information that have not been fully tested. While developing and auditing for best practices is important, we must also verify that preservation systems actually perform as we hope they will, preferably long before we have to fall back on them.</p>
<p>In this talk, I will show ways in which this verification can be done now, by the community, with reasonable cost and demonstrable efficacy. Specifically, I will describe Penn's failure recovery tests of LOCKSS, which uncovered issues with the system's performance and reliability, and helped lead to improvements addressing these issues. I will also discuss initiatives being organized through CRL to assess distributed auditing and community knowledge sharing to test and improve LOCKSS, Portico, and other shared preservation systems.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Preservation</category>

</item>






<item>
<title>Promoting discovery and use of repository content: An architectural perspective</title>
<link>http://works.bepress.com/john_mark_ockerbloom/7</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/7</guid>
<pubDate>Wed, 08 Oct 2008 18:25:46 PDT</pubDate>
<description>
	<![CDATA[
	<p>Slides and notes for a talk I gave at a NARA/UMD conference.  (The notes include a full script, though it differs slightly from the talk as delivered.)</p>
<p>In this talk, I stress the importance of effective discovery as an essential component of (and aid to) preservation.  I advocate the importance of opening up information, system, and social architectures to do so, with examples that include subject maps, the DLF ILS-DI work, VCat, and PennTags.</p>
<p>Some of the material in the talk was adapted from the "High Quality Discovery in a Web 2.0 World" talk I gave for Palinet.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

<category>Preservation</category>

<category>Systems architecture</category>

</item>






<item>
<title>High Quality Discovery in a Web 2.0 World: Architectures for Next Generation Catalogs</title>
<link>http://works.bepress.com/john_mark_ockerbloom/6</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/6</guid>
<pubDate>Wed, 28 May 2008 11:58:51 PDT</pubDate>
<description>
	<![CDATA[
	<p>Issues of information and systems architecture underly many of the current debates over the future of cataloging.  This talk discusses some ways in which the architecture of the catalog is being redesigned to combine the rich information architecture of library metadata with the robust systems architecture of many Web-based discovery systems.  I will show "subject map" discovery systems that better exploit the relationships in complex ontologies like LCSH, and discuss a Digital Library Federation initiative to promote standards supporting interoperability between discovery systems and ILS data and services.  I will also touch on the role of networked architectures in improving the quality and efficiency of library cataloging.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

<category>Systems architecture</category>

</item>






<item>
<title>The Next Mother Lode for Large-scale Digitization?  Historic Serials, Copyrights, and Shared Knowledge</title>
<link>http://works.bepress.com/john_mark_ockerbloom/5</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/5</guid>
<pubDate>Wed, 30 Apr 2008 05:53:26 PDT</pubDate>
<description>
	<![CDATA[
	<p>Much of the publicity around recent mass-digitization projects focuses on the millions of books they promise to make freely readable online. Because of copyright, though, most of the books provided in full will be of mainly historical interest. But much of the richest historical text content is not in books at all, but in the newspapers, magazines, newsletters, and scholarly journals where events are reported firsthand, stories and essays make their debut, research findings are announced and critiqued, and issues of the day debated. Back runs of many of these serials are available in major research institutions but often in few other places. But they have the potential for much more intensive use, by a much wider community, if they are digitized and made generally accessible.</p>
<p>In this talk, we will discuss an inventory we have conducted at Penn of periodicals copyright renewals. We found that copyrights of the vast majority of mid-20th-century American serials of historical interest were not renewed to their fullest possible extent. The inventory reveals a rich trove of copyright-free digitizable serial content from major periodicals as late as the 1960s. Drawing on our experience with this inventory's production and previous registry development, we will also show how low-cost, scalable knowledge bases could be built from this inventory to help libraries more easily identify freely digitizable serial content, and collaborate in making it digitally available to the world. Our initial raw inventory can be found at http://onlinebooks.library.upenn.edu/cce/firstperiod.html</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Copyright</category>

</item>






<item>
<title>Archiving and Preserving PDF Files</title>
<link>http://works.bepress.com/john_mark_ockerbloom/4</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/4</guid>
<pubDate>Wed, 02 Apr 2008 12:45:46 PDT</pubDate>
<description>
	<![CDATA[
	<p>Since its release in mid 1993, Adobe Portable Document Format (PDF) has become a widely used standard for electronic document distribution worldwide in many institutional settings. Much of its popularity comes from its ability to faithfully encode both the text and the visual appearance of source documents, preserving their fonts, formatting, colors, and graphics. PDF files can be viewed, navigated, and printed with a free Adobe Acrobat Reader, available on all major computing platforms. PDF has many applications and is commonly used to publish government, public, and academic documents. Many of the electronic journals and other digital resources acquired by libraries are published in PDF format.</p>
<p>As libraries grow more dependent on electronic resources, they need to consider how they can preserve these resources for the long term. Many libraries retain back runs of print journals that are over 100 years old, and which are still consulted by researchers. No digital technology has lasted nearly that long, and many data formats have already become obsolete and not easily readable in a much shorter time period. This document discusses ways that libraries can plan for the preservation of electronic journals and other digital resources in PDF format. After a brief discussion of the file specifications and the future plans for PDF, the article focuses on issues related to preservation of PDF files.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Preservation</category>

</item>






<item>
<title>Mapping the library future: Subject navigation for today&apos;s and tomorrow&apos;s library catalogs</title>
<link>http://works.bepress.com/john_mark_ockerbloom/3</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/3</guid>
<pubDate>Wed, 23 Jan 2008 08:56:54 PST</pubDate>
<description>
	<![CDATA[
	<p>My ALA Mindwinter 2008 presentation slides on subject maps.  For more details on how subject maps are created, see the New Maps of the Library white paper from 2006.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

</item>






<item>
<title>New Maps of the Library: Building Better Subject Discovery Tools Using Library of Congress Subject Headings</title>
<link>http://works.bepress.com/john_mark_ockerbloom/2</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/2</guid>
<pubDate>Wed, 23 Jan 2008 08:26:33 PST</pubDate>
<description>
	<![CDATA[
	<p>We describe tools in development at the University of Pennsylvania to generate and display interactive "subject maps" for exploring library collections. Based on the Library  of Congress Subject Headings (LCSH), these maps are automatically built from existing  authority records, a collection's bibliographic records, and optional local "tweaks" for  local interests and search patterns. Users can explore these maps via ordinary text-  based web browsing, and browse clusters of related research resources. We now provide  these maps for small collections like The Online Books Page, and are experimenting with  maps for the entire Penn Library catalog. We hope to enable users to take full advantage  of the rich conceptual relationships in LCSH-based library collections, and effectively browse increasingly diverse and dispersed library collections.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Information discovery</category>

</item>






<item>
<title>Copyright and Provenance: Some Practical Problems</title>
<link>http://works.bepress.com/john_mark_ockerbloom/1</link>
<guid isPermaLink="true">http://works.bepress.com/john_mark_ockerbloom/1</guid>
<pubDate>Wed, 23 Jan 2008 08:26:31 PST</pubDate>
<description>
	<![CDATA[
	<p>Copyright clearance is an increasingly complex and expensive impediment to the digitization and reuse of information. Clearing copyright issues in a reliable and cost-effective manner for works created in the last 100 years can involve establishing complex provenance chains for the works, their copyrights, and their licenses. This paper gives an overview of some of the practical provenance-related issues and challenges in clearing copyrights at large scale, and discusses efforts to more efficiently gather and share information and its copyright provenance.</p>

	]]>
</description>

<author>John Mark Ockerbloom</author>


<category>Copyright</category>

</item>





</channel>
</rss>
