My research and teaching span information retrieval, human-computer interaction,
information seeking and use, scholarly communication, and bibliometrics. Since the latter
1990s, these themes have converged in the study of data and data practices, exploring how
observations, models, artifacts, and software become data; how these practices vary by
individual and by discipline; and how these findings can be employed in the design of
data collection, data management, data archiving, and science policy. 

Publications remain the currency of scholarship, despite substantial evolution in form
and function over a period of centuries. In the latter 20th century, data began to be
viewed as scholarly products in their own right. Factors contributing to the value of
research data include the transition from print to electronic publishing, the ability to
acquire and analyze large volumes of digital content in the sciences and humanities
alike, and policies that promote openness and transparency. 

On the surface, open access to data appears to offer vast benefits for research,
education, and innovation by leveraging public investments in research. Public policy
documents suggest that releasing data is an easy task to be accomplished at the time of
publishing articles or books, and that research data are yet another genre to be absorbed
by libraries and archives. Underlying these simple claims is a morass of theoretical,
social, policy, and practical problems. This morass has proven to be fertile ground for
research in information studies. 

How to use this site: 

This site contains entries for most of my publications, presentations, and course
syllabi. As governed by copyright agreements, entries may include final published
versions, submitted versions, working documents, slides, abstracts, and metadata. Links
to sources and to video recordings of presentations also are provided. 

As most of my works cover multiple topics, the elaborate subject classification has been
abandoned in favor of listing entries by format in reverse chronological order. Each
entry has subject categories and topical tags (data, scholarly communication, information
retrieval, bibliometrics, sensor networks, astronomy, humanities, and so on) that are
searchable in the box at the bottom right of the page. Other links at the right lead to
my UCLA homepage, research group, blog, Twitter feed, and email. 

The site is updated regularly with new publications, presentations, and other works.
Please subscribe to my mailing list for updates. 


Scholarship in the Digital Age: Information, Infrastructure, and the Internet (2010)

Scholars in all fields now have access to an unprecedented wealth of online information, tools,...



Signposts in Cyberspace: The Domain Name System and Internet Navigation (with Roger Levien, Robert S. Austein, Timothy Casey, Hugh Dubberly, Patrik Falstrom, Per-Kristian Halvorsen, Marylee Jenkins, John C. Klensin, Milton L. Mueller, Sharon Nelson, Craig Partridge, William Raduchel, and Hal R. Varian) (2005)

Articles, Papers, Posters, Reports, Book Chapters


10 Simple Rules for the Care and Feeding of Scientific Data (with Alyssa Goodman, Alberto Pepe, Alexander W. Blocker, Kyle Cranmer, Merce Crosas, Rosanne Di Stephano, Yolanda Gil, Paul Groth, Margaret Hedstrom, David W. Hogg, Vinay Kashyap, Ashish Mahabal, Aneta Siemiginowska, and Aleksandra Slavkovic), Data (2014)

This article offers a short guide to the steps scientists can take to ensure that...



(FORTHCOMING) "We’re Working on It:” Transferring the Sloan Digital Sky Survey from Laboratory to Library (with Sharon Traweek and Laura A. Wynholds), International Journal of Digital Curation (2014)

This paper reports on the transfer of a massive scientific dataset from a national laboratory...



(FORTHCOMING) Ship Space to Database: Scientific and Social Motivations for a Database to Support Deep Subseafloor Biosphere Research (with Peter Darch), Proceedings of the 77th American Society for Information Science and Technology (ASIS&T) Annual Meeting 2014, Seattle, WA (2014)

What motivates the building of databases by scientific collaborations? In this paper, we argue that...


(FORTHCOMING) The Ups and Downs of Knowledge Infrastructures in Science: Implications for Data Management (with Peter Darch, Ashley E. Sands, Jillian C. Wallis, and Sharon Traweek), Proceedings of the Joint Conference on Digital Libraries, 2014 (2014)

The promise of technology-enabled, data-intensive scholarship is predicated upon access to knowledge infrastructures that are...



If We Share Data, Will Anyone Use Them? Data Sharing and Reuse in the Long Tail of Science and Technology (with Jillian C. Wallis and Elizabeth Rolando), PLoS ONE (2013)

Research on practices to share and reuse data will inform the design of infrastructure to...


Presentations and Videos


Big Data, Little Data, No Data: Scholarship in the Networked World, University of Michigan School of Information: Yahoo Seminar Series (2014)

The enthusiasm for “big data” is obscuring the complexity and diversity of data in scholarship....



Big Data, Little Data, No Data: Scholarship in the Networked World, VALA 2014 (2014)

The enthusiasm for “big data” is obscuring the complexity and diversity of data in scholarship....



Big Data, Little Data, No Data: The Contested Landscape of Data Sharing and Reuse, Trends in Society and Information Technology Seminar Series (2013)

Scholars are being asked — by funding agencies and publishers alike — to release their...


Data Sharing: A Problem of Supply or of Demand?, Department of History (2013)

Knowledge sharing in science includes sharing research data. Research funding agencies have focused on increasing...



Why you should care about open data: Open Access Week thoughts on why research data rarely are reused, Open Access Week at UCLA (2013)

Scholarly knowledge-sharing includes sharing research data, but while the supply of data is growing rapidly,...




Research Data, Reproducibility, and Curation, Digital Social Research: A Forum for Policy and Practice, Oxford Internet Institute Invitational Symposium (2012)


Research Data: Who will share what, with whom, when, and why?, China-North America Library Conference, Beijing (2010)

The deluge of scientific research data has excited the general public, as well as the...



Social Aspects of Digital Libraries. Final Report to the National Science Foundation; Computer, Information Science, and Engineering Directorate; Division of Information, Robotics, and Intelligent Systems; Information Technology and Organizations Program (with Marcia J. Bates, Michele V. Bates, Efthimis N. Efthimiadis, Anne J. Gilliland-Swetland, Yasmin B. Kafai, Gregory H. Kafai, and Anthony B. Maddox), Award number 95-28808. (2006)

Science, Cyberinfrastructure, and Knowledge Communities: Leveraging Scientific Data for Educational Applications, Science and Cyberinfrastructure Session, Society for Social Studies of Science (4S) (2004)


Scientific data archiving: the state of the art in information, data, and metadata management (2003)

This white paper is the product of a one-year postdoctoral fellowship to study data archiving...


Contributions to Books


Preface, Evaluating and Measuring the Value, Use and Impact of Digital Collections (2012)


Embedded Sensor Networks, World Wide Research: Reshaping the Sciences and Humanities (2010)

Classroom evaluation of the Alexandria Digital Earth Prototype (ADEPT) (with Gregory H. Leazer, Anne J. Gilliland-Swetland, and Richard E. Mayer), Proceedings of the 63rd Annual Meeting of the American Society for Information Science (Chicago, IL, November 12-16, 2000) (2006)


Building digital libraries for scientific data: An exploratory study of data practices in habitat ecology (with Jillian C. Wallis and Noel Enyedy), 10th European Conference on Digital Libraries (2006)


The Interaction of Community and Individual Practices in the Design of a Digital Library, International Symposium on Digital Libraries and Knowledge Communities in Networked Information Society (2006)



Follow the Data: How astronomers use and reuse data (poster) (with Ashley Sands, Laura Wynholds, and Sharon Traweek) (2012)

We analyze the people and infrastructure involved in the building, sustaining, and curation of large...



IDRE Proposal: UCLA Data Registry System (with Todd Grappone, Gary Strong, Jeffrey Goldman, and Jillian Wallis) (2011)


CENSDC: Adding Context to Content (with Matthew Mayernik, Jillian C. Wallis, and Alberto Pepe), Center for Embedded Network Sensing (2007)

Scientists and engineers working with embedded networked sensing systems in the environmental sciences are acquiring...