Showing posts with label Collaboration. Show all posts
Showing posts with label Collaboration. Show all posts

Thursday, January 27, 2011

The New York Times tries something new: Annotated text via prostitution

The New York Times of January 27, 2011 has a blog entry: A Guide to Houses No Gentleman Would Dare to Frequent by Alison Leigh Cowan.  It's an examination of a little known guidebook from 1870 which reviews 150 brothels existing in the city at that time. The copy that Cowan found is located in the New-York Historical Society.

What interests me about the article is that the Times has gone a step further than usual, and has reproduced a digital copy of the entire book.  This was my first encounter with http://documents.nytimes.com, apparently a new place that allows the Times to store large amounts of text (51 pages of school complaints about  Jared Loughner, 26 pages of poll results, etc.) to supplement articles in the newspaper.  Most of the documents have not undergone OCR, but the guide to prostitution has (unfortunately rather poorly - it would have easy to correct a 32-page book).

What really got my attention was that they have added annotations by Cowan to the digital book, using excepts from her blog post to explain portions of the text.

Whoa!  This is a way to go.  There have been so many attempts to create a web tool for annotating web pages and documents.  But to be able to do it and share it with others is part of the key.  

Of course I wish libraries could lead the troops in doing this.  I know there are so many wanted texts that people want to pour over and dissect, like a page of Talmud.  Why aren't the tools out there, and why aren't libraries (even academic libraries) at the forefront of this opportunity?  I can forsee Wikimedia setting up a WikiTexts website to deal with public domain texts and the desire to study, analyze, and elucidate them among a community.

Friday, August 8, 2008

RBMS Plenary Session 5: If We Build It, Will They Come?

Before the fifth plenary session, there were some words concerning next year's RBMS pre-conference: It'll be a RBMS 50th anniversary conference celebration with a program devoted to nostalgia: "Seas of change: navigating cultural and institutional contexts of special collections." It'll be a critical look at where we have been, where we are, and where we are going, combined with self-examination. RBMS hopes to have new and emerging voices at the pre-conference, including new librarians, those with career changes, and underrepresented librarians.

As in my previous posts, the words following are not to be taken as an exact transcription of the speakers' presentations. There's no guarantee that I captured their words and at times, I was only able to get notes. I take responsibility for any errors.

RBMS Plenary Session 5: If We Build It, Will They Come?
Friday, June 27

Before proceeding with the 2 speakers, the session began with an introductory quote from Albert Manguel's A History of Reading:
It is the reader who grants or recognizes in an object, place or event a certain possible readability; it is the reader who must attribute meaning to a system of signs, and then decipher it. We all read ourselves and the world around us in order to glimpse what and where we are. We read to understand, or to begin to understand. We cannot do but read. Reading, almost as much as breathing, is out essential function...reading is at the beginning of the social contract.

1. Medieval Manuscripts and Digital Approaches - Matthew Fisher, Department of English, UCLA.
Let's take a trip down digital memory lane. In the early history of the web, medieval stuff was more elusive. An example of one page left from early digitization efforts in 1993: A brief description, and an isolated image - that's it.

The idea of a library exhibition is very interesting, but requires lots of time and resources, but presents a disconnect.

Catalogs are by no means are complete; they're not crowd pleasers. Medieval manuscripts are also elusive in catalogs, which often depend on titles, dates, and authors -- exactly those identifying traits that many medieval manuscripts lack. In the British Library catalog not all manuscripts come up when one knows that institution has them (e.g. BL and Piers Ploughman). Also consider the difficulty of locating them in a catalog because earlier English has letters that are not translatable to today's language. (He showed examples.)

Today we have things such as the Digital Scriptorum (a consortium project), an image database of images of medieval manuscripts. In a questionnaire showing user needs, the results showed that people seek specific manuscripts, and are looking for information in the script, although the site is for images. In the digital environment, text = digital text, i.e. that which can be searched. We're accepting the limitation of OCR of early medieval prints (which look like handwriting). The expectations of what can be done limits what scholars can do with them.

But most digital projects involving medieval manuscripts are like tearing leaves from books: the text is struck through and you're left with a catchword (which admittedly can be a fascinating study). But what about scholars whose interest is in alterations of the text? How does one encode such emendations and corrections in the digital environment? It leads to the question: What is the subject: the text, or the corrections?

As a pedagogical resource, these digital images are a godsend. The access they give is also a good thing. Is digital access to selections of a manuscript just a precursor to physical object? If so, why are we engaging in these objects?

The [name of unidentified project] is a catalog of digitized medieval manuscripts, a project of UCLA and Fisher. [See UCLA's Center For Medieval and Rennaissance Studies project page for more projects.]

Who knows what scholar is doing at any time? This is important information to find out, and scholars can't find it out. Just trying to provide access points to medieval manuscripts out there is a problem in itself (official library access points versus the access point needed and desired by the scholarly community). On this [unnamed project] there is no federated search, because Fisher prefers not.

CERL Portal - contains manuscripts and dearly printed material, but has logged in only 2,000 visitors. It requires catalogs to be in an OAI format (to communicate with other websites and portals).

What about research using digital collections? People can do what they want with them, despite what metadata a library provides. There's an issue of imposing limitations on what can be done. Why not have manuscripts in OAI? Why not allow users to provide links, tags, search on their own [user-generated] terms. Why not allow mash-ups, e.g. as the way people use Google maps [i.e. combining Google maps with other applications]. He wants to see mashups of medieval manuscripts. It's a way to foster dialog between patrons and keepers.

2. Strategies for teaching and research - Stephen Davison (Head, UCLA Digital Program):
He spoke about the Digital Library Program at UCLA. Their aims include:
  • creating digital collections
  • looking for synergies with research, teaching, ahd learning
  • digital preservation
  • development of access and use tools standards
  • fostering cooperative projects: campus, statewide, national, international
What are we building within digital projects? Some? All? The "important" parts? Should we digitize complete objects or logical groupings? Should we produce a mirror of a physical collection or some other artificial grouping?

Whatever they may be, digital objects (whether items or fragments) are an important aspect of collection development.

What about the finding aid - should we tie to it, or provide another means of access? Some examples are: OAC (Online Archive of California), and CALISPHERE
(These sites provide different windows for different audiences of a single digital collection.)

Who are the collections for? The saying goes "will they come?" But who are they? (Concerning the Sheet Music Consortium, we've made assumptions of who uses it, but we're not sure; we are trying to design methods to discover who is using it.

There are methods where we can obtain feedback on our digital collections:
  • digitization requests
  • offers of metadata
  • personal communications
  • legal communications
  • statistics
Digitization vs. digital projects. Providing access vs. providing service layers. [The new standard for OAI - protocol for digital exchange).

Currently we offer conventional access to digital files, usually through library catalogs. But we need to start thinking about web 2.0 features - using users to provide more information, more features, e.g. making guides more wiki-like, ability to tag, annotate, recombine, and do mash-ups. This upload of digital content by users will lead to the creation of new knowledge and understanding.

Popular culture is now a big focus of web 2.0 ideas - e.g. Frontera Collection of Mexican-American music (sound recordings).
(Topics related to American Orientalism - i.e. diverse populations in the US - are also popular.)

What about mass digitization of special collections? Some issues:
  • a matter of time and money
  • there should be careful digitization on a grand scale
  • there are parallels to microfilm; but
  • digitization is for broad access; it's not yet considered a mode of preservation
Aspects of digital library services:
  • digitization occurs on demand
  • provides access to details available in the master image not available to unaided eye
  • feature web 2.0 attributes
  • makes metadata harvestable
  • encourages the "resue and exchange" of information
Proactive digitization (UCLA plans)

Digitization can foster scholarly collaborations. Some examples:
There are some digitization issues here. A good example: Cuneiform tablets are not just text, but are three dimensional objects, making their digitization problematic. It's an issue that's being wrestled in the digital worlds of libraries versus museums.

With the Hypercities project (and others), Stephen foresees the growth of cooperative digitization and metadata harvesting among various institutions.

Elements of the virtual research environment:
  • web 2.0 tools
  • collaboration, or not, as needed (small working groups classes, open to public)
  • gathering and sharing of metadata and/or digital objects
  • publishing
In the question and answer period following the talks, Terry Belanger spoke briefly, and reminded all of the hullabaloo that greeted microfilming. In refererence to microfilm's passing, he said: "Digitization is the next thing; it too shall pass." Of course he got a big laugh, but many erupted into applause.

Another person's question was left unanswered: whether the scholarly use of digitized information has the same status as the actual study of objects.

Wednesday, April 2, 2008

How a library can become a part of the community

I came across a nice site devoted to the history of Long Island's South Shore. Knowing a number of towns on Long Island, I know there are some places where families have lived for centuries, so one can't underestimate what a community can contribute to documenting itself.

What's so intriguing about the site is that it's not static. Instead, it's a wiki (using PmWiki software), enabling residents, historians and other interested parties to participate in sharing its knowledge of the history of the community. I believe that is going to be the future of the best websites (i.e. the ones that get the most hits) - their ability to be interactive, to enable participation, and their ability to change.

If that were not enough, I then realized the sponsor of the site: not some association, but the Dowling College Library Archives and Special Collections! Here is a library which takes an active role in fostering the community it serves (or at least offers a community project).

It's probably too much to ask most librarians (especially those with large collections) to assume such active roles in communities. But it does suggest the possibilities than can occur when librarians find a niche not covered by others.

Sunday, August 12, 2007

Enterprise 2.0, Wikipedia and Facism

(I've become quieter than usual because my employer unleashed a bombshell: blogs for the entire staff!! That's well over 1,000 people! So I've focusing on (internal) blogging for my institution (sorry - not visible to the public), but hope to get back here more often.)

I began noticing the phrase Enterprise 2.0 before I realized it was used by Professor Andrew McAfee of the Harvard Business School. (He thought he invented the term in 2006, only realizing later that it was first used in 2001.) I began taking notice with a good article in Harvard Business School's weekly ezine Working Knowledge. Entitled How Wikipedia Works (or Doesn't), author Sean Silverthone succinctly recounted McAfee's experience dealing with Wikipedia and how it serves as a model for collaboration - with both positive and negative attributes.

It fascinated me, so I headed to Wikipedia's (now-defunct) article on Enterprise 2.0. Seeing it incomplete, I added several links, expanded the definition, and made a few more corrections, while arguing on the talk page about why the decision for deletion should be rescinded.

Despite McAfee's credentials as an author of serious work, his and others' arguments could not withstand the fury of anonymous Wikipedians, who wielded their power (which ultimately was more important than factual data) in eradicating the Enterprise 2.0 article by merging it with the more generic article on Enterprise Social Software. In his summary, Silverthorne pointed out that this represents top-down administration on the part of Wikipedia, ironically a type of adminisration which is the opposite of what Wikipedia represents. I call it fascism. :) It reminds me of what happens in certain countries which give the illusion of democracy until it becomes uncomfortable - and then take top-heavy action to roll back the uncomfortable attitudes.

What does this have to do with librarianship? It's about the hurdles of library administration in a 2.o world. For example, if a library adopts a 2.0 attitudes, adopting a flatter way of communicating, allowing decisions to be made from the botttom, but then suddenly issues a top-down ultimatum, that would really quash the functioning of the institution.