Things What I Talk About

Tuesday, 23 November 2010

Unit 2: Data, Data, Everywhere...

What is in This Unit?
The history of web 2.0 and its effect on information behaviours.

Objective
By the end of this unit, a learner should:
  • Have a basic knowledge of the development of web 2.0
  • Understand how this change alters the ways we use and access data
Change Happens
The World Wide Web was first created in 1990 but Tim Berners-Lee. Berners-Lee was at that time working for CERN and wanted to help reduce the loss of information for such a distributed organisation. This system was envisioned as one that linked nodes (people or information) with other nodes. However, the traditional way of doing so was to use a tree structure - in which the top node contains all the nodes below it. If you want to go to a node on a different branch, you need to go back up the tree until you reach a common branch.

A way of countering this problem of back-tracking was using keywords to allow a user to go directly to information. However, people choose different keywords and a user would need to know the particular keywords that pertain to certain information. Hyperlinks create a distributed network in which keywords can link directly to other items in the network (much like the digital text mentioned in Web 2.0...The Machine is Us/ing Us from Unit 1).  These links, however, used to be used in a static environment - where a web master wrote a static site that a surfer read.


The Web is Ours
The older model is neither democratic nor what Berners-Lee was thinking of when he conceived the web. However, when we view the web as platform, it begins to fulfill more of its potential.This phenomenon, coupled with the rise of user-generated content and newer coding languages, has spurred a burst of data creation. Part of this burst is caused by the ease of sharing information - as shown in this little diagram:


This information sharing both mediates and is mediated by the rise of social media sites. Below is a time-line of such social sites. Note the boom in 2003-2005. For comparison, 2004 saw the beginning of Gmail, Google began digitising books for Google Books and MSN started a news-aggregator service.

We Create our Own Web
With all this data, finding the information we require could potentially be much more difficult. However,  new ways of organising (including reusing) data not only increased the pure amount of data but also aids in locating data. Let's watch another video:


This video focused on organising information - particularly with reference to the traditional method of categorisation.Google, however, does not use this traditional method. Instead, Google treats virtually every word within a web page or document as a keyword; there are limitless categories of unbounded data. This has led to library users who expect to find materials with only a few keywords. (Here is an interesting discussion of this newer way of categorising information). And here is a disconnect.

The majority of library cataloging uses the old system of placing the book into a category. The disconnect comes from creating the data - i.e. cataloging an item - and accessing the data - that is a patron attempting to locating an item. Whilst it is a generalisation to say that not everyone understands the old categorisation techniques, it is a growing trend. Our items may only be in one place at once - although there may be multiple formats - yet digital information is at once everywhere and nowhere. It may be accessed via multiple paths and ordered by the user. Physical items in the collection, however, are located in only one place and the user is not able to add their own order onto this system.

Assessment
In your blog, post on your interpretation of the history of Web 2.0 and how it alters information literacy behaviours. How can we use the new approaches to data to better match library processes to patron requirements?

Further Reading
For background on web architecture here and here.
Another video by Michael Wesch - a recording of a lecture:

How the Library of Congress uses Web 2.0 to improve information literacy.
Information on the National Library of Australia's Newspaper Digitisation Project. Including tagging functionality.