Database as a Genre of New Media [1]

Lev Manovich



The Database Logic

After the novel, and subsequently cinema privileged narrative as the key form of cultural expression of the modern age, the computer age introduces its correlate - database. Many new media objects do not tell stories; they don't have beginning or end; in fact, they don't have any development, thematically, formally or otherwise which would organize their elements into a sequence. Instead, they are collections of individual items, where every item has the same significance as any other.

Why does new media favor database form over others? Can we explain its popularity by analyzing the specificity of the digital medium and of computer programming? What is the relationship between database and another form, which has traditionally dominated human culture - narrative? These are the questions I will address in this article.

Before proceeding I need to comment on my use of the word database. In computer science database is defined as a structured collection of data. The data stored in a database is organized for fast search and retrieval by a computer and therefore it is anything but a simple collection of items. Different types of databases - hierarchical, network, relational and object-oriented - use different models to organize data. For instance, the records in hierarchical databases are organized in a treelike structure. Object-oriented databases store complex data structures, called "objects," which are organized into hierarchical classes that may inherit properties from classes higher in the chain. [2] New media objects may or may not employ these highly structured database models; however, from the point of view of user's experience a large proportion of them are databases in a more basic sense. They appear as a collections of items on which the user can perform various operations: view, navigate, search. The user experience of such computerized collections is therefore quite distinct from reading a narrative or watching a film or navigating an architectural site. Similarly, literary or cinematic narrative, an architectural plan and database each present a different model of what a world is like. It is this sense of database as a cultural form of its own which I want to address here. Following art historian Ervin Panofsky's analysis of linear perspective as a "symbolic form" of the modern age, we may even call database a new symbolic form of a computer age (or, as philosopher Jean-Francois Lyotard called it in his famous 1979 book Postmodern Condition, "computerized society"), [3] a new way to structure our experience of ourselves and of the world. Indeed, if after the death of God (Nietzche), the end of grand Narratives of Enlightenment (Lyotard) and the arrival of the Web (Tim Berners-Lee) the world appears to us as an endless and unstructured collection of images, texts, and other data records, it is only appropriate that we will be moved to model it as a database. But it is also appropriate that we would want to develops poetics, aesthetics, and ethics of this database.

Let us begin by documenting the dominance of database form in new media. The most obvious examples of this are popular multimedia encyclopedias, which are collections by their very definition; as well as other commercial CD-ROM titles which are collections as well - of recipes, quotations, photographs, and so on. [4] The identity of a CD-ROM as a storage media is projected onto another plane, becoming a cultural form of its own. Multimedia works which have "cultural" content appear to particularly favor the database form. Consider, for instance, the "virtual museums" genre - CD-ROMs which take the user on a "tour" through a museum collection. A museum becomes a database of images representing its holdings, which can be accessed in different ways: chronologically, by country, or by artist. Although such CD-ROMs often simulate the traditional museum experience of moving from room to room in a continuous trajectory, this "narrative" method of access does not have any special status in comparison to other access methods offered by a CD-ROM. Thus the narrative becomes just one method of accessing data among others. Another example of a database form is a multimedia genre which does not has an equivalent in traditional media - CD-ROMs devoted to a single cultural figure such as a famous architect, film director or writer. Instead of a narrative biography we are presented with a database of images, sound recordings, video clips and/or texts which can be navigated in a variety of ways.

CD-ROMs and other digital storage media (floppies, and DVD-ROMs) proved to be particularly receptive to traditional genres which already had a database-like structure, such as a photo-album; they also inspired new database genres, like a database biography. Where the database form really flourished, however, is on the Internet. As defined by original HTML, a Web page is a sequential list of separate elements: text blocks, images, digital video clips, and links to other pages. It is always possible to add a new element to the list - all you have to do is to open a file and add a new line. As a result, most Web pages are collections of separate elements: texts, images, links to other pages or sites. A home page is a collection of personal photographs. A site of a major search engine is a collection of numerous links to other sites (along with a search function, of course). A site of a Web-based TV or radio station offers a collections of video or audio programs along with the option to listen to the current broadcast; but this current program is just one choice among many other programs stored on the site. Thus the traditional broadcasting experience, which consisted solely of a real-time transmission, becomes just one element in a collection of options. Similar to the CD-ROM medium, the Web offered fertile ground to already existing database genres (for instance, bibliography) and also inspired the creation of new ones such as the sites devoted to a person or a phenomenon (Madonna, Civil War, new media theory, etc.) which, even if they contain original material, inevitably center around the list of links to other Web pages on the same person or phenomenon.

The open nature of the Web as medium (Web pages are computer files which can always be edited) means that the Web sites never have to be complete; and they rarely are. The sites always grow. New links are being added to what is already there. It is as easy to add new elements to the end of list as it is to insert them anywhere in it. All this further contributes to the anti-narrative logic of the Web. If new elements are being added over time, the result is a collection, not a story. Indeed, how can one keep a coherent narrative or any other development trajectory through the material if it keeps changing?

Data and Algorithm

Of course not all new media objects are explicitly databases. Computer games, for instance, are experienced by their players as narratives. In a game, the player is given a well-defined task - winning the match, being first in a race, reaching the last level, or reaching the highest score. It is this task which makes the player experience the game as a narrative. Everything which happens to her in a game, all the characters and objects she encounters either take her closer to achieving the goal or further away from it. Thus, in contrast to the CD-ROM and Web databases, which always appear arbitrary since the user knows that additional material could have been added without in any way modifying the logic of the database, in a game, from a user's point of view, all the elements are motivated ( i.e., their presence is justified). [5]

Often the narrative shell of a game ("you are the specially trained commando who has just landed on a Lunar base; your task is to make your way to the headquarters occupied by the mutant base personnel...") masks a simple algorithm well-familiar to the player: kill all the enemies on the current level, while collecting all treasures it contains; go to the next level and so on until you reach the last level. Other games have different algorithms. Here is an algorithm of the legendary "Tetris": when a new block appears, rotate it in such a way so it will complete the top layer of blocks on the bottom of the screen making this layer disappear. The similarity between the actions expected from the player and computer algorithms is too uncanny to be dismissed. While computer games do not follow database logic, they appear to be ruled by another logic - that of an algorithm. They demand that a player executes an algorithm in order to win.

An algorithm is the key to the game experience in a different sense as well. As the player proceeds through the game, she gradually discovers the rules which operate in the universe constructed by this game. She learns its hidden logic, in short its algorithm. Therefore, in games where the game play departs from following an algorithm, the player is still engaged with an algorithm, albeit in another way: she is discovering the algorithm of the game itself. I mean this both metaphorically and literally: for instance, in a first person shooter, such as "Quake," the player may eventually notice that under such and such condition the enemies will appear from the left, i.e. she will literally reconstruct a part of the algorithm responsible for the game play. Or, in a diffirent formulation of the legendary author of Sim games Will Wright, "Playing the game is a continuos loop between the user (viewing the outcomes and inputting decisions) and the computer (calculating outcomes and displaying them back to the user). The user is trying to build a mental model of the computer model." [6]

What we encountered here is an example of the general principle of new media: the projection of the ontology of a computer onto culture itself. If in physics the world is made of atoms and in genetics it is made of genes, computer programming encapsulates the world according to its own logic. The world is reduced to two kinds of software objects which are complementary to each other: data structures and algorithms. Any process or task is reduced to an algorithm, a final sequence of simple operations which a computer can execute to accomplish a given task. And any object in the world - be it the population of a city, or the weather over the course of a century, a chair, a human brain - is modeled as a data structure, i.e. data organized in a particular way for efficient search and retrieval. [7] Examples of data structures are arrays, linked lists and graphs. Algorithms and data structures have a symbiotic relationship. The more complex the data structure of a computer program, the simpler the algorithm needs to be, and vice versa. Together, data structures and algorithms are two halves of the ontology of the world according to a computer.

The computerization of culture involves the projection of these two fundamental parts of computer software - and of the computer's unique ontology - onto the cultural sphere. If CD-ROMs and Web databases are cultural manifestations of one half of this ontology - data structures, then computer games are manifestations of the second half - algorithms. Games (sports, chess, cards, etc.) are one cultural form which required algorithm-like behavior from the players; consequently, many traditional games were quickly simulated on computers. In parallel, new genres of computer games came into existence such as a first person shooter ("Doom," "Quake"). Thus, as it was the case with database genres, computer games both mimic already existing games and create new game genres.

It may appear at first sight that data is passive and algorithm is active - another example of passive-active binary categories so loved by human cultures. A program reads in data, executes an algorithm, and writes out new data. We may recall that before "computer science" and "software engineering" became established names for the computer field, it was called "data processing." This name remained in use for a few decades during which computers were mainly associated with performing calculations over data. However, the passive/active distinction is not quite accurate since data does not just exist - it has to be generated. Data creators have to collect data and organize it, or create it from scratch. Texts need to written, photographs need to be taken, video and audio need to be recorded. Or they need to be digitized from already existing media. In the 1990Ís, when the new role of a computer as a Universal Media Machine became apparent, already computerized societies went into a digitizing craze. All existing books and video tapes, photographs and audio recordings started to be fed into computers at an ever increasing rate. Steven Spielberg created the Shoah Foundation which videotaped and then digitized numerous interviews with Holocaust survivors; it would take one person forty years to watch all the recorded material. The editors of Mediamatic journal, who devoted a whole issue to the topic of "the storage mania" (Summer 1994) wrote: "A growing number of organizations are embarking on ambitious projects. Everything is being collected: culture, asteroids, DNA patterns, credit records, telephone conversations; it doesn't matter." [8] Once it is digitized, the data has to be cleaned up, organized, indexed. The computer age brought with it a new cultural algorithm: reality-> media-> data->database. The rise of the Web, this gigantic and always changing data corpus, gave millions of people a new hobby or profession: data indexing. There is hardly a Web site which does not feature at least a dozen links to other sites, therefore every site is a type of database. And, with the rise of Internet commerce, most large-scale commercial sites have become real databases, or rather front-ends to company databases. For instance, in the Fall of 1998, Amazon.com, an online book store, had 3 million books in its database; and the maker of leading commercial database Oracle has offered Oracle 8i, fully intergrated with the Internet and featuring unlimited database size, natural-langauge queries and support for all multimedia data types. [9] Jorge Luis Borges's story about a map which was equal in size to the territory it represented became re-written as the story about indexes and the data they index. But now the map has become larger than the territory. Sometimes, much larger. Porno Web sites exposed the logic of the Web to its extreme by constantly re-using the same photographs from other porno Web sites. Only rare sites featured the original content. On any given date, the same few dozen images would appear on thousands of sites. Thus, the same data would give rise to more indexes than the number of data elements themselves.

Database and Narrative

As a cultural form, database represents the world as a list of items and it refuses to order this list. In contrast, a narrative creates a cause-and-effect trajectory of seemingly unordered items (events). Therefore, database and narrative are natural enemies. Competing for the same territory of human culture, each claims an exclusive right to make meaning out of the world.

In contrast to most games, most narratives do not require algorithm-like behavior from their readers. However, narratives and games are similar in that the user, while proceeding through them, must uncover its underlying logic - its algorithm. Just like a game player, a reader of a novel gradually reconstructs an algorithm (here I use it metaphorically) which the writer used to create the settings, the characters, and the events. From this perspective, I can re-write my earlier equations between the two parts of the computer's ontology and its corresponding cultural forms. Data structures and algorithms drive different forms of computer culture. CD-ROMs, Web sites and other new media objects which are organized as databases correspond to the data structure; while narratives, including computer games, correspond to the algorithms.

In computer programming, data structures and algorithms need each other; they are equally important for a program to work. What happens in a cultural sphere? Do databases and narratives have the same status in computer culture?

Some media objects explicitly follow database logic in their structure while others do not; but behind the surface practically all of them are databases. In general, creating a work in new media can be understood as the construction of an interface to a database. In the simplest case, the interface simply provides the access to the underlying database. For instance, an image database can be represented as a page of miniature images; clicking on a miniature will retrieve the corresponding record. If a database is too large to display all of its records at once, a search engine can be provided to allow the user to search for particular records. But the interface can also translate the underlying database into a very different user experience. The user may be navigating a virtual three-dimensional city composed from letters, as in Jeffrew Shaw's interactive installation "Legible City." [10] Or she may be traversing a black and white image of a naked body, activating pieces of text, audio and video embedded in its skin (Harwood's CD-ROM "Rehearsal of Memory.") [11] Or she may be playing with virtual animals which come closer or run away depending upon her movements (Scott Fisher et al, VR installation, "Menagerie.") [12] Although each of these works engages the user in a set of behaviors and cognitive activities which are quite distinct from going through the records of a database, all of them are databases. "Legible City" is a database of three-dimensional letters which make up the city. "Rehearsal of Memory" is a database of texts and audio and video clips which are accessed through the interface of a body. And "Menagerie" is a database of virtual animals, including their shapes, movements and behaviors.

Database becomes the center of the creative process in the computer age. Historically, the artist made a unique work within a particular medium. Therefore the interface and the work were the same; in other words, the level of an interface did not exist. With new media, the content of the work and the interface become separate. It is therefore possible to create different interfaces to the same material. These interfaces may present different versions of the same work, as in David Blair's Wax Web. [13] Or they may be radically different from each other, as in Moscow WWW Art Centre. [14] This is one of the ways in which the general principle of variability of new media manifests itself. The new media object consists of one or more interfaces to a database of multimedia material. If only one interface is constructed, the result will be similar to a traditional art object; but this is an exception rather than the norm.

This formulation places the opposition between database and narrative in a new light, thus redefining our concept of narrative. The "user" of a narrative is traversing a database, following links between its records as established by the database's creator. An interactive narrative (which can be also called "hyper-narrative" in an analogy with hypertext) can then be understood as the sum of multiple trajectories through a database. A traditional linear narrative is one, among many other possible trajectories; i.e. a particular choice made within a hyper-narrative. Just as a traditional cultural object can now be seen as a particular case of a new media object (i.e., a new media object which only has one interface), traditional linear narrative can be seen as a particular case of a hyper-narrative.

This "technical," or "material" change in the definition of narrative does not mean that an arbitrary sequence of database records is a narrative. To qualify as a narrative, a cultural object has to satisfy a number of criteria, which literary scholar Mieke Bal defines as follows: it should contain both an actor and a narrator; it also should contain three distinct levels consisting of the text, the story, and the fabula; and its "contents" should be "a series of connected events caused or experienced by actors." [15] Obviously, not all cultural objects are narratives. However, in the world of new media, the word "narrative" is often used as all-inclusive term, to cover up the fact that we have not yet developed a language to describe these new strange objects. It is usually paired with another over-used word - interactive. Thus, a number of database records linked together so that more than one trajectory is possible, is assumed to be constitute "interactive narrative." But to just create these trajectories is of course not sufficient; the author also has to control the semantics of the elements and the logic of their connection so that the resulting object will meet the criteria of narrative as outlined above. Another erroneous assumption frequently made is that by creating her own path (i.e., choosing the records from a database in a particular order) the user constructs her own unique narrative. However, if the user simply accesses different elements, one after another, in a usually random order, there is no reason to assume that these elements will form a narrative at all. Indeed, why should an arbitrary sequence of database records, constructed by the user, result in "a series of connected events caused or experienced by actors"?

In summary, database and narrative do not have the same status in computer culture. In the database / narrative pair, database is the unmarked term. [16] Regardless of whether new media objects present themselves as linear narratives, interactive narratives, databases, or something else, underneath, on the level of material organization, they are all databases. In new media, the database supports a range of cultural forms which range from direct translation (i.e., a database stays a database) to a form whose logic is the opposite of the logic of the material form itself - a narrative. More precisely, a database can support narrative, but there is nothing in the logic of the medium itself which would foster its generation. It is not surprising, then, that databases occupy a significant, if not the largest, territory of the new media landscape. What is more surprising is why the other end of the spectrum - narratives - still exist in new media.




Notes:

1. This article develops ideas which were first presented by me at "Chips and Bits" Symposium, UCLA School of Film and Television, May 1997. The role of database in new media was further explored in the symposium "Computing Culture: Defining New Media Genres" which I organized the Center for Computing in the Arts (CRCA) at the University of California, San Diego, May 1-2, 1998. See http://jupiter.ucsd.edu/~culture/symposium.html.
2. "database" Britannica Online. http://www.eb.com:180/cgi-bin/g?DocF=micro/160/23.html [Accessed 27 November 1998].
3. Jean-Francois Lyotard, The Postmodern Condition: A Report on Knowledge, trans. Geoff Bennington and Brian Massumi (Minneapolis: University of Minnesota Press, 1984), 3.
4. As early as 1985 Grolier, Inc. issued text-only "Academic American Encyclopedia" on CD-ROM. First multimedia encyclopedia was "Compton's MultiMedia Encyclopedia" published in 1989.
5. David Bordwell and Kristin Thompson define motivation in cinema in the following way: "Because films are human constructs, we can expect that any one element in a film will have some justification for being there. This justification is the motivation for that element." Here are some examples of motivation: "When Tom jumps from the balloon to chase a cat, we motivate his action by appealing to notions of how dogs are likely to act when cats are around." "The movement of a character across a room may motivate the moving of the camera to follow the action and keep the character within a frame." David Bordwell and Kristin Thompson, Film Art: an Introduction. 5th Edition (New York: The McGraw-Hill Companies, Inc., 1997), 80.
6. Chris McGowan and Jim McCullaugh, Entertainment in the Cyber Zone (New York: Random House, 1995), 71.
7. This is true for a procedural programming paradigm. In a object-oriented programming paradigm, represented by such computer langauges as Java and C++, algorithms and data structures are modeled together as objects.
8. Mediamatic 8, no. 1 (Summer 1994), 1860.
9. http://www.amazon.com/exec/obidos/subst/misc/company-info.html/, http://www.oracle.com/database/oracle8i/, accessed Nov. 28, 1998. 10. http://artnetweb.com/guggenheim/mediascape/shaw.html
11. Harwood. Rehearsal of Memory, CD-ROM (London: Artec and Bookworks, 1996.)
12. http://www.telepresence.com/MENAGERIE,
accessed October 22, 1998.
13. http://jefferson.village.virginia.edu/wax/, accessed September 12, 1998.
14. http://www.cs.msu.su/wwwart/, accessed October 22, 1998.
15. Mieke Bal, Naratology: Introduction to the Theory of Narrative (Toronto: University of Toronto Press, 1985), 8.
16. The theory of marketdness was first developed by linguists of the Prague School in relation to phonology but subsequently applied to all levels of linguistic analysis. For example, "bitch" is the marked term and "dog" is unmarked term. Whereas the "bitch" is used only in relation to females, "dog" is applicable to both males and females.



main AI & Society Contents Bio