Apr
07
2009

Google Books and Copyright

Harvard Librarian Robert Darnton offers a fascinating discussion of the history of copyright in the context of Google’s publication of all digital books

Our republic was founded on faith in the central principle of the eighteenth-century Republic of Letters: the diffusion of light. For Jefferson, enlightenment took place by means of writers and readers, books and libraries—especially libraries, at Monticello, the University of Virginia, and the Library of Congress. This faith is embodied in the United States Constitution. Article 1, Section 8, establishes copyright and patents “for limited times” only and subject to the higher purpose of promoting “the progress of science and useful arts.” The Founding Fathers acknowledged authors’ rights to a fair return on their intellectual labor, but they put public welfare before private profit.

How to calculate the relative importance of those two values? As the authors of the Constitution knew, copyright was created in Great Britain by the Statute of Anne in 1710 for the purpose of curbing the monopolistic practices of the London Stationers’ Company and also, as its title proclaimed, “for the encouragement of learning.” At that time, Parliament set the length of copyright at fourteen years, renewable only once. The Stationers attempted to defend their monopoly of publishing and the book trade by arguing for perpetual copyright in a long series of court cases. But they lost in the definitive ruling of Donaldson v. Becket in 1774.

When the Americans gathered to draft a constitution thirteen years later, they generally favored the view that had predominated in Britain. Twenty-eight years seemed long enough to protect the interests of authors and publishers. Beyond that limit, the interest of the public should prevail. In 1790, the first copyright act—also dedicated to “the encouragement of learning”—followed British practice by adopting a limit of fourteen years renewable for another fourteen.

How long does copyright extend today? According to the Sonny Bono Copyright Term Extension Act of 1998 (also known as “the Mickey Mouse Protection Act,” because Mickey was about to fall into the public domain), it lasts as long as the life of the author plus seventy years. In practice, that normally would mean more than a century. Most books published in the twentieth century have not yet entered the public domain. When it comes to digitization, access to our cultural heritage generally ends on January 1, 1923, the date from which great numbers of books are subject to copyright laws.

Darnton relates this change in law to changes in the distribution networks of knowledge:

As the Enlightenment faded in the early nineteenth century, professionalization set in. You can follow the process by comparing the Encyclopédie of Diderot, which organized knowledge into an organic whole dominated by the faculty of reason, with its successor from the end of the eighteenth century, the Encyclopédie méthodique, which divided knowledge into fields that we can recognize today: chemistry, physics, history, mathematics, and the rest. In the nineteenth century, those fields turned into professions, certified by Ph.D.s and guarded by professional associations. They metamorphosed into departments of universities, and by the twentieth century they had left their mark on campuses—chemistry housed in this building, physics in that one, history here, mathematics there, and at the center of it all, a library, usually designed to look like a temple of learning.Along the way, professional journals sprouted throughout the fields, subfields, and sub-subfields. The learned societies produced them, and the libraries bought them. This system worked well for about a hundred years. Then commercial publishers discovered that they could make a fortune by selling subscriptions to the journals. Once a university library subscribed, the students and professors came to expect an uninterrupted flow of issues. The price could be ratcheted up without causing cancellations, because the libraries paid for the subscriptions and the professors did not. Best of all, the professors provided free or nearly free labor. They wrote the articles, refereed submissions, and served on editorial boards, partly to spread knowledge in the Enlightenment fashion, but mainly to advance their own careers.

The result stands out on the acquisitions budget of every research library: the Journal of Comparative Neurology now costs $25,910 for a year’s subscription; Tetrahedron costs $17,969 (or $39,739, if bundled with related publications as a Tetrahedron package); the average price of a chemistry journal is $3,490; and the ripple effects have damaged intellectual life throughout the world of learning. Owing to the skyrocketing cost of serials, libraries that used to spend 50 percent of their acquisitions budget on monographs now spend 25 percent or less. University presses, which depend on sales to libraries, cannot cover their costs by publishing monographs. And young scholars who depend on publishing to advance their careers are now in danger of perishing.

Darnton offers this history in the context of Google’s settlement with the book publishers, reached in October 2008, which

creates an enterprise known as the Book Rights Registry to represent the interests of the copyright holders. Google will sell access to a gigantic data bank composed primarily of copyrighted, out-of-print books digitized from the research libraries. Colleges, universities, and other organizations will be able to subscribe by paying for an “institutional license” providing access to the data bank. A “public access license” will make this material available to public libraries, where Google will provide free viewing of the digitized books on one computer terminal. And individuals also will be able to access and print out digitized versions of the books by purchasing a “consumer license” from Google, which will cooperate with the registry for the distribution of all the revenue to copyright holders. Google will retain 37 percent, and the registry will distribute 63 percent among the rightsholders.

Meanwhile, Google will continue to make books in the public domain available for users to read, download, and print, free of charge. Of the seven million books that Google reportedly had digitized by November 2008, one million are works in the public domain; one million are in copyright and in print; and five million are in copyright but out of print. It is this last category that will furnish the bulk of the books to be made available through the institutional license.

Many of the in-copyright and in-print books will not be available in the data bank unless the copyright owners opt to include them. They will continue to be sold in the normal fashion as printed books and also could be marketed to individual customers as digitized copies, accessible through the consumer license for downloading and reading, perhaps eventually on e-book readers such as Amazon’s Kindle. . .

[H]ere is a proposal that could result in the world’s largest library. It would, to be sure, be a digital library, but it could dwarf the Library of Congress and all the national libraries of Europe. Moreover, in pursuing the terms of the settlement with the authors and publishers, Google could also become the world’s largest book business—not a chain of stores but an electronic supply service that could out-Amazon Amazon. . .

Google is not a guild, and it did not set out to create a monopoly. On the contrary, it has pursued a laudable goal: promoting access to information. But the class action character of the settlement makes Google invulnerable to competition. Most book authors and publishers who own US copyrights are automatically covered by the settlement. They can opt out of it; but whatever they do, no new digitizing enterprise can get off the ground without winning their assent one by one, a practical impossibility, or without becoming mired down in another class action suit. If approved by the court—a process that could take as much as two years—the settlement will give Google control over the digitizing of virtually all books covered by copyright in the United States.

This outcome was not anticipated at the outset. Looking back over the course of digitization from the 1990s, we now can see that we missed a great opportunity. Action by Congress and the Library of Congress or a grand alliance of research libraries supported by a coalition of foundations could have done the job at a feasible cost and designed it in a manner that would have put the public interest first. By spreading the cost in various ways—a rental based on the amount of use of a database or a budget line in the National Endowment for the Humanities or the Library of Congress—we could have provided authors and publishers with a legitimate income, while maintaining an open access repository or one in which access was based on reasonable fees. We could have created a National Digital Library—the twenty-first-century equivalent of the Library of Alexandria. It is too late now. Not only have we failed to realize that possibility, but, even worse, we are allowing a question of public policy—the control of access to information—to be determined by private lawsuit. . .

What will happen if Google favors profitability over access? Nothing, if I read the terms of the settlement correctly. Only the registry, acting for the copyright holders, has the power to force a change in the subscription prices charged by Google, and there is no reason to expect the registry to object if the prices are too high. Google may choose to be generous in it pricing, and I have reason to hope it may do so; but it could also employ a strategy comparable to the one that proved to be so effective in pushing up the price of scholarly journals: first, entice subscribers with low initial rates, and then, once they are hooked, ratchet up the rates as high as the traffic will bear.

Free-market advocates may argue that the market will correct itself. If Google charges too much, customers will cancel their subscriptions, and the price will drop. But there is no direct connection between supply and demand in the mechanism for the institutional licenses envisioned by the settlement. Students, faculty, and patrons of public libraries will not pay for the subscriptions. The payment will come from the libraries; and if the libraries fail to find enough money for the subscription renewals, they may arouse ferocious protests from readers who have become accustomed to Google’s service. In the face of the protests, the libraries probably will cut back on other services, including the acquisition of books, just as they did when publishers ratcheted up the price of periodicals.

Written by Elliott in: Uncategorized |

No Comments »

RSS feed for comments on this post. TrackBack URL

Leave a comment

Powered by WordPress | Aeros Theme | TheBuckmaker.com WordPress Themes