Google is in the process of scanning 40,000 Harvard library books, which will be made available for browsing on the internet. Harvard is taking part in Google Print’s pilot program, which also includes collaborations with the University of Michigan, Oxford University, Stanford University, and the New York Public Library. If successful, Harvard may opt to digitize all 15 million of its volumes.
In December, Google Project Manager Adam Smith said that books in the public domain will be fully available online so researchers can use Google Print in place of a trip to the library. He added that much smaller excerpts of copyrighted works will be displayed.
While Google may upload some of Harvard’s copyrighted works, they will not yet be displayed, HUL Director of Publications and Communications Peter Kosewski told The Crimson at the time.
But Sally Morris, chief executive of the Association of Learned and Professional Society Publishers—an international association of over 300 not-for-profit publishers—wrote in an e-mail that even with these safeguards, she and the publishers she represents object to the project, which plans to digitize copyrighted books.
“The law does not permit wholesale copying (which is what digitisation is) by a commercial organisation of works that are still in copyright,” she wrote. “It is also illegal to make those works available digitally once they have been copied.”
Morris wrote that Google needs to obtain permission from publishers before using their work. While she wrote that it may be impractical to ask every publisher, Google should ask permission through collective licensing organizations.
But Jonathan Zittrain, faculty co-director of Harvard Law School’s Berkman Center for Internet and Society, wrote in an e-mail that he believes the pilot project is not a copyright infringement.
“This is what fair use is designed for,” he wrote. “By showing only snippets, the market for the books themselves is not harmed.”
Zittrain added that publishing industry practice “has become extremely cautious and that quite often firms ask each other for permission.”
A Google spokesperson said that Google intends to work directly with publishers but did not speak directly to the copyright concerns raised by publishers.
Morris also said Harvard might not be able to discern which books are copyrighted because of varying copyright regulations in different countries.
In the U.S., works copyrighted before 1923 are generally in the public domain, while in many other countries, the copyright period is determined by the number of years after the author’s death, according to HUL Associate Director for Planning and Systems Dale Flecker.
As a result, “Google will likely do something very conservative initially,” he wrote in an e-mail, adding that technological advances may make determining the copyright status of books easier in the future.
Allan Adler, vice president for legal and government affairs at the Association of American Publishers—the principal trade association of the book publishing industry with 310 members, including the Harvard University Press—raised additional concerns. He said he is concerned that Google is digitizing the works without knowing how Harvard will restrict access to the digital copies.
Adler said digitization may create “an ease of reproduction and distribution of copyrighted material” if Harvard allows its students internet access to the digital copies.