O'Reilly Network    
 Published on O'Reilly Network (http://www.oreillynet.com/)
 See this if you're having trouble printing code examples

The Open Informatics Petition

by Jason E. Stewart and Harry Mangalam

Editor's note: For an opposing viewpoint, read Why I'm Not Supporting the Open Informatics Petition, by Andrew Dalke.

Jason E. Stewart and Harry Mangalam have created an Internet-available petition to require that software resulting from publicly funded research be made open source. The authors feel that if the public pays for the research, it should have access to the results of that research. In this article, they present their case, review objections and apparent conflicts with the Bayh-Dole act, and try to resolve remaining issues. Links to ongoing discussions and other useful online articles are also included.

Opening Shots

In writing a paper on analytical software for large-scale gene expression, we were struck by the wide range of licenses used by software packages produced by different academic groups. They ranged from strict GPL to executable-only, which was available after signed licenses were faxed back directly from the office of a high university official. As we explored further we were soon drawn into the vortex of the Bayh-Dole Act1 and Technology Transfer Offices. It was also at this time that Steven Brenner was discovering that it was illegal for an employee of the University of California to release his or her work as open source without first clearing it with the Technology Transfer Office. This apparent schism between the assumed ability of academics to freely share the results of their work with others and the UC employment contract was eventually amicably settled, but not without effort. This prohibition on the voluntary release of academic software was one of the points that sparked the creation of the OpenInformatics.org petition site as an adjunct to the article2.

The Open Informatics petition can be viewed online in all its warty splendor, but briefly, it requests that:

When money from public research grants is used to develop software, that software should be published under an open source or a free software license, as a condition of funding. Such licensing is the software equivalent of peer-reviewed publication of research results.

The two key points being that public money should fund public software, and that scientific software should be subject to peer review. In a scientific endeavor, we make a hypothesis and provide evidence to support or discredit the hypothesis. In reporting the evidence, we publish the data in as raw a form as is possible so that others can examine and critique it; this is how science moves forward. We submit that like other "Materials and Methods," the source code of software involved in arriving at a decision should also be published.

To follow are some explicit advantages to publishing source code:

O'Reilly Bioinformatics Technology Conference

Jason E. Stewart will be presenting G2G: A Peer-to-Peer Architecture for Gene Expression Data. He is also leading a Birds-of-a-Feather session on the Open Informatics petition, all at the upcoming O'Reilly Bioinformatics Technology Conference.

Our goals in writing the petition were two-fold: we wanted to educate researchers and funding agencies about open source, and we wanted to start a broader discussion about public funding, software licensing, and Bayh-Dole. Ultimately, we hope the petition will encourage public-funding agencies to create a joint policy defining how they intend to handle open source software.

The Open Informatics petition gained more visibility when both Science and the Associated Press covered the petition. It was then the subject of some spirited discussion on both Slashdot and the O'Reilly Bioinformatics list. We had naively expected nothing but unswerving support and unbending devotion, but a number of objections were raised, some of which we describe below.

Controversy Rages

Comment on this articleDo you think all code generated by publicly funded research should be licensed as open source?
Post your comments or read what others have to say

The contention surrounding our petition involves only a few major issues and a number of minor ones. The major objections are:

While we were initially a little surprised at the criticism, we were glad to have the feedback. Some of the criticism was (and still is) deserved. Other objections we think are off the mark, but most of the criticism caused us to re-examine our initial position and make explicit some of the cloudy or implicit language of the original phrasing.

Many of these points and others are addressed in the Petition FAQ, which is continuing to grow and now provides a reasonable overview of some of the issues involved.

Moving Forward

The petition is not intended as a final policy document. On the contrary, we raise a number of issues that require discussion within a larger forum. The area which needs the most discussion concerns software license details. For example, the petition only indicates that software be published using either an open source or free software license. Which licenses should be allowed? Should agencies choose a single license for all software, and if so, which ones? Or should authors be allowed to choose from a set of approved licenses, and if so, who chooses which licenses get approved, and what are the criteria that should be used to decide? The FSF lists four while the OSI lists nine possible considerations.

We are not wedded to either open source or free software, but chose them because they each satisfy the four basic criteria listed in the free software definition. Perhaps there are criteria that are unique to scientific software, which the FSF and OSI did not consider. Because of these wide-open issues we encourage everyone who is interested to participate in the ongoing discussion.

Further Discussion

We've only begun to scratch the surface of this important topic. Please visit the OpenInformatics.org Web site to read more about the petition. Also, we want to hear more discussion around the issues raised, so we encourage you to join the petition discussion list or to go to the upcoming O'Reilly Bioinformatics Technology conference and attend the panel discussion on public funding and open source, and the Birds-of-a-Feather session for the OpenInformatics.org petition.


Bayh-Dole Act1
The relationship of public funding of research and the Bayh-Dole Act's implications of allowing the exclusive privatization of publicly funded resources is too large an issue to cover here in the depth that it needs to be. We can however point to several other online articles that cover it in more detail, both favorable (UC Office of Technology Transfer and 21stC) and unfavorable (AlterNet.org and the Atlantic).

Salon article2
The situation is not limited to universities. A recent Salon article discusses how it took years for researchers at various national laboratories to obtain permission to release software as open source.

Copyright © 2009 O'Reilly Media, Inc.