小程序
传感搜
传感圈

A Simple Replication Agreement Could Improve Trust in Science

2023-07-11 22:29:06
关注

Daphne, an eager computer-vision scientist, encounters a novel study highly relevant to her autonomous vehicles research. The study uses computational methods to reveal hidden driving patterns. Aware of autonomous driving’s safety implications, Daphne decides to act responsibly and replicate the study to verify its findings prior to relying on them.  

After carefully reading the paper’s methodology, however, Daphne realizes that the original researchers provided no access to the code and only partial access to the data sets they used in their study. Daphne emails Sasha, the corresponding author of the original study, and asks for assistance but gets no response. Daphne now faces a serious dilemma: conducting an independent study from scratch that requires tremendous resources, or relying on the existing study, although it is irreplicable.

Today, computational science, powered by complex models and vast data sets, drives research and innovation across almost every scholarly arena. Daphne’s illustrative story highlights a real challenge scientists face in replicating studies, and underscores the importance of enabling access to code, data and other information. Promoting replicability of scientific studies benefits the scientific community and fosters public trust in science.

Replication is vital to science, seen as fundamental by investigators as far back as the pioneering 17th-century chemist Robert Boyle. It ensures that published findings are valid. If other scientists can repeat the study and get the same results, we can trust the findings. If not, it's a red flag that something might be off. Science is cumulative, and replications build its comprehensive and credible body.

Unfortunately, the scientific community has struggled with a “replication crisis” in recent decades, where scientists in fields from economics to physics find it hard to reproduce results from published studies. In fact, some estimates of the severity of the crisis in specific fields, such as biology and psychology, found that a significant portion of scientific publications in these disciplines stand in question.

One of the major challenges driving the replication crisis is that scientists often do not share all information needed to replicate their work. Access to research materials is especially crucial for the replication of computational studies, given the increasing utilization of computational methods and the data-reliant nature of such studies on large data sets. Unfortunately, it is far from guaranteed.

There are many reasons why. Sometimes academic considerations, such as avoiding criticism, fear of retraction in case a mistake is revealed, and avoiding “scooping,” may drive the decision. In other cases, as the boundary between academic and applied research blurs, commercial considerations keep replication materials private to preserve the prospect of their commercialization. This creates a culture of secrecy, which goes against the fundamental values of openness and sharing in the scientific community.

Intellectual property (IP) law plays a significant role in creating a culture of secrecy in science. Patents and trade secrets secure the economic potential held in research and development (R&D). Patent law does so by providing exclusive rights to inventions. Yet a patented invention must be novel and nonobvious, which encourages inventors to conceal information (at least until they file a patent application). This can lead them to limit access to replication materials to maximize their chances of obtaining patent protection. The mechanism by which trade secrets contribute to nonsharing norms is more straightforward: trade-secret law can essentially protect any kind of information as long as it is kept secret.

Thus, while encouraging innovation, IP rights can also discourage sharing of research materials and exacerbate the replication crisis. Nevertheless, these rights matter to researchers, organizations and commercial firms, and benefit them (and the public that enjoys their efforts) in multiple ways, from the financial to the reputational.

To address this conflict, we propose a new policy instrument that could facilitate studies’ replicability without depriving scientists of their IP protection: the conditional access agreement (CAA). In short, the CAA establishes a private, controlled channel of communication for the transfer of replication materials between authors and replicators. This allows for on-demand replicability while maintaining the proprietary potential of a scientific study.

Under the CAA mechanism, when submitting a paper for publication, an author would execute an agreement with the journal, pledging to provide full access to replication materials upon demand by other researchers. The agreement would specify that anyone requesting access to the materials can only obtain it upon signing a nondisclosure agreement (NDA). The NDA would prohibit the use of the replication materials delivered by the original authors for any purpose other than replication. Since patent law and trade-secret law bar protection only in the case of public disclosure, information shared privately under the NDA would not nullify the possibility of obtaining patent protection, nor would it negate trade secrecy. The CAA policy is feasible thanks to the involvement of scientific journals as powerful intermediaries in the scientific ecosystem. Journals are the gatekeepers of responsible science, and as such, they are continuously involved in a publication’s life cycle, from submission to post-publication. A CAA policy also aligns with journals’ mission of promoting rigorous and credible science. Notably, the CAA policy imposes minimal costs on researchers, replicators and journals, as the infrastructures for such a mechanism already exist, including repositories of data and online manuscript submission systems for embedding the CAA.

Faced with a replication crisis, we believe that replication agreements facilitated by journals can improve science. Pursuing this path will enhance the replicability of scientific research and increase public trust in science.

This is an opinion and analysis article, and the views expressed by the author or authors are not necessarily those of Scientific American.

参考译文
一项简单的复制协议可以提升对科学的信任
达芙妮是一位热忱的计算机视觉科学家,她发现了一项对她的自动驾驶汽车研究高度相关的新型研究。该研究利用计算方法揭示了隐藏的驾驶模式。鉴于自动驾驶对安全性的影响,达芙妮决定负责任地行动,在依赖这些成果之前先复制这项研究,以验证其发现。然而,在仔细阅读了论文的方法部分后,达芙妮意识到原研究人员并未提供代码访问权限,仅提供了研究中使用数据集的部分访问权限。达芙妮写邮件给原研究的通讯作者萨莎,请求帮助,但没有收到回复。达芙妮现在面临一个艰难的抉择:从零开始进行一项独立研究,这需要巨大的资源;或者依赖现有研究,尽管它无法复制。如今,计算科学依靠复杂的模型和庞大的数据集,推动着几乎所有学术领域的研究与创新。达芙妮的这个例子凸显了科学家在复制研究时所面临的现实挑战,并强调了提供代码、数据和其他信息获取通道的重要性。促进科学研究的可复制性有利于科学界,也有助于公众对科学的信任。复制对于科学来说至关重要,早在17世纪的开创性化学家罗伯特·波义耳时就已被视为基础。它确保了发表的成果是真实有效的。如果其他科学家可以重复研究并得出相同的结果,我们就可以信任这些发现。如果不能,则可能是某种问题的警示信号。科学是积累发展的,而复制研究则构建了其全面而可信的体系。不幸的是,近年来科学界一直面临“复制危机”,从经济学到物理学等各个领域的科学家都发现难以复现已发表研究的结果。事实上,一些特定领域的研究复制危机的严重性评估发现,生物学和心理学等学科中,有相当一部分科学出版物正受到质疑。导致复制危机的主要挑战之一是科学家们往往不分享复制其研究所需的所有信息。对于计算研究的复制而言,获取研究材料尤其关键,因为计算方法的日益使用,以及这些研究对大数据集的依赖性。遗憾的是,这种获取并不总是有保障的。其中有很多原因。有时,学术因素,比如避免批评、担心出错后被撤稿,以及避免被他人抢先发表成果,都会驱使科学家做出这样的决定。在其他情况下,随着学术研究和应用研究之间的界限变得模糊,商业因素可能使复制材料保持私有,以确保其商业化潜力。这催生了一种保密文化,而这与科学界开放与共享的根本价值观背道而驰。知识产权(IP)法在科学中培养保密文化方面起到了重要作用。专利和商业秘密保护了研发成果中所蕴含的经济潜力。专利法通过提供排他性的发明权来实现这一目的。然而,一项可获得专利的发明必须具有新颖性和非显而易见性,这促使发明人隐瞒信息(至少在他们提交专利申请之前)。这可能导致他们限制访问复制材料,以最大化获得专利保护的机会。商业秘密对不分享研究材料规范的影响机制更为直接:商业秘密法在信息保密的前提下,几乎可以保护任何类型的信息。因此,尽管知识产权制度在鼓励创新方面发挥着作用,但也可能抑制研究材料的分享,从而加剧复制危机。尽管如此,这些权利对研究人员、组织和商业公司都很重要,它们从财务到声誉等多个方面,为他们(以及享受其成果的公众)带来益处。为了解决这种冲突,我们提出了一种新的政策措施,既能促进研究的可复制性,又不会剥夺科学家的知识产权保护:即“有条件访问协议”(CAA)。简而言y,CAA为作者与复现者之间提供了一个私下且受控的复制材料传递通信渠道。这可以在保持研究专有潜力的同时,实现按需复制。在CAA机制下,当提交论文发表时,作者将与期刊签署一项协议,承诺在其他研究人员提出要求时,提供完整的复制材料。该协议将规定,任何请求访问材料的人,必须签署一份保密协议(NDA)后才能获得材料。NDA禁止将原作者提供的复制材料用于复制以外的任何目的。由于专利法和商业秘密法仅在信息被公开披露的情况下才禁止保护,因此在NDA下私下共享的信息不会削弱获得专利保护的可能性,也不会影响商业秘密的保护性。CAA政策的可行性得益于科学期刊在科学生态系统中作为强大中介的作用。期刊是负责任的科学的守门人,因此它们在出版物的整个生命周期中都持续参与,从投稿到发布后。CAA政策也与期刊推动严谨和可信科学的使命相一致。值得注意的是,CAA政策对研究人员、复现者和期刊的成本都很小,因为这种机制所需的基础设施已经存在,包括数据存储库和嵌入CAA的在线论文提交系统。面对复制危机,我们相信期刊推动的复制协议可以改善科学。沿着这条道路前进,将增强科学研究的可复制性,并增强公众对科学的信任。这是一篇观点与分析文章,作者的见解不一定代表《科学美国人》的观点。
您觉得本篇内容如何
评分

评论

您需要登录才可以回复|注册

提交评论

广告
提取码
复制提取码
点击跳转至百度网盘