小程序
传感搜
传感圈

ChatGPT update released as OpenAI launches AI-generated text detection tool

2023-02-02 21:38:44
关注

OpenAI has released a new update for its hugely popular chatbot ChatGPT to ensure it produces more factually accurate responses and to improve its basic mathematics skills. The update comes as the company also published its first “detection tool” to help spot when AI is being used in a piece of text, though this apparently has a low success rate.

OpenAI launched ChatGPT in November 2022 and has been gradually improving it since it was launched. (Photo: Ascannio/Shutterstock)

Though its creators did not expect it to prove popular, within a few days of launching at the end of November last year ChatGPT had passed the million-user mark, and has since become a viral sensation. Since its launch OpenAI has been slowly improving the system, adding new functionality and cleaning up responses to make the chatbot more accurate.

Earlier this month it gave users the ability to stop it from generating a response halfway through if it wasn’t churning out what they expected. It also had the first accuracy boost. Accuracy has been one of the biggest problems facing the chatbot since its launch, with coding site StackOverflow blocking ChatGPT-generated responses as they are often accurate-looking but wrong.

The first round of updates saw technical improvements that reduced the number of times ChatGPT would simply refuse to answer or cut out mid-response. They also placed limits placed on the number of concurrent users to reduce the load on servers. There are still extended periods when users can’t access the system due to it being at capacity.

Data Insights

View All

This latest update was to improve its “factuality and mathematical capabilities”. That was the full extent of the most recent release notes. The team didn’t go into details on how it has improved those features, although ChatGPT has been known to be thrown by some mathematical problems.

Improved maths skills will likely allow it to handle complex calculations and provide more precise answers which would improve its value for professionals using it to generate reports or look for patterns in data. It is also much harder to trick it into giving a “wrong answer” in response to a simple query.

Is ChatGPT preparing for an API?

These gradual updates to the chatbot are likely designed to test and improve its functionality, removing its ability to make damaging or harmful responses, before the ChatGPT API is released by OpenAI. This API will join others from the start-up including image generation through DALL-E 2 and code production through Codex.

When launched the API will be available through OpenAI directly but also on the Microsoft Azure cloud platform. This was announced on the same day Microsoft confirmed a multi-billion dollar investment in the company that will also see ChatGPT integrated into its search engine Bing and other consumer products.

Content from our partners

Sherif Tawfik: The Middle East and Africa are ready to lead on the climate

Sherif Tawfik: The Middle East and Africa are ready to lead on the climate

What to look for in a modern ERP system

What to look for in a modern ERP system

How tech leaders can keep energy costs down and meet efficiency goals

How tech leaders can keep energy costs down and meet efficiency goals

Mike Krause, data science director at AI software company Beyond Limits told Tech Monitor the problem with false information stems from the source material ChatGPT was trained on back in 2021 and as such it “isn’t bound by the structures of factuality, reality or social morality”.

View all newsletters Sign up to our newsletters Data, insights and analysis delivered to you By The Tech Monitor team

Wikipedia was a major source of training data for ChatGPT which is written by everyday people who “can edit the written corpus of encyclopedia knowledge for all humanity and while there are content moderators, they are few and far between, leaving us mostly free to write wildly exaggerated accounts of basically anything we want until it gets flagged,” says Krause.

Despite this problem, OpenAI is improving its chatbot, says Krause. But he adds that “at its heart, it still learns patterns from data it’s fed without any intelligence or knowledge of content, and without any abstraction of data and information into concepts, which is how humans learn and extrapolate”. He says a machine learning model has to be trained to explicitly not discriminate against each group, assuming there are enough unbiased training data sets to make that possible and if it is left wild, without restriction or retraining “there are real consequences for real people in the real world”.

“OpenAI knew this and limited access from the start,” Krause adds. “ChatGPT is super-cool but it’s also capable of creating a high volume of false content automatically and feeding false information campaigns of governments that could influence public opinion, elections, even being used as a reference or source of truth when it is anything but.”

Sanjeev Kumar, VP EMEA at Boost.ai welcomed the most recent update. “However, businesses are still far from being able to use this technology as-is in customer-facing applications,” he warns. “If we expect ChatGPT’s full potential to be useful in an enterprise setting, it’s not enough to even have 99% accuracy as any slip-up could lead to possible liability concerns. It will be necessary to regularly curate and verify the sources of information that the model is connected to, in order to ensure it is both reliable and accurate.”

OpenAI launches AI content detection tool

As the factual accuracy of ChatGPT improves, so will its use as a tool for purposes both good and bad. There is evidence of hackers using it to generate malware and better-targeted phishing emails, as well as students making liberal use of the chatbot to write essays for them. These examples have inspired efforts to create detection tools.

There are some independent tools, such as the open-source GPTZero, which are designed to spot content generated by the chatbot, and OpenAI itself is experimenting with ways to watermark text generated by GPT-3 to make detection easier in the future.

In the meantime, the company is working on training a text classifier that can distinguish between text written by a human and that from an AI, and it works independently of the provider but its accuracy isn’t great at the moment, only correctly identifying about 26% of AI-written text as “likely AI-written”.

“We’re making this classifier publicly available to get feedback on whether imperfect tools like this one are useful. Our work on the detection of AI-generated text will continue, and we hope to share improved methods in the future,” OpenAI wrote.

“While it is impossible to reliably detect all AI-written text, we believe good classifiers can inform mitigations for false claims that AI-generated text was written by a human: for example, running automated misinformation campaigns, using AI tools for academic dishonesty, and positioning an AI chatbot as a human.”

Read more: Meet the large language AI models competing with ChatGPT

Topics in this article : ChatGPT , OpenAI

参考译文
随着OpenAI推出人工智能生成文本检测工具,ChatGPT更新发布
OpenAI 为其广受欢迎的聊天机器人 ChatGPT 推出了新更新,以确保生成的回答更加事实准确,并提升其基本的数学能力。此次更新正值该公司发布了其首个“检测工具”,以帮助识别文本中是否使用了 AI,不过该工具目前的成功率似乎较低。OpenAI 于 2022 年 11 月推出了 ChatGPT,并自发布以来一直在逐步改进其功能。(照片:Ascannio/Shutterstock)尽管其开发团队并未预料到它会如此受欢迎,但在去年 11 月下旬推出后,短短几天内,ChatGPT 的用户数量就突破了百万大关,此后迅速成为网络热门现象。自推出以来,OpenAI 一直在逐步改进系统,添加新功能并优化回复内容,以使聊天机器人更加准确。本月早些时候,它新增了让用户在生成回复中途停止的功能,如果生成的内容不符合预期,可随时中止。同时,它也迎来了首次准确度提升。准确度一直是 ChatGPT 自推出以来面临的主要问题之一。技术问答网站 StackOverflow 已经屏蔽了 ChatGPT 生成的回复,因为这些回复往往看起来准确无误,实则错误百出。首批更新带来了技术上的改进,减少了 ChatGPT 一味拒绝回答或中途截断回复的次数。他们还对同时访问系统的用户数量设置了限制,以减轻服务器的负担。尽管如此,目前仍存在用户因系统容量达到上限而无法访问的长时间情况。**数据洞察 查看全部**此次最新更新的目标是提升其“事实性和数学能力”。这是目前更新说明中的全部内容。团队并未详细说明如何提升这些功能,但 ChatGPT 在某些数学问题上确实存在挑战。数学能力的提升将有助于处理更复杂的计算,并提供更精确的答案,从而提升其对专业人士用于生成报告或查找数据模式的价值。同时,它也更难被欺骗,从而给出错误答案来回应简单问题。ChatGPT 正在为 API 做准备?这些渐进式的更新可能是为了测试和改进聊天机器人的功能,在 ChatGPT 的 API 发布之前,剔除其生成有害或危险回复的可能。该 API 将加入 OpenAI 已有的其他 API,例如通过 DALL-E 2 进行图像生成,以及通过 Codex 进行代码生成。在发布后,该 API 将直接通过 OpenAI 提供,同时也在微软 Azure 云平台上提供。这一消息发布于同一天,微软也确认了对 OpenAI 的数十亿美元投资,并将 ChatGPT 集成到其搜索引擎 Bing 及其他消费级产品中。**来自我们的合作伙伴**Sherif Tawfik:中东和非洲已准备好在气候变化方面引领潮流 现代 ERP 系统应具备哪些要素? 科技领袖如何保持能耗低、实现效率目标 AI 软件公司 Beyond Limits 的数据科学总监 Mike Krause 告诉 Tech Monitor,虚假信息的问题源于 ChatGPT 所训练的数据源,这些数据来自 2021 年,因此它“并不受事实性、现实性或社会道德结构的约束”。 **查看所有新闻通讯** 订阅我们的新闻通讯 数据、洞察和分析直接送达您的邮箱 由 Tech Monitor 团队提供 **[此处订阅]**维基百科是 ChatGPT 的重要训练数据来源,而维基百科由普通人撰写,“他们可以编辑全人类的百科知识文本,虽然存在内容审核员,但人数极少,因此我们几乎可以随意夸大任何我们想写的内容,直到被标记为止”,Krause 说道。尽管存在这些问题,Krause 表示,OpenAI 仍在改进其聊天机器人。但他同时指出:“从根本上说,它依然只是从所接收的数据中学习模式,没有任何智能或对内容的理解能力,也没有将数据和信息抽象为概念的能力,而这就是人类学习和推理的方式。”他说,机器学习模型必须明确地被训练以不针对任何群体进行歧视,前提是存在足够无偏的训练数据集来实现这一目标。如果模型未加限制和再训练,“那么在现实世界中,将对现实中的真实人群造成真实影响。”Krause 补充道:“OpenAI 一开始就意识到这一点,并从一开始就限制了访问。ChatGPT 确实很酷,但它也具备自动制造大量虚假内容的能力,并可能被政府用于影响公众舆论、选举,甚至被当作参考或事实来源,尽管它根本不是。”Boost.ai 的欧洲、中东及非洲副总裁 Sanjeev Kumar 对最近的更新表示欢迎,但他警告道:“然而,企业仍远未达到能够直接在面向客户的应用程序中使用这项技术的程度。”“如果我们希望 ChatGPT 在企业环境中发挥全部潜力,即使达到 99% 的准确率也不足够,因为哪怕有细微的错误,也可能带来法律责任。必须定期对模型所连接的信息来源进行整理和验证,以确保其可靠和准确。”**OpenAI 推出 AI 内容检测工具** 随着 ChatGPT 的事实准确性不断提升,它被用于良性和恶意目的的可能性也在增加。有证据表明黑客正在利用它来生成恶意软件和更精准的钓鱼邮件,而学生也大量使用聊天机器人代写论文。这些例子促使人们努力开发检测工具。目前已有独立工具,如开源的 GPTZero,旨在识别由聊天机器人生成的内容。OpenAI 本身也在尝试为 GPT-3 生成的文本添加水印,以便未来更容易检测。与此同时,该公司正在训练一个文本分类器,用以区分人类和 AI 所写的文本。该工具不依赖于提供方,但目前其准确性并不高,仅能正确识别约 26% 的 AI 生成文本为“可能由 AI 生成”。OpenAI 表示:“我们正在公开发布这个分类器,以获取反馈,了解此类不完美的工具是否具有实际用途。我们对检测 AI 生成文本的工作仍将继续进行,我们希望在未来分享更先进的方法。”“虽然无法可靠地检测所有 AI 生成的文本,但我们相信优秀的分类器可以有助于减轻 AI 生成文本被错误归因于人类的情况:例如,自动运行虚假宣传运动、在学术上不诚实使用 AI 工具,或将 AI 聊天机器人伪装成人类。”**阅读更多:与 ChatGPT 竞争的大型语言 AI 模型** **本文主题:ChatGPT,OpenAI**
您觉得本篇内容如何
评分

评论

您需要登录才可以回复|注册

提交评论

广告

techmonitor

这家伙很懒,什么描述也没留下

关注

点击进入下一篇

重磅来袭!2023第十一届深圳电子信息展览会

提取码
复制提取码
点击跳转至百度网盘