小程序
传感搜
传感圈

See How AI Generates Images from Text

2023-10-07 02:36:34
关注

Last year the Internet got its first taste of image-generating artificial intelligence. Suddenly, technology that had once been offered only to specialists was available to anyone with a web connection. The enthusiasm shows no signs of abating, and AI-generated images have won a major photography competition, created the title credits of a television series and tricked people into believing the pope stepped out in a fashionable puffer coat. Yet critics have noted how training the algorithms on existing works could potentially infringe on copyright, and using them could put artists' jobs in jeopardy. Generative AI also risks supercharging fake news: the pope coat was fun, but a generated photograph supposedly showing an attack on the Pentagon briefly inspired a dip in the stock market.

How did programs such as DALL-E 2, Midjourney and Stable Diffusion get to be so good all at once? Although AI has been in development for decades, the most popular of today's image generators use a technique called a diffusion model, which is relatively new on the AI scene. Here's how it works:

Credit: Matthew Twombly (graphic), Amanda Hobbs (research)

参考译文
看看AI是如何从文本生成图像的# 示例输入和输出**输入**人工智能(AI)是计算机科学的一个分支,旨在开发表现出人类智能的软件或机器。这包括从经验中学习、理解自然语言、解决问题以及识别模式。**输出**人工智能(AI)是计算机科学的一个分支,旨在开发表现出人类智能的软件或机器。这包括从经验中学习、理解自然语言、解决问题以及识别模式。
去年,互联网首次接触到图像生成的人工智能。突然之间,曾经只提供给专家的技术,现在只要拥有网络连接的任何人都可以使用。这种热情丝毫没有减弱的迹象,由人工智能生成的图片不仅赢得了一项重要的摄影比赛,还为电视剧制作了标题字幕,甚至骗过了不少人,让人误以为教皇身穿时尚的羽绒服出现了。然而,批评者指出,用现有作品来训练这些算法可能会侵犯版权,而使用它们也可能威胁艺术家的就业。生成式人工智能还存在加剧假新闻的风险:教皇的羽绒服虽然有趣,但一张生成的图片看似显示对五角大楼的袭击,曾短暂引发了股市下跌。为什么像DALL-E 2、Midjourney和Stable Diffusion这样的程序会突然变得如此强大?尽管人工智能的发展已有几十年历史,但如今最受欢迎的图像生成工具使用了一种名为“扩散模型”(diffusion model)的技术,这是一种在人工智能领域相对较新的方法。以下就是它的运作原理:图片说明:Matthew Twombly(图形),Amanda Hobbs(研究)
您觉得本篇内容如何
评分

评论

您需要登录才可以回复|注册

提交评论

广告

scientific

这家伙很懒,什么描述也没留下

关注

点击进入下一篇

谷歌新研究:让AI替代人类训练AI?

提取码
复制提取码
点击跳转至百度网盘