Tumblr user? Here's what to know about Tumblr selling your data to OpenAI and Midjourney

  Published: 2024-09-22 01:24:02   Author: 玩站小弟

OpenAI and photo generator Midjourney will soon pay to train their AI models using public Tumblr content, according to internal documents reviewed by the site 404 Media.

404 Media reported that a deal is "imminent" between Tumblr parent company Automattic and the two AI companies, but could not specify what types of data would be sold to each. The deal also reportedly includes the sale of data from WordPress.com, another Automattic property.

Posts detailing how user content is used for AI training were published on Feb. 27 on the staff blogs of both Tumblr and WordPress.com. However, the posts did not tell users that Automattic was in talks to sell that data.

Here's what you need to know about how the sale may affect your Tumblr content.

Which content will Automattic reportedly sell?

404 Media reported that the documents it reviewed did not specify the types of data that would be sold to each company. It is also unclear whether the deal will affect only future posts to Tumblr or whether it covers past content as well. AI companies have been criticized for their rampant use of "publicly available" content to train their models, since much of what is publicly available online is still protected by copyright.

According to a support article on OpenAI's website, "ChatGPT and our other services are developed using information that is publicly available on the internet" among other sources. Ostensibly, OpenAI has already scraped and used any and all content once publicly available on Tumblr. Given that, the current deal could serve as a sort of mea culpa on the part of OpenAI and Midjourney as they offer to pay for the use of all future Tumblr content as well.

Automattic did not respond to requests for comment from 404 Media regarding the deal but posted a statement called "Protecting User Choice" in which the company wrote, "We currently block, by default, major AI platform crawlers—including ones from the biggest tech companies—and update our lists as new ones launch." It is unclear when the site began blocking the crawlers, which is important considering that OpenAI has been training its algorithm on public content for years.

How do I opt out?

To opt out of sharing your public Tumblr content with third parties, you'll need to toggle on a new "Prevent third-party sharing" option in the settings of each individual blog you run. This needs to be done on a web browser, not through the Tumblr app. These updates have been added to Tumblr's support article about user privacy.

If you previously chose to discourage searching of your blog, the new "Prevent third-party sharing" option will already be toggled on by default.

But what if you leave the setting off for now and turn it on three months from now? 404 Media reported that, in a document it accessed from Feb. 23, a Tumblr staff member asked a question addressing this issue. "Do we have assurances," they wrote, "that if a user opts out of their data being shared with third parties that our existing data partners will be notified of such a change and remove their data?"

Automattic’s head of AI, Andrew Spittle, replied, "We will notify existing partners on a regular basis about anyone who's opted out... I want this to be an ongoing process where we regularly advocate for past content to be excluded based on current preferences. We will ask that content be deleted and removed from any future training runs. I believe partners will honor this based on our conversations with them to this point."

Is this normal?

It certainly seems to be, at the very least, the new normal. OpenAI is licensing news stories from the Associated Press and is reportedly in talks to do the same with CNN, Time, and Fox. Reddit is working with Google to monetize its database of content.

It was just a matter of time before Automattic started selling its own data, especially considering how much money it's losing on Tumblr. In its entire 17-year history, the site has never been profitable, and Automattic has failed to turn it around. In November, TechCrunch reported that resources had been diverted from the struggling site to support projects elsewhere within Automattic.
