Is OpenAI using your content without permission?

The number of organizations accusing OpenAI of stealing their work continues to grow like extra patties on a burger, with a prominent news organization now joining the fray with its own set of claims against the Microsoft-backed artificial intelligence startup.

In a lawsuit filed against OpenAI, the Center for Investigative Reporting, the oldest nonprofit newsroom in the US, claims the ChatGPT maker used its investigative journalism to train and enhance its generative AI product without permission or compensation.

It’s a tale as old as time.

Ever since ChatGPT hit the scene, different quarters of the internet have been sounding the alarm over the data used to train generative AI, often without permission. You’ve got artists, music labels, authors, heck, even programmers, who have either sued or complained about the company for allegedly using their work to build ChatGPT and its derivatives.

“This free rider behavior is not only unfair, it is a violation of copyright,” Monika Bauerlein, CEO of the Center for Investigative Reporting, said in a statement.

Free rider behavior is perhaps the best way to describe what companies developing AI are doing.

Take Meta, for example. The social media giant admitted to using users’ Facebook and Instagram posts to develop an AI assistant. Meanwhile, ChatGPT has been found to produce verbatim paragraphs from novels, complete verbatim copies of poems, and even articles from The New York Times!

In fact, CopyLeaks estimates that nearly 60% of the responses provided by GPT-3.5 (which is the model behind ChatGPT) contain some form of plagiarized content, the Center for Investigative Reporting says.

Grim, isn’t it?

At this point, the entire output of humanity, creative or otherwise, is apparently a valid target for AI companies. The question then is, are gen AI companies just profiteering off of our work? Evidence seems to suggest so.

Reddit, for example, has already struck a deal with both OpenAI and Google to let them use content from its platform to make their AI products better. There’s an age-old adage: the rich get richer, while the poor get poorer. That seems to fit with Reddit’s partnership with OpenAI and Google, as the company will earn millions of dollars off of the deals but will likely never share its earnings with the users whose posts are gobbled up by OpenAI and Google to fine-tune their AI models.

OpenAI also has similar arrangements with the Associated Press, Axel Springer, and TIME magazine to use journalists’ work to (probably) make ChatGPT even better. Other tech companies probably have something lined up with major publications as well.

This means that people who create will be left to do the heavy lifting while some tech bro is going to feed all that raw material to produce more powerful generative AI products, likely without permission or compensation.

The Center for Investigative Reporting is one of a handful of organizations that have taken OpenAI to court for allegedly infringing on their copyrights, joining the likes of The New York Times.

Suing OpenAI is not cheap, though. As The Verge reports, The NYT racked up $1 million in legal costs during Q1 after it began its legal action, and there’s no telling how long this entire saga will play out, assuming both parties don’t end up settling out of court.

However, the case(s) are perhaps significant in that they could determine how AI operates within the bounds of copyright. Until then, I guess OpenAI is going to be sailing the high seas. 🏴‍☠️ 🏴‍☠️ 🏴‍☠️ ☠️☠️☠️ #IYKYK 😉

OpenAI backer Microsoft topped HackerNoon’s Tech Company Rankings this week.


In Other News.. 📰

  • Crypto Industry Is About to Boom, Is Outperforming the Internet: Architect Partners — via CoinDesk
  • Figma disables its AI design feature that appeared to be ripping off Apple’s Weather app — via TechCrunch
  • Meta accused of breaking European law with its ‘pay or consent’ model — via CNN
  • OnlyFans vows it’s a safe space. Predators are exploiting kids there. — via Reuters
  • Meta’s Threads turns one, has more than 175 million active users — via Axios
  • China’s BYD is set to take Tesla’s crown as the world’s No. 1 producer of battery electric vehicles — via CNBC

And that’s a wrap! Don’t forget to share this newsletter with your family and friends.

See y’all next week. PEACE! ☮️


This article was originally published by Sheharyar Khan on HackerNoon.
