Business

Is OpenAI using your content without permission?

The number of organizations accusing OpenAI of stealing their work continues to grow like extra patties on burger, with a prominent news organization now joining the fray with its own set of claims against the Microsoft-backed artificial intelligence startup.

In a lawsuit filed against OpenAI, the Center for Investigative Reporting, the oldest nonprofit newsroom in the US, claims the ChatGPT maker used its investigative journalism to train and enhance its generative AI product without permission or compensation.

It’s a tale as old as time.

Ever since ChatGPT hit the scene, different quarters of the internet have been raising alarm bells over the data used to train generative AI, often, without permission. You’ve got artistsmusic labelsauthors, heck, even programmers, who have either sued or complained against the company for allegedly using their work to build ChatGPT and its derivatives.

“This free rider behavior is not only unfair, it is a violation of copyright,” Monika Bauerlein, CEO of the Center for Investigative Reporting, said in a statement.

Free rider behavior is perhaps the best way to describe what companies developing AI are doing.

Take Meta, for example. The social media giant admitted to using users’ Facebook and Instagram posts to develop an AI assistant. Meanwhile, ChatGPT has been found to produce verbatim paragraphs from novels, complete verbatim copies of poems, and even articles from The New York Times!

In fact, CopyLeaks estimates that nearly 60% of the responses provided by GPT-3.5 (which is the model behind ChatGPT) contain some form of plagiarized content, the Center for Investigative Reporting says.

Grim, isn’t it?

At this point, the entire output of humanity, creative or otherwise, is apparently a valid target for AI companies. The question then is, are gen AI companies just profiteering off of our work? Evidence seems to suggest so.

Reddit, for example, has already struck deal with both OpenAI and Google to let them use content from its platform to make their AI products better. There’s an age old adage: the rich get richer, while the poor get poorer. That seems to fit with Reddit’s partnership with OpenAI and Google, as the company will earn millions of dollars off of the deals but will likely never share its earnings with the users whose posts are gobbled up by OpenAI and Google to fine tune their AI models.

OpenAI also has similar arrangements with the Associated PressAxel Springer, and TIME magazine to use up journalists’ work to (probably) make ChatGPT even better. Other tech companies probably have something lined up with major publications as well.

This means that people who create will be left to do the heavy lifting while some tech bro is going to feed all that raw material to produce more powerful generative AI products, likely without permission or compensation.

The Center for Investigative Reporting is one of a handful of organizations that have taken OpenAI to court, joining the likes of The New York Times and others like it for allegedly infringing on its copyrights.

Suing OpenAI is not cheap, though. As The Verge reports, The NYT has raked up $1 million in legal costs during Q1 after it began its legal action, and there’s no telling how long this entire saga will play out — assuming both parties don’t end up settling out of court.

However, the case(s) are perhaps significant in that they could determine how AI operates within the bounds of copyright. Until then, I guess OpenAI is going to be sailing the high seas. 🏴‍☠️ 🏴‍☠️ 🏴‍☠️ ☠️☠️☠️ #IYKYK 😉

OpenAI backer Microsoft topped HackerNoon’s Tech Company Rankings this week.


In Other News.. 📰

  • Crypto Industry Is About to Boom, Is Outperforming the Internet: Architect Partners — via CoinDesk
  • Figma disables its AI design feature that appeared to be ripping off Apple’s Weather app — via TechCrunch
  • Meta accused of breaking European law with its ‘pay or consent’ model — via CNN
  • OnlyFans vows it’s a safe space. Predators are exploiting kids there. — via Reuters
  • Meta’s Threads turns one, has more than 175 million active users — via Axios
  • China’s BYD is set to take Tesla’s crown as the world’s No. 1 producer of battery electric vehicles — via CNBC

And that’s a wrap! Don’t forget to share this newsletter with your family and friends

See y’all next week. PEACE! ☮️


This article was originally published by Sheharyar Khan on HackerNoon.

HackerNoon

Recent Posts

Top 15 LatAm tech journalists and editors of 2024

Latin America’s tech industry is booming, with innovative new startups popping up across the region.…

54 mins ago

G20 announces initiative to crackdown on climate change disinformation

The Global Initiative for Information Integrity on Climate Change claims to 'safeguard those reporting on…

3 hours ago

How GPUs, widely used in gaming, are helping doctors get better look inside us

In the late 19th Century, physicians began inserting hollow tubes equipped with small lights into…

13 hours ago

Top Five Trends Shaping Gaming in 2025

This year wasn’t exactly what the video gaming industry expected — it declined by 7%…

2 days ago

Why data flywheels are the key to sustainable growth in 2025 

By Oren Askarov, Growth & Operations Marketing Director at SQream Becoming “data-driven” has become a…

2 days ago

Swiss-based Horasis to host its Asia Meeting in Dubai, United Arab Emirates 

Horasis Asia Meeting, led by German entrepreneur Frank Jurgen-Richter, will take place this year on the…

5 days ago