Microsoft Probing If DeepSeek-Linked Group Improperly Obtained OpenAI Data

The irony is overwhelming

248

leisesprecher reply

feddit.org

An irony curtain.

Onno (VK6FLAB)

lemmy.radio

What, you mean like Microsoft, uh, OpenAI did?

172

latenightnoir reply

Yep, NOW it's a problem, though! Because it's someone else doing the same thing, someone who isn't part of the human centipede starting at Trump's colon.

121

Konomi

lemmy.blahaj.zone

152

TransplantedSconie reply

Chinese company:

Truly, you have a dizzling intellect.

Microsoft:

AND IM JUST GETTING STARTED! Where was I?

Chinese company:

Stealing data....

MolecularCactus1324

118

metaStatic reply

kbin.earth

we stole it fair and square

cyrano

It’s no crime to steal from a thief.

unexposedhazard reply

discuss.tchncs.de

How could it be better when they just stole everything? The fact that its better basically proves that its not stolen.

RangerJosey

lemmy.ml

Stealing from thieves isn't a crime.

Especially not when China turns around and Robin Hoods it back to the world.

Just saying.

sunzu2 reply

thebrainbin.org

LappingDog reply

Making R1 open source really makes it such a big FU to all the grifters asking for billions for AI in the us. Especially funny because high-flyer is a hedge fund firm themselves. The ai race should only be determined by what you do with it, not protecting how much IP you hoovered up and are now trying to cry about it being copied by others.

AeronMelon

“You can’t steal that public data! We stole it first!”

And considering that’s exactly what Microsoft did to Apple with point and click, what irony!

FartVentriloquist69 reply

https://dougengelbart.org/content/view/162/000/

They both stole point and click from Xerox if my memory serves me correctly

leftytighty reply

slrpnk.net

jadelord reply

discuss.tchncs.de

Actually, it was invented by Douglas Engelbart in Stanford in the 60s

Xerox (re)made it for the PC in the 80s.

leftytighty reply

slrpnk.net

NotSteve_ reply

lemmy.ca

Apple did pay Xerox for it if I'm remembering right

The_Decryptor reply

aussie.zone

Yep, Apple paid with shares (More specifically, the right to buy $1 million dollars worth at the initial share price) which, according to a share calculator I just tried, would be worth nearly $328 million these days, I wonder if Xerox kept them or offloaded them early.

Considering Xerox was utterly uninterested in any of the tech they had, it's worked out well.

moving to lemme.zip. reply

Oh really? **Rabbit hole unlocked

AeronMelon reply

https://www.youtube.com/watch?v=UFcb-XF1RPQ

The relevant part of Pirates of Silicon Valley. After which you should watch the whole thing. It’s fan fiction, but it’s the best explanation of what happened between Apple and Microsoft leading into the 1990s.

Oaksey reply

They didn’t steal it from Smith & Wesson?

Jhex

What data? they one OpenAI illegally obtained first?!

misk

sopuli.xyz

I’m sure now that OpenAI accuses DeepSeek of stealing they will now prove that they have rights to things that are being stolen, right? XD

demizerone

Somebody better call the WAHMBULANCE!

potoooooooo 🥔 reply

Surely they'd like some cheese to go with that whine?

ubergeek

Are they worried that deepsink too stuff written by others, mixed it up, and repackaged it as it's own?

Well, yeah, that's all AI is. An expensive weighted pachinko machine, that uses human made content, and remixes it.

chiliedogg reply

The question isn't whether they've used the same information. It's whether they've faked the process to achieve that 20x efficiency.

Look at it like a dictionary. Writing one from scratch is a huge task, no matter how many other books exist. How do you even go about finding all of the words?

But if other people have already written dictionaries, you can just use their word lists and go from there.

It's more efficient, but only because it's a completely different task.

ubergeek reply

No AI company has ever made any of their own content to train their models, they took what others created, remixed it, and presented it as something new.

This AI model did the same thing.

AI lost its job to AI.

chiliedogg reply

Yes, but that doesn't mean it is more efficient, which is what the whole thing is about.

Let's pretend we're not talking about AI, but tuna fishing. OpenTuna is sending hundreds of ships to the ocean to go fishing. It's extremely expensive, but it gets results.

If another fish distributor shows up out of nowhere selling tuna for 1/10 the price, it would be amazing. But if you found out that they could sell them cheap because they were stealing the fish from OpenTuna warehouses, you wouldn't argue that the secret to catching fish going forward is theft and stop building boats.

ubergeek reply

Yes, I would.

chiliedogg reply

So what happens when OpenTuna runs out of fish to steal and there are no more boats?

Information doesn't stop being created. AI models need to be constantly trained and updated with new information. One of the biggest issues with GPT3 was the 2021 knowledge cutoff.

Let's pretend you're building a legal analysis AI tool that scrapes the web for information on local, state, and federal law in the US. If your model was from January 2008 and was never updated, then gay marriage wouldn't be legal in the US, the ACA wouldn't exist, Super PACs would be illegal, the Consumer Financial Protection Bureau wouldn't exist, zoning ordinances in pretty much every city would be out of date, and openly carrying a handgun in Texas would get you jailtime.

It would essentially be a useless tool, and copying that old training data wouldn't make a better product no matter how cheap it was to do.

ubergeek reply

Once tuna runs out, and we run out of boats?

Maybe we then stop destroying the tuna population?

Or, to bring this back to point: the environment will be better off once the AI bubble collapses.

chiliedogg reply

That's a very important, but entirely separate conversation.

No_Ones_Slick_Like_Gaston reply

Is it worth it? Let me work it I put my thing down, flip it and reverse it

What’s the game plan if they did?

Trade restrictions?

China already proved those did fuck all to stop them from developing their own model.

Ducking knew this ai bubble would burst sooner or later, just glad we can finally get on with it now.

ɔiƚoxɘup reply

infosec.pub

I ducking knew it too, I've been a long for the ride though. The models still do have some niche applications where they're actually useful.

This whole thing with OpenAI and Microsoft whinging about fair play is truly laughable though. What clowns.

As a side note, it took a few tries to write ducking, my keyboard kept correcting it to fucking. We're definitely 2 different people. Lol.

rumba reply

Name the company involved a military asset
Forbid US companies from hosting the models
Pressure foreign companies to not work with them

rhabarba

feddit.org

So that means that Microsoft will pay compensation to us, right?

cyrano

Article from ft https://www.ft.com/content/a0dfedd1-5255-4fa9-8ccc-1fe01de87ea6

https://archive.is/D9whR

Looks like Microsoft is bracing for today’s earnings call

daddy32

Lol its like fucking lavrov from fucking russia screaming "this is against international law" when Europe froze their assets.

sunzu2 reply

thebrainbin.org

Erasmus

I Cast Fist reply

In Brazil, there's a rhymed saying: "ladrão que rouba ladrão tem 100 anos de perdão", it translates to "a thief that steals from a thief has 100 years of forgiveness"

spirinolas reply

It's a common proverb in Portuguese, not just in Brazil.

NutWrench

When a writer copies someone else's work without cites or compensation, it's called "plagiarism." But when an AI does it, it's called "LLM training."

𝕸𝖔𝖘𝖘 reply

infosec.pub

Unless that AI is not OpenAI, then it's "plagiarism" still.

intensely_human reply

When a reader reads someone else’s work that’s called “reading”. But when an AI does it, it’s called “training”.

ChowJeeBai

When you can't beat em, sue em. It's the American way.

MegaUltraChicken

"Tom, if irony was strawberries we'd all be drinking smoothies right now"

rumba

Microsoft Probing If DeepSeek-Linked Group Improperly Obtained data the same way OpenAI did. --FTFY

Deepus

What the fuck is Microsoft getting involved for?! Maybe concentrate on not providing shitty fucking software fuck heads!

TheRealKuni reply

They have a large stake in OpenAI, last I checked.

Deepus reply

Oh fair, i didnt know that. But still, fuck Microsoft.

rumba

OMG Competition!

QUICK, they're a foreign threat! They're coming right for us!!!!

spearz

https://www.reuters.com/technology/artificial-intelligence/alibaba-releases-ai-model-it-claims-surpasses-deepseek-v3-2025-01-29/

Maybe they could buy a spare one from AliBaba. Maybe that would help..

BrianTheeBiscuiteer

"Waaaaah" you say?

shirro

aussie.zone

Mikina

lol, I love it. I'm thinking about paying for DeepSeek even though I hate AI bullshit, just to spite all the panicking AI tech scammers. This has seriously made my week, the amount of copium they are inhaling is insanely funny :D

-11

cyrano reply

Use duck duck go ai chat https://duck.ai/

Mikina reply

Isn't the OpenAI one they offer the same one as the one provided at https://chatgpt.com/ without login? So probably something not as impactful.

Or do they share their unlimited subscription?

heavydust reply

If I had money to spend, I would get a ChatGPT subscription since they lose money for every account.

-5

filister reply

The whole startup industry rely on investors to cover for their costs for years, while they work on a loss, in order to obtain a bigger market share. Look at Netflix, Facebook, WhatsApp, etc.

So buying an account you are increasing their market share.

But feel free to use Mistral, Deepseek, etc. that would be better

photonic_sorcerer reply

Just rent some server space instead and run your open-weights model of choice

peto (he/him) reply

You have to use it a lot. From what I can tell that's their problem, they priced unlimited access low based on some numbers they pulled out their arse and then were all shocked Pikachu face when people used it and unlimited amount.

Brumefey reply

I had a subscription but I barely used it, maybe twice a day with no complex stuff. I don’t get how it’s possible to lose money on users like me. I finally cancelled because of the price.

jonathan reply

They don't lose money on users like you.

Brumefey reply

So the suggestion from @[email protected] will just increase their revenue 🙂

Mikina reply