Microsoft Probing If DeepSeek-Linked Group Improperly Obtained OpenAI Data
https://www.bloomberg.com/news/articles/2025-01-29/microsoft-probing-if-deepseek-linked-group-improperly-obtained-openai-data
Comments (70)
The irony is overwhelming
An irony curtain.
What, you mean like Microsoft, uh, OpenAI did?
Yep, NOW it's a problem, though! Because it's someone else doing the same thing, someone who isn't part of the human centipede starting at Trump's colon.
Chinese company:
Truly, you have a dizzying intellect.
Microsoft:
AND I'M JUST GETTING STARTED! Where was I?
Chinese company:
Stealing data....
we stole it fair and square
It’s no crime to steal from a thief.
How could it be better when they just stole everything? The fact that it's better basically proves that it's not stolen.
Stealing from thieves isn't a crime.
Especially not when China turns around and Robin Hoods it back to the world.
Just saying.
Making R1 open source really makes it such a big FU to all the grifters asking for billions for AI in the US. Especially funny because High-Flyer is a hedge fund itself. The AI race should be determined only by what you do with it, not by protecting how much IP you hoovered up and then crying about others copying it.
“You can’t steal that public data! We stole it first!”
And considering that’s exactly what Microsoft did to Apple with point and click, what irony!
They both stole point and click from Xerox if my memory serves me correctly
Actually, it was invented by Douglas Engelbart at the Stanford Research Institute in the 60s
https://dougengelbart.org/content/view/162/000/
Xerox (re)made it for the PC in the 80s.
Apple did pay Xerox for it if I'm remembering right
Yep, Apple paid with shares (more specifically, the right to buy $1 million worth at the initial share price), which, according to a share calculator I just tried, would be worth nearly $328 million these days. I wonder if Xerox kept them or offloaded them early.
Considering Xerox was utterly uninterested in any of the tech they had, it's worked out well.
Oh really? Rabbit hole unlocked
https://www.youtube.com/watch?v=UFcb-XF1RPQ
The relevant part of Pirates of Silicon Valley. After which you should watch the whole thing. It’s fan fiction, but it’s the best explanation of what happened between Apple and Microsoft leading into the 1990s.
They didn’t steal it from Smith & Wesson?
What data? The data OpenAI illegally obtained first?!
I’m sure that now that OpenAI accuses DeepSeek of stealing, they will prove that they have the rights to the things being stolen, right? XD
Somebody better call the WAHMBULANCE!
Surely they'd like some cheese to go with that whine?
Are they worried that DeepSeek took stuff written by others, mixed it up, and repackaged it as its own?
Well, yeah, that's all AI is. An expensive weighted pachinko machine, that uses human made content, and remixes it.
The question isn't whether they've used the same information. It's whether they've faked the process to achieve that 20x efficiency.
Look at it like a dictionary. Writing one from scratch is a huge task, no matter how many other books exist. How do you even go about finding all of the words?
But if other people have already written dictionaries, you can just use their word lists and go from there.
It's more efficient, but only because it's a completely different task.
No AI company has ever made any of their own content to train their models, they took what others created, remixed it, and presented it as something new.
This AI model did the same thing.
AI lost its job to AI.
Yes, but that doesn't mean it is more efficient, which is what the whole thing is about.
Let's pretend we're not talking about AI, but tuna fishing. OpenTuna is sending hundreds of ships to the ocean to go fishing. It's extremely expensive, but it gets results.
If another fish distributor shows up out of nowhere selling tuna for 1/10 the price, it would be amazing. But if you found out that they could sell them cheap because they were stealing the fish from OpenTuna warehouses, you wouldn't argue that the secret to catching fish going forward is theft and stop building boats.
Yes, I would.
So what happens when OpenTuna runs out of fish to steal and there are no more boats?
Information doesn't stop being created. AI models need to be constantly trained and updated with new information. One of the biggest issues with GPT-3 was the 2021 knowledge cutoff.
Let's pretend you're building a legal analysis AI tool that scrapes the web for information on local, state, and federal law in the US. If your model was from January 2008 and was never updated, then gay marriage wouldn't be legal in the US, the ACA wouldn't exist, Super PACs would be illegal, the Consumer Financial Protection Bureau wouldn't exist, zoning ordinances in pretty much every city would be out of date, and openly carrying a handgun in Texas would get you jail time.
It would essentially be a useless tool, and copying that old training data wouldn't make a better product no matter how cheap it was to do.
Once tuna runs out, and we run out of boats?
Maybe we then stop destroying the tuna population?
Or, to bring this back to point: the environment will be better off once the AI bubble collapses.
That's a very important, but entirely separate conversation.
Is it worth it? Let me work it I put my thing down, flip it and reverse it
What’s the game plan if they did?
Trade restrictions?
China already proved those did fuck all to stop them from developing their own model.
Ducking knew this AI bubble would burst sooner or later, just glad we can finally get on with it now.
I ducking knew it too, I've been along for the ride though. The models still do have some niche applications where they're actually useful.
This whole thing with OpenAI and Microsoft whinging about fair play is truly laughable though. What clowns.
As a side note, it took a few tries to write ducking, my keyboard kept correcting it to fucking. We're definitely 2 different people. Lol.
So that means that Microsoft will pay compensation to us, right?
Article from the FT: https://www.ft.com/content/a0dfedd1-5255-4fa9-8ccc-1fe01de87ea6
https://archive.is/D9whR
Looks like Microsoft is bracing for today’s earnings call
Lol it's like fucking Lavrov from fucking Russia screaming "this is against international law" when Europe froze their assets.
In Brazil, there's a rhymed saying: "ladrão que rouba ladrão tem 100 anos de perdão", it translates to "a thief that steals from a thief has 100 years of forgiveness"
It's a common proverb in Portuguese, not just in Brazil.
When a writer copies someone else's work without citation or compensation, it's called "plagiarism." But when an AI does it, it's called "LLM training."
Unless that AI is not OpenAI, then it's "plagiarism" still.
When a reader reads someone else’s work that’s called “reading”. But when an AI does it, it’s called “training”.
When you can't beat em, sue em. It's the American way.
"Tom, if irony was strawberries we'd all be drinking smoothies right now"
Microsoft Probing If DeepSeek-Linked Group Improperly Obtained data the same way OpenAI did. --FTFY
What the fuck is Microsoft getting involved for?! Maybe concentrate on not providing shitty fucking software fuck heads!
They have a large stake in OpenAI, last I checked.
Oh fair, I didn't know that. But still, fuck Microsoft.
OMG Competition!
QUICK, they're a foreign threat! They're coming right for us!!!!
Maybe they could buy a spare one from Alibaba. Maybe that would help.
https://www.reuters.com/technology/artificial-intelligence/alibaba-releases-ai-model-it-claims-surpasses-deepseek-v3-2025-01-29/
"Waaaaah" you say?
lol, I love it. I'm thinking about paying for DeepSeek even though I hate AI bullshit, just to spite all the panicking AI tech scammers. This has seriously made my week, the amount of copium they are inhaling is insanely funny :D
Use DuckDuckGo AI Chat: https://duck.ai/
Isn't the OpenAI one they offer the same one as the one provided at https://chatgpt.com/ without login? So probably something not as impactful.
Or do they share their unlimited subscription?
If I had money to spend, I would get a ChatGPT subscription since they lose money for every account.
The whole startup industry relies on investors to cover its costs for years while it operates at a loss, in order to obtain a bigger market share. Look at Netflix, Facebook, WhatsApp, etc.
So by buying an account, you are increasing their market share.
But feel free to use Mistral, DeepSeek, etc. That would be better.
Just rent some server space instead and run your open-weights model of choice
You have to use it a lot. From what I can tell, that's their problem: they priced unlimited access low based on some numbers they pulled out of their arse, and then were all shocked Pikachu face when people used it an unlimited amount.
I had a subscription but I barely used it, maybe twice a day with no complex stuff. I don’t get how it’s possible to lose money on users like me. I finally cancelled because of the price.
They don't lose money on users like you.
So the suggestion from @[email protected] will just increase their revenue 🙂
That's only for the $200 one, and if you use it constantly, no?