What if instead of a dead internet we end up with a dark forest internet | Spyke

showerthoughts · Showerthoughts by Aussiemandeus

What if instead of a dead internet we end up with a dark forest internet

We all migrate to smaller websites and try not to post outside them or draw attention, just to hide from the "AI" crawlers. The internet seems dead except for the few pockets we each know exist, away from the clankers.

View original on aussie.zone

393

lemmy.dbzer0.com

I have a testing website. I have never given the address to anyone, ever. It's not linked from anything. It's just a silly HTML site living on a domain.

It's still being pinged and probed to death by bad actors. Not necessarily AI scrapers. But it's dozens or hundreds of HTTP requests a day from random places all over the world.

There's no dark forest. It's all lit up and under constant attack; every tree is already on fire.

220

dual_sport_dork 🐧🗡️ reply

That's because it's numerically possible to sweep through the entire IPv4 address range fairly trivially, especially if you do it in parallel with some kind of botnet, proverbially jiggling the digital door handles of every server in the world to see if any of them happen to be unlocked.

One wonders if switching to purely IPv6 will forestall this somewhat, as the number space is multiple orders of magnitude larger. That's only security through obscurity, though, and it's certain the bots will still find you eventually. Plus, if you have a domain name, the attackers already know where you are: they can just look up your DNS record, which is what DNS records are for.

110

MyNameIsIgglePiggle reply

sh.itjust.works

I like seeing them try and then thinking "begone thot! There is no entry for you"

In fact, I might make a honeypot that issues exactly that

19

WhyJiffie reply

sh.itjust.works

nepenthes is the tool for that

5

early_riser reply

YOU SHALL NOT PASS!

1

kossa reply

But an IP can have multiple websites and even not return anything on plain IP access. How do crawlers find out about domains and unlinked subdomains? Do they even?
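What this describes is name-based virtual hosting: the server picks a site from the HTTP `Host` header, so a request that names only the bare IP can be made to match nothing. A minimal sketch of that dispatch logic follows; the hostnames, the `SITES` table and the `select_site` helper are all hypothetical, not any particular web server's configuration.

```python
from typing import Optional

# Hypothetical mapping of hostnames to site content; the names are
# illustrative only.
SITES = {
    "blog.example.com": "<h1>Blog</h1>",
    "shop.example.com": "<h1>Shop</h1>",
}


def select_site(host_header: Optional[str]) -> Optional[str]:
    """Return the page for a known Host header, or None to stay silent."""
    if not host_header:
        return None
    # Drop an optional ":port" suffix and normalise case before the lookup
    # (good enough for hostnames and IPv4 literals).
    hostname = host_header.rsplit(":", 1)[0].strip().lower()
    return SITES.get(hostname)


if __name__ == "__main__":
    print(select_site("blog.example.com"))  # a known name gets its site
    print(select_site("203.0.113.7"))       # a bare-IP request matches nothing
```

A scanner that connects by IP address typically sends the IP itself as the `Host` value, so in this sketch it gets `None` and the server could simply close the connection. That only helps while the hostnames themselves stay unknown.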

13

Chamomile 🐑 reply

@kossa @dual_sport_dork If you're using HTTPS, which is by and large the norm nowadays, then every domain is going to be trivially discoverable via certificate transparency logs: https://social.cryptography.dog/@ansuz/115592837662781553

19

taaz reply

biglemmowski.win

Thinking about this, wouldn't the best way to hide a modern website be something along the lines of getting a wildcard domain cert (can be done with LE with a DNS challenge), CNAMEing the wildcard to the root domain, and then hosting the website on a random subdomain string? Am I missing something?
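To put a rough number on how guessable such a random subdomain would be, here is a small sketch using Python's `secrets` module. The label length of 20, the alphabet and the `random_label` helper are assumptions for illustration, not part of the proposal itself.

```python
import math
import secrets
import string

# DNS names are case-insensitive, so only lowercase letters and digits count.
ALPHABET = string.ascii_lowercase + string.digits


def random_label(length: int = 20) -> str:
    """Generate a random DNS label from a cryptographically secure source."""
    return "".join(secrets.choice(ALPHABET) for _ in range(length))


def label_entropy_bits(length: int = 20) -> float:
    """Bits of entropy in a label of the given length over ALPHABET."""
    return length * math.log2(len(ALPHABET))


if __name__ == "__main__":
    print(f"{random_label()}.example.com")
    print(f"about {label_entropy_bits():.0f} bits of entropy")
```

A 20-character label over 36 symbols carries a little over 100 bits, which is far beyond blind guessing. The label stays secret only as long as it never shows up in links, shared logs, or DNS traffic that someone else can observe.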

13

confusedpuppy reply

lemmy.dbzer0.com

I do something like this using wildcard certs with Let's Encrypt. Except I go one step further: because my ISP blocks incoming data on common ports, I end up using an uncommon port as well.

I'm not hosting anything important and I don't always need access to it; it's mostly just for fun for myself.

Accessing my site ends up looking like https://randomsubdomain.registered-domain-name.com:4444/

My logs only ever show my own activity. I'm sure there are downsides to using uncommon ports but I mitigate that by adjusting my personal life to not caring about being connected to my stuff at all times.

I get to have my little hobby in my own corner of the internet without the worry of bots or AI.

10

MrPoopyButthole reply

Thanks for the link!

1

simeon reply

Every SSL certificate is publicly logged (you can see these logs, e.g., on crt.sh), and you might be able to read DNS records to find new (sub)domains. The modern internet is too focused on being discoverable and transparent to make hiding an entire service (domain + servers) feasible. But things like example.com/dhusvsuahavag8wjwhsusiajaosbsh are entirely unfindable as long as they are not linked to.
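As a rough illustration of how little effort that kind of discovery takes, the sketch below pulls hostnames out of a crt.sh-style JSON export. The `output=json` query parameter and the `name_value` field are assumptions about crt.sh's unofficial JSON output, and the sample records are made up rather than real log entries.

```python
import json
from typing import List
from urllib.parse import urlencode


def crtsh_query_url(domain: str) -> str:
    """Build a crt.sh search URL for all names under a domain (assumed format)."""
    return "https://crt.sh/?" + urlencode({"q": f"%.{domain}", "output": "json"})


def hostnames_from_ct(raw_json: str) -> List[str]:
    """Collect the unique hostnames listed in crt.sh-style JSON records."""
    names = set()
    for record in json.loads(raw_json):
        # One record may list several names separated by newlines.
        for name in record.get("name_value", "").splitlines():
            names.add(name.strip().lower())
    names.discard("")
    return sorted(names)


# Made-up sample data in the assumed shape, not real log entries.
SAMPLE = json.dumps([
    {"name_value": "example.com\nwww.example.com"},
    {"name_value": "*.example.com"},
    {"name_value": "test.example.com"},
])

if __name__ == "__main__":
    print(crtsh_query_url("example.com"))
    print(hostnames_from_ct(SAMPLE))
```

Note that a wildcard certificate shows up only as `*.example.com`, which is why the wildcard-plus-random-subdomain idea keeps the actual label out of these logs.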

4

kossa reply

Random subdomain on a wildcard certificate, IP written in the hosts file to avoid DNS records, only given out by word of mouth 😅.

Nobody said the uncrawled dark forest would be comfortable.

5

kazaika reply

Servers which are meant to be secure are usually configured not to react to pings and not to give out failure responses to unauthenticated requests. This should be viable for an authenticated-only, walled-garden type of website like OP is suggesting, no?

9

Cooper8 reply

I have suggested a couple of times now that ActivityPub should implement an encryption layer for user authentication of requests and pings. It already has a system for instances vouching for each other. The situation is that users of "walled garden" instances in ActivityPub lack a means of interfacing with public-facing instances that doesn't leave the network open for scraping. I believe a pivot towards a default of serving content to registered users only, built on encrypted handshakes, with the ability for servers to opt in to serving content to unregistered users, would make the whole network much more robust and less dependent on third-party contingencies like CloudFlare.

Then again, maybe I should just be looking for a different network. I'm sure there are services in the blockchain/cryptosphere that take that approach; I just would rather participate in a network built on commons rather than one with financialization at its core. Where is the protocol doing both hardened network and distributed volunteer instances?

1

dual_sport_dork 🐧🗡️ reply

There are several things you could do in that regard, I'm sure. Configure your services to listen only on weird ports, disable ICMP pings, jigger your scripts to return timeouts instead of error messages... Many of which might make your own life difficult, as well.

All of these are also completely counterproductive if you want your hosted service, whatever it is, to be accessible to others. Or maybe not, if you don't. The point is, the bots don't have to find every single web service and site with 100% accuracy. The hackers only have to get lucky once and stumble their way into e.g. someone's unsecured web host where they can push more malware, or a pile of files they can encrypt and demand a ransom, or personal information they can steal, or content they can scrape with their dumb AI, or whatever. But they can keep on trying until the sun burns out basically for free, and you have to stay lucky and under the radar forever.

In my case, just to name an example, I kind of need my site to be accessible to the public at large if I want to, er, actually make any sales.

1

SkyeStarfall reply

lemmy.blahaj.zone

It's not as simple as "only security through obscurity". You could say the same thing for an encryption key of a certain length. The private key to a public key is still technically just an obscurity, but it's still impractical to actually go through the entire range

IPv6 is big enough where this obscurity becomes impractical to sweep. But of course, as you said, there may be other methods of finding your address

5

lauha reply

I love your "multiple orders of magnitude". I don't think you appreciate or realise how much larger the IPv6 address space is :)

2

dual_sport_dork 🐧🗡️ reply

I wasn't going to type that many commas for the sake of brevity, but it's 340,282,366,920,938,463,463,374,607,431,768,211,456 possible addresses. I.e. 2^128^. So yes, I do.

I consider 96 orders (in binary, anyway) as "multiple." Wouldn't you?

8

lauha reply

No need to be defensive. I'm not insulting, I just find it funny :) usually people call that "dozens". But dozens of orders of magnitude really doesn't give the sense of scale.

You could have 8 billion inhabitants around every one of the 10^24 stars in the universe and everyone could still have 42k addresses.
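The figures in this exchange are easy to check with a few lines of integer arithmetic. The sketch below takes the 8 billion people and 10^24 stars from the comment above as given; the star count is a rough estimate, not a measured value.

```python
IPV4_BITS = 32
IPV6_BITS = 128

ipv4_total = 2 ** IPV4_BITS
ipv6_total = 2 ** IPV6_BITS

# "96 orders (in binary)": IPv6 has 2**96 times as many addresses as IPv4.
binary_orders = IPV6_BITS - IPV4_BITS

# The same ratio in decimal orders of magnitude (digit count minus one).
decimal_orders = len(str(ipv6_total // ipv4_total)) - 1

# Addresses per person with 8 billion people around each of 10**24 stars.
people = 8_000_000_000 * 10 ** 24
addresses_per_person = ipv6_total // people

if __name__ == "__main__":
    print(f"{ipv6_total:,} IPv6 addresses")
    print(f"{binary_orders} binary / {decimal_orders} decimal orders more than IPv4")
    print(f"{addresses_per_person:,} addresses per person")
```

Both sides hold up: the gap is 96 binary orders, or 28 decimal ones, and the per-person share lands at roughly 42 thousand addresses.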

0

🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕠𝕕𝕚𝕝𝕖 reply

hilariouschaos.com

If it has a DNS record, it's obviously lit up like a Christmas tree. If it's IPv4, it's on Shodan. The only way to hide is to have no DNS record and an IPv6 address.

8

Aussiemandeus reply

Do you know how they find it? Is it just random input of addresses over and over?

7

dual_sport_dork 🐧🗡️ reply

Almost certainly. There are only 4,294,967,296 possible IPv4 addresses, i.e. 4.3ish billion, which sounds like a lot but in computer terms really isn't. You can scan them in parallel, and if you're an advanced script kiddie you could even exclude ranges that you know belong to unexciting organizations like Google and Microsoft, which are probably not worth spending your time messing with.

If you had a botnet of 8,000 or so devices and employed a probably unrealistically generous timeout of 15 seconds, i.e. four attempts per minute per device, you could scan the entire IPv4 range in just a hair over 93 days and that's before excluding any known pointless address blocks. If you only spent a second on each ping you could do it in about six days.

For the sake of argument, cybercriminals are already operating botnets with upwards of 100,000 compromised machines doing their bidding. That bidding could well be (and probably is) probing random web servers for vulnerabilities. The largest confirmed botnet was the 911 S5 which contained about 19 million devices.
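The back-of-the-envelope estimate in this comment can be reproduced directly. The sketch assumes, as the comment does, one probe at a time per device and no excluded address ranges; `sweep_days` is a hypothetical helper name.

```python
IPV4_ADDRESSES = 2 ** 32  # 4,294,967,296
SECONDS_PER_DAY = 86_400


def sweep_days(devices: int, seconds_per_probe: float) -> float:
    """Days to probe every IPv4 address once, one probe at a time per device."""
    probes_per_device = IPV4_ADDRESSES / devices
    return probes_per_device * seconds_per_probe / SECONDS_PER_DAY


if __name__ == "__main__":
    print(f"8,000 devices, 15 s timeout: {sweep_days(8_000, 15):.1f} days")
    print(f"8,000 devices, 1 s timeout: {sweep_days(8_000, 1):.1f} days")
    print(f"100,000 devices, 1 s timeout: {sweep_days(100_000, 1):.2f} days")
```

With the comment's numbers this comes out at a little over 93 days and a little over 6 days respectively, and a 100,000-device botnet at one second per probe would finish in under half a day.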

43

Melobol reply

That's amazing and scary at the same time. Thanks for putting it into perspective!

16

pipe01 reply

programming.dev

I don't know exactly how they do it, but probing every IPv4 address isn't that hard.

11

kossa reply

But there can be multiple websites behind one IP address?! They would not show up when only accessing the IP. They would need to know about the domains somehow.

3

friend_of_satan reply

7

RememberTheApollo_ reply

I have a DDNS setup. Pretty random site name. Nonetheless, it’s been found and constantly probed. Lots of stuff from Russia, China, a few countries in Africa, and India. A smattering of others, but those are the constant IPs that are probing or attempting logins.

6

Croquette reply

sh.itjust.works

DNS only translates a string address (www.mywebsite.com) to its IP address (xxx.xxx.xxx.xxx) so that it is easier to remember.

Bots just try a range of addresses and they don't need to know your domain name. You could have the most unintelligible domain name in the world; bots would still ping your website because they use IP addresses directly.

4

RememberTheApollo_ reply

Yeah, that’s probably it. Just them spamming different numeric IP addresses to see if any get a hit.

2

taaz reply

biglemmowski.win

crt.sh and certificate transparency

4

Taleya reply

I have a domain that's much the same, and it's frankly hilarious watching people try IIS and PHP exploits over and over. That box technically doesn't even qualify as a LAMP, it's a LA.

3

Fabulous insight. I think that would make me very happy. Bring back the forests! Burn down the Nazi trees!

48

Carnelian reply

That’s not just a fabulous insight, it’s a powerful revelation!

6

SaharaMaleikuhm

How about just living in the actual woods with no internet? Gets more tempting by the day.

40

Aussiemandeus reply

Yeah, but where I live it's too damn hot.

12

PokerChips reply

programming.dev

Actually, there's no woods left.

3

TronBronson reply

Seriously, I live in the woods and people keep moving in and clear cutting. The woods are just suburbia after covid lol

4

Taleya reply

It's all fun and games until bushfire season

1

TronBronson reply

I’m on a hill so I’m waiting for the rains to pick up and carve their house into a gully. It’s basically a future stream now

2

TheBat reply

0

Is this hopeposting ?

31

Aussiemandeus reply

Kinda yeah, it's what I thought lemmy would be, but more and more it isn't

7

Cyberpunk as a literary genre, and the Cyberpunk TTRPG specifically, are incredibly prophetic. In the Cyberpunk TTRPG (which predates the WWW), "the net" is eventually condemned (as in boarded up) because of AI and is replaced by siloed networks (think extended intranets).

27

Cooper8 reply

And of course in the Cyberpunk TTRPG setting, much of the open internet was rendered useless by self-replicating AI malware hijacking storage, processing, and bandwidth due to a zero-day exploit discovered by one egomaniacal hacker.

11

Sounds a bit like computers in Dune as well

5

Well I mean that's kind of what Lemmy is like since it's far more niche than something like reddit, but AI crawlers will find it anyway.

23

mic_check_one_two reply

lemmy.dbzer0.com

AI crawlers don’t even need to crawl individual instances. If someone wanted to scrape Lemmy, it would be way more efficient to simply spin up their own instance and let federation do its thing. Federation is literally a built-in way to mass-distribute content to a bunch of different servers. So just spin up an instance, set it to not respect delete requests (so you still get the deleted posts and comments), and scrape it locally. The entire thing could be set up in like 20 minutes, and it would allow for passive data collection instead of requiring active scrapers that run constantly.

14

I'm sure they'll use instance killing AI scrapers anyway because they don't give a shit.

5

pressanykeynow reply

Efficiency is not what AI is known for.

3

MarriedCavelady50

13

tomcatt360 reply

And don't forget freenet!

4

AI-orchestrated cyber espionage campaign

12

Cooper8 reply

Damn, how was this not big headline news?

4

SaharaMaleikuhm reply

China and Russia do this shit all the time. Now they used a new tool. It's really not big headline news. I'd love for it to be and burst that AI bubble tho.

6

「黃家駒 Wong Ka Kui」(old account, migrated to Piefed)

sh.itjust.works

~~shhh~~ ~~they'll~~ ~~hear~~ ~~you!~~

FUCK WE'RE TOO LATE, YOU ACTIVATED THE BOTS! YOU DOOMED US!

11

Aussiemandeus reply

My bad, I'm sorry

5

Back in the days of dial-up and BBSes this was a problem too: you would still get robots trying to connect to modems by dialing every possible phone number.

11

friend_of_satan reply

3

It's almost time, we're almost back to web-rings.

9

https://sr.ht/~sircmpwn/openring/

Already there!

2

SanctimoniousApe

Isn't that what Tor-based .onion sites are for?

8

Aussiemandeus reply

Unfortunately I've no idea

1

lad reply

programming.dev

That's the point

2

Aussiemandeus reply

Perfect that's what I need

2

TheWeirdestCunt

Ah a fellow spacetime enjoyer

7

Aussiemandeus reply

Space time?

Someone else had this idea before me? Like they say no new ideas haha

1

TheWeirdestCunt reply

Tbh I mixed up a couple of videos I saw in my feed. I was watching a new PBS Space Time video and saw an old Cool Worlds Lab video on the dark forest hypothesis in the sidebar.

3

Too late, it's dead!

7

Aussiemandeus reply

Yeah it certainly is

4

sh.itjust.works

Morpheus, that you?

7

I was thinking the other week about how it's getting to a point that I would consider a membership fee to access something like lemmy but guaranteed no AI or bots or bullshit advertising.

I know it isn't possible, but if it was, I'd pay a small fee to have it.

6

Do you think there will be safe places on the internet?

If it's connected, it's accessible. Won't matter what human level security we put in place when the datacenters these clankers run on have enough GPUs to brute force their way through.

Offline communication will make a resurgence, and will become indispensable when the resource wars the billionaires are funding reach the rest of the world.

6

Quadrexium reply

If I had to guess, maybe everything would become invite-only.

12

pcr3 reply

So like past torrent search providers, like Demonoid?

3

Lag reply

If the person who got invited by the person you invited gets banned, your whole family dies. It's the only way to keep people honest.

6

Aussiemandeus reply

Internet vampires

3

bampop reply

You know, maybe that's not so bad, there would be real world links between users rather than the random collection of absolute strangers you get now. Oh wait no that's just facebook again

1

I would prefer a smaller HUMAN internet, over a bigger AI internet.

6

Aussiemandeus reply

Absolutely

2

I like this metaphor

5

Aussiemandeus reply

Thank you

1

TropicalDingdong

peeks out from behind tree

5

Aussiemandeus reply

1

The last redoubt of mankind against the machines? Let's call it Zion.

5

Aussiemandeus reply

I think you're on to something. Maybe make a movie.

4

How is Gemini faring in the existing bot landscape? Usenet?

4

Disconnection is the only solution, walled gardens, paid or by invite, that prevent all the shit corporate America fills the commons with.

4

Aussiemandeus reply

Paid subs are no good, but invite-only is a good idea. How do you distinguish an invite to a person from an invite to a bot, though?

2

vatlark reply

This is something I'm very curious about. It seems like a really necessary utility in the future.

A way for people to validate other people but not totally blow away all privacy. Large group chats, email providers, etc already try to solve it. It would be cool to see some powerful open source tooling. Like what Signal is to E2E encrypted chats.

2

minorkeys reply

You only invite those you know, personally.

1

Aussiemandeus reply

Just go outside at that point

1

minorkeys reply

I think you can know them personally from online. Ppl you game with or fellow content creators or people in the same hobby spaces etc.

1

There are plenty of alternative Internets. And even just skipping the HTTP protocol is a start. IRC is a great example.

3

I have made a concerted effort over the last two or three years to de-urbanize my online activity. One thing I've noticed is that even small communities are affected by AI. Crawlers and spambots can cause a small site with limited resources to crumple under the weight of nonhuman traffic, a DDoS attack more or less. This makes it hard to self host a community.

While the fediverse helps to some degree it still suffers from copying the format of big social media sites. Lemmy is just a Reddit clone and Mastodon a Twitter clone, so the cultures of these communities mimic those of the big sites they emulate.

2

lemmy.blahaj.zone

Already happened, but the AI companies will try their best to kill that off too.

2

Taleya reply

They don't want to kill it...they're just incapable of invention and innovation, so they push and crowd like parasite universes on the edge of humanity. Like the sea trying to warm itself around a candle.

2

piefed.blahaj.zone

2

I'm ready for the return to webrings.

1

Some sort of F2F network like Retroshare would solve the problem.

0
