Honestly, I find it wild there aren’t more digital archives. It’s really just the wayback machine?
boostedred on
I’ve used The Wayback machine several times for different use cases. I got a lot of value out of it!
jiggrinder on
Now why would they do that ?
banditta82 on
I know the NYT sells access to its back archive, I wonder what % of the remaining 23 do as well. While I have no love for how the AI companies train their models this reeks of „think of the children“.
EverNeko200 on
What’s the point? AI bots will scrape the content anyways. If not bots, AI companies will just pay users with NYT subscriptions to install extensions that siphon articles from behind paywalls. It would be undetectable and unstoppable.
Maybe they could play the cat and mouse game of obfuscation and passing around PO tokens like YouTube does, but at the end of the day it’s just text on a screen. Way easier to steal than a video.
Blocking Archive.org is a waste of time and energy, because what they’re trying to achieve is futility.
[deleted] on
[deleted]
Ok-Comedian-9377 on
It’s me guys. It’s my fault. I’ve been using the way back machine to go to one page in the NYT for a gumbo recipe. Despite memorizing it, I pull it up all the time since it’s got lots of extra info and I like looking at it. Last week, it was gone. No more access. Denied. I did it one too many times. I knew it. So I had to go find a picture of a screen shot I took years ago and then I printed it out and pasted it on the back of a kitchen cupboard door. Sorry I broke the nyt with my gumbo recipe obsession.
AutistcCuttlefish on
The internet Archive should try to find a way to impose access blocks on journalists that work for organizations that forbid archiving their websites.
If you aren’t gonna contribute to the archive you shouldn’t be allowed to freeload off of it for your fiscal benefit.
Rehcraeser on
They would get sued a lot more if there was a history of all their titles/articles. I’ve witnessed it first hand so many times. They make a crazy claim with clickbait, and change it a few days later. Somehow it’s legal to fix it days later, when nobody will see it, and act like they didn’t just manipulate millions of people. They would probably slip up more often if it was all being tracked.
Individual-Result777 on
Internet archive clones should pop up just to cover the news only. thats doable…
ttystikk on
They didn’t want to be caught lying.
Fine-Procedure-6374 on
this actually kinda scares me… i went down a wayback machine rabbit hole once looking at old sites and it felt like opening a time capsule, it’d suck if that just slowly disappears 😭 makes you realize how much of the internet isn’t really permanent like we think
Shabooopee on
They don’t want history to see their lie and propaganda
No_Sherbert4143 on
Oh no they don’t.
That’s horrible.
K1TSUNE9 on
They don’t want people remembering what lies they posted about the current administration or anything that reflection them. They want people to forget when they remove it from their site.
llamajava on
Can’t we clone it? And then make a clone of the clone as a backup…
whoismico on
This needs to be part of the evolution of libraries; they should be funded to archive digital history and grant access to people through them
Maybe it’s already happening and I’m just unaware, which would be awesome
Leave A Reply
Du musst angemeldet sein, um einen Kommentar abzugeben.
17 Kommentare
Honestly, I find it wild there aren’t more digital archives. It’s really just the wayback machine?
I’ve used The Wayback machine several times for different use cases. I got a lot of value out of it!
Now why would they do that ?
I know the NYT sells access to its back archive, I wonder what % of the remaining 23 do as well. While I have no love for how the AI companies train their models this reeks of „think of the children“.
What’s the point? AI bots will scrape the content anyways. If not bots, AI companies will just pay users with NYT subscriptions to install extensions that siphon articles from behind paywalls. It would be undetectable and unstoppable.
Maybe they could play the cat and mouse game of obfuscation and passing around PO tokens like YouTube does, but at the end of the day it’s just text on a screen. Way easier to steal than a video.
Blocking Archive.org is a waste of time and energy, because what they’re trying to achieve is futility.
[deleted]
It’s me guys. It’s my fault. I’ve been using the way back machine to go to one page in the NYT for a gumbo recipe. Despite memorizing it, I pull it up all the time since it’s got lots of extra info and I like looking at it. Last week, it was gone. No more access. Denied. I did it one too many times. I knew it. So I had to go find a picture of a screen shot I took years ago and then I printed it out and pasted it on the back of a kitchen cupboard door. Sorry I broke the nyt with my gumbo recipe obsession.
The internet Archive should try to find a way to impose access blocks on journalists that work for organizations that forbid archiving their websites.
If you aren’t gonna contribute to the archive you shouldn’t be allowed to freeload off of it for your fiscal benefit.
They would get sued a lot more if there was a history of all their titles/articles. I’ve witnessed it first hand so many times. They make a crazy claim with clickbait, and change it a few days later. Somehow it’s legal to fix it days later, when nobody will see it, and act like they didn’t just manipulate millions of people. They would probably slip up more often if it was all being tracked.
Internet archive clones should pop up just to cover the news only. thats doable…
They didn’t want to be caught lying.
this actually kinda scares me… i went down a wayback machine rabbit hole once looking at old sites and it felt like opening a time capsule, it’d suck if that just slowly disappears 😭 makes you realize how much of the internet isn’t really permanent like we think
They don’t want history to see their lie and propaganda
Oh no they don’t.
That’s horrible.
They don’t want people remembering what lies they posted about the current administration or anything that reflection them. They want people to forget when they remove it from their site.
Can’t we clone it? And then make a clone of the clone as a backup…
This needs to be part of the evolution of libraries; they should be funded to archive digital history and grant access to people through them
Maybe it’s already happening and I’m just unaware, which would be awesome