HN2new | past | comments | ask | show | jobs | submitlogin

I can't understand, is storing pure text too expensive for IMDB which is owned by Amazon which runs half the servers of the planet? Why don't they keep the threads while disabling new discussions?


There is a comment on Reddit saying that the ArchiveTeam is working on archiving them:

https://www.reddit.com/r/scifi/comments/5s0iiv/slug/ddbugy5


Thanks for the link. Following the link it appears that the db can is available for download from these two ftp sites :

ftp.fu-berlin.de (Germany) ftp.funet.fi (Finland)

PS: I have not verified them yet.


Unfortunately, that's just the DB itself, it doesn't include the boards at all.


Does this not run afoul of IP laws?


Archiveteam does not give a flying fuck about some corporation's IP if it means that people's creations can be saved for them and future generations.

archive.org, were the data usually ends up, is protected by the law.


Archiveteam does not give a flying fuck about some corporation's IP if it means that people's creations can be saved for them and future generations.

Future generations are likely to be disappointed already, since it seems, sadly, that IMDB has had a rolling expiry of message posts.

http://www.imdb.com/help/show_leaf?boardswhydeleted

If you are certain that the post has been deleted and you didn't do it, then there are two possible reasons: the message has been removed by a moderator or it has expired. Our boards are not designed to store messages indefinitely: after a certain amount of time older threads are removed from the board automatically. This process is called "expiry".

The rate of expiry is adjusted dynamically according to the posting traffic for each board. Threads are removed sooner from a board that attracts a lot of posts than they would be from a board with less activity.

Please keep in mind that the boards system is not meant to be an archival medium, and all posts are subject to expire and disappear sooner or later. This is intended to allow us to manage the resources consumed by each board effectively, and also to promote the ever changing flow of discussion.


I've battled spam at another film site. If my experience is anything to go by, even mentioning the word "spam" is enough to make smoke come out of the operators' ears. They tried to delete the spam and abuse when it was posted, but some slipped through and leads to new support requests and new complaints even now.

Sorry if that sounded exasperated.


Yeah but his point is it's one thing to disable new posts or comments, but just archive what's there instead of removing it.


And my answer is that there's a stream of new complaints even about what's there.


You can say that about large parts of the Internet, so it's kind of moot, and not unlike throwing out the baby with the bathwater etc.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: