
It seems to me like you want an offline cache. How can something local deal with searching the content of a possibly deleted page?


Local cache would definitely be the (easiest) way of solving it. Tools like Memex [0] are most of the way there.

But a text-only copy on my local device isn't great if the content relied on special formatting for its presentation. It also misses out on images and embedded videos. That's where something like ArchiveBox [1] comes in.

> ArchiveBox takes a list of website URLs you want to archive, and creates a local, static, browsable HTML clone of the content from those websites (it saves HTML, JS, media files, PDFs, images and more).

But really what I'd like to see at some point is an opt-in community tool where every page I visit that fits certain criteria (URL, topic, a special mark by me, etc.) is fully cloned and uploaded to IPFS [2] for anyone interested in that topic to find and use later, regardless of what happens to the source content. There's definitely a host of legal issues around this, but it's not impossible.
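
To make the "fits certain criteria" part concrete, here is a rough sketch of how the opt-in filter might look. Everything in it (ArchiveRule, Visit, should_archive, the field names) is made up for illustration and is not part of Memex, ArchiveBox, or IPFS. It only decides whether a visited page qualifies; the actual cloning and upload are left out.

```python
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class ArchiveRule:
    """Hypothetical opt-in criteria: domains and topic keywords (lowercase)."""
    domains: set[str] = field(default_factory=set)
    keywords: set[str] = field(default_factory=set)


@dataclass
class Visit:
    """Hypothetical record of one page visit."""
    url: str
    text: str             # extracted page text
    marked: bool = False  # the "special mark by me"


def should_archive(visit: Visit, rule: ArchiveRule) -> bool:
    """Return True if the visit matches any of the opt-in criteria."""
    # A manual mark always wins.
    if visit.marked:
        return True

    # URL criterion: the host is a listed domain or a subdomain of one.
    host = (urlparse(visit.url).hostname or "").lower()
    if any(host == d or host.endswith("." + d) for d in rule.domains):
        return True

    # Topic criterion: naive whitespace-split keyword match on the page text.
    words = set(visit.text.lower().split())
    return bool(words & rule.keywords)


rule = ArchiveRule(domains={"example.org"}, keywords={"ipfs"})
print(should_archive(Visit("https://blog.example.org/post", "hello"), rule))
print(should_archive(Visit("https://other.net/", "nothing relevant"), rule))
```

A real tool would need a much better topic match than splitting on whitespace, but the shape is the same: a cheap local check on every visit, with the expensive clone-and-publish step running only for pages that pass.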

0 - https://worldbrain.io/

1 - https://github.com/pirate/ArchiveBox

2 - https://ipfs.io/



