Skip to content

Q: effort towards making IA "distributed"? #383

@drok

Description

@drok

Hello,

Background

I am an engineer and have some time and software/systems development skill to volunteer, as well as a small amount of reliable datacenter hosting (unused bandwidth and storage on VPS's I use for other projects).

I came upon this project on October 12, 2024, which will likely be known in the future as the day/week when the Internet Archive was knocked offline by DDOS.

I came here looking to continue research while the IA is down, as my current research heavily relies on IA. If there were a way to reach the documents from another source than directly IA's hard-drives, eg, IPFS, I could continue my work.

Question 1

Is there any effort ongoing, desirable or welcome towards building a "distributed" mirror of the Internet Archive?

I am not intimately familiar with this project, or with IPFS, but I have the skills and ability to work on integrating a peer-to-peer distributed filesystem.

I have noticed that a lot of effort has been made with internetarchive/dweb-transports@b93c251 and and related IPFS/WebTorrent initiatives, but a distributed Internet Archive mirror remains elusive to mere mortals. Thus, my question remains, is there still desire to achieve a distributed IA?

Question 2

I naively (curl -o- -L https://unpkg.com/@internetarchive/dweb-mirror/install.sh | bash) attempted to install the project on Ubuntu as per [INSTALLATION.md] and this failed. I took note that this project could use a user friendly installation package (a "one-click" way to access the data).

I could work on the accessibility of the project, too, if that were desirable/welcome. Is it?

The simplest (from the end user's perspective) installer I envision would be a web page that implements a WebWorker/ServiceWorker that implements this project and uses the browser's local storage as a cache.

The second simplest way would be the traditional app installer that most computer users are familiar with. It would not be limited to Pi users as it seems to be today, nor to computer savvy people. Anyone who has the minimal skill to install Chrome should be able to install this program.

As above, I noticed some effort has already been made, but it's not clear that there is still interest/support from IA for it.

Conclusion.

I want to help make the data within the Internet Archive more accessible, even the https://archive.org site is offline for any reason.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions