[Bug]: There is no way to download large files #3092

@AramZS

Browsertrix Version

v1.21.0-2793d2c

What did you expect to happen? What happened instead?

I have a few smaller crawls, between 1 and 3 GB, that download fine, but my crawls above 10 GB are impossible to download from the web UI. Given enough time, the download fails: the server simply stops responding with additional file data. I'm fine using another method, but none of the alternatives is clear to me, and I don't want to delete my archives just so my account can start working again (it is full).
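For anyone attempting the download outside the browser, here is a minimal sketch of a resumable download built on HTTP `Range` requests. It is not a confirmed Browsertrix workaround: it assumes the download URL is reachable with a plain GET and that the server honors `Range`, neither of which this report verifies. The helper names (`resume_offset`, `build_request`, `download_resumable`) and the optional `headers` parameter are hypothetical, introduced only for illustration.

```python
import http.client
import os
import urllib.error
import urllib.request
from typing import Optional

CHUNK_SIZE = 1024 * 1024  # read 1 MiB at a time


def resume_offset(path: str) -> int:
    """Return the number of bytes already saved by an earlier attempt."""
    return os.path.getsize(path) if os.path.exists(path) else 0


def build_request(url: str, offset: int,
                  headers: Optional[dict] = None) -> urllib.request.Request:
    """Build a GET request that asks the server to start at `offset`."""
    req = urllib.request.Request(url, headers=dict(headers or {}))
    if offset > 0:
        req.add_header("Range", f"bytes={offset}-")
    return req


def download_resumable(url: str, dest: str, headers: Optional[dict] = None,
                       max_attempts: int = 20) -> None:
    """Download `url` to `dest`, resuming after each dropped connection.

    Assumption: the server supports Range requests. If it does not, every
    retry restarts from byte 0 and this gives no benefit.
    """
    for attempt in range(1, max_attempts + 1):
        offset = resume_offset(dest)
        try:
            request = build_request(url, offset, headers)
            with urllib.request.urlopen(request, timeout=60) as resp:
                # 206 means the Range header was honored; any other success
                # status means the server is resending the file from byte 0.
                resumed = resp.status == 206
                length = resp.headers.get("Content-Length")
                expected = None
                if length is not None:
                    expected = int(length) + (offset if resumed else 0)
                with open(dest, "ab" if resumed else "wb") as out:
                    while chunk := resp.read(CHUNK_SIZE):
                        out.write(chunk)
            # Without Content-Length completeness cannot be checked, so the
            # end of the stream is trusted.
            if expected is None or resume_offset(dest) >= expected:
                return
            print(f"attempt {attempt} ended early, retrying")
        except urllib.error.HTTPError as exc:
            if exc.code == 416 and offset > 0:
                return  # range not satisfiable: the file on disk is complete
            print(f"attempt {attempt} failed: {exc}")
        except (OSError, http.client.HTTPException) as exc:
            print(f"attempt {attempt} failed: {exc}")
    raise RuntimeError(f"gave up after {max_attempts} attempts")
```

A call would look like `download_resumable(url, "crawl.wacz")`, where `url` is whatever address the download actually comes from and `crawl.wacz` is a placeholder file name. If the endpoint needs authentication, the required headers would have to be passed through `headers`; what those are is not established here.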

Reproduction instructions

  • Run a crawl that collects more than 10 GB of data.
  • Attempt to download it from the web UI.

This was one of my crawl lists. Some of these sites are down now, so I'm not sure you'll get the same results:

https://www.train.org/cdctrain/course/1117398/details
https://www.mdpi.com/1660-4601/11/6/6433
https://www.apha.org/topics-and-issues/climate-health-and-equity/jedi
https://www.cdc.gov/climateandhealth/climate_ready.htm
https://www.apha.org/topics-and-issues/racial-equity/racism-declarations
https://www.pathlms.com/health/courses/64574
https://ephtracking.cdc.gov/Applications/HeatRisk/
https://www.ready.gov/
https://www.cdc.gov/natural-disasters/
https://endingracism.apha.org/
https://nca2023.globalchange.gov/
https://atlas.globalchange.gov/#data
https://atlas.globalchange.gov/#explore
https://nca2018.globalchange.gov/
https://health2016.globalchange.gov/
https://science.nasa.gov/climate-change/

Not run locally.

Screenshots / Video

Other settings:

Image

Environment

macOS Tahoe, Arc (Chrome-based), latest version.

Additional details

No response

Metadata

Assignees: No one assigned
Labels: bug (Something isn't working)
Status: Triage
Milestone: No milestone
Relationships: None yet
Development: No branches or pull requests