Skip to content

Parameters to change the way we download sites? #56

@steveharman

Description

@steveharman

Hi Ahmed,

I didn't know how best to explain this so I've added a screenshot in the hope that a picture really is worth a thousand words. :-)

I have an old Mac app called SiteSucker which downloads websites. I'd prefer to move to your Website Downloader tool, but for some reason, I get "different" results from Website Downloader. I'm wondering if this is down to a wget parameter?

  • On the left of the screenshot is the result of downloading my WordPress site with SiteSucker. I have a single index.html at the root of the download and all is well.
  • On the right of the screenshot is the result of downloading my WordPress site with Website Downloader - the same index.html exists but in addition a number of other index.html.... files all suffixed with parameters. Eg; index.html?p=26446.html. These files are other "pages" on my site and blog posts.

Having "multiple index.html" files causes issues when deploying my site to Netlify, where it doesn't seem to recognize the solitary index.html as my index and instead picks one of the others.

I could always remove the offending files and manually update various links in my index.html to point to the correct locations, but if there was a way to download in "the same way as SiteSucker does", that would be fantastic.

Does any of that make sense? :-)

Thanks,

Steve

wsd_png

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions