Skip to content

The results of dead urls 2023/09/28 #1444

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

toomore
Copy link

@toomore toomore commented Sep 28, 2023

After running ./scripts/prune-dead-urls.py for 5 hours, I obtained the results for dead URLs. What should I do next steps?

I believe I can verify the URLs manually from ./lists/tw.csv (where I am located) by checking their format or determining if the websites have permanently moved to new locations. Recently, we started the project to update the list of URLs for OONI in Taiwan community.

Thanks!

Signed-off-by: Toomore Chiang (ocf.tw) <[email protected]>
@sloncocs
Copy link
Collaborator

Hi @toomore!

Thanks so much for running the script! we also have Gardener script which detects dead URLs in the test lists. Probably worth comparing the results, for the TW list it detects 19 dead URLs.

For the tw.csv, could you please review the URLs which were detected as dead to check the following:

  1. Is the formatting of the tested URL right and is this URL indeed dead?
  2. Does the organisation which used this domain still exist and operate?
    2.1. If yes, what domain does it use? Please add the new domain to the test list.
    2.2. If no, what is the reason why it stopped operating? If there is a chance that the organisation will become active again in the next 2-3 years, please leave the URL in the list.

For other than Taiwanese test lists, we are asking local organisations and researchers to review them one at a time. We avoid deleting URLs identified as 'dead' without a review because sometimes these URLs pertain to media or political organisations which still exist but for some reason changed their domain addresses.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants