Skip to content

Better management of proxies#80

Open
Bouki wants to merge 3 commits intoyasserg:masterfrom
Bouki:master
Open

Better management of proxies#80
Bouki wants to merge 3 commits intoyasserg:masterfrom
Bouki:master

Conversation

@Bouki
Copy link
Copy Markdown

@Bouki Bouki commented Jul 9, 2015

Allow using one proxy per fetched page instead of one proxy per seed

@SoulZhong
Copy link
Copy Markdown

How about this commit? Is it accepted?

@mhelf
Copy link
Copy Markdown

mhelf commented Sep 9, 2017

You could just use a local proxy server to handle proxy redirects. That way you can easily achieve proxy rotation without touching crawler4j's code.

Take a look at: https://github.com/adamfisk/LittleProxy and witchChainProxyManager method.

@s17t
Copy link
Copy Markdown
Contributor

s17t commented Sep 11, 2017

A modern crawler should have a proxy rotation feature. I would gladly accept PRs about this as long they comes with some tests around them. @Bouki, please add some some tests.

Another option is to use dedicated external proxy like @helfsoft suggested.

@Bouki
Copy link
Copy Markdown
Author

Bouki commented Sep 11, 2017

Sorry I can't add tests because this commit is 2 years old it certainly needs an update

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants