Skip to content

Pull requests: yasserg/crawler4j

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

learning github
#481 opened Mar 22, 2026 by anilarelli5-gif Loading…
added disregarded protocols
#446 opened May 31, 2020 by mihalispap Loading…
Selenium basic integration
#444 opened May 10, 2020 by dgoiko Loading…
added custom html content filter
#168 opened Nov 7, 2016 by pdesmet Loading…
Began work on an asynchronous crawling
#157 opened Sep 8, 2016 by lostmsu Loading…
maxPagesToFetch bug
#155 opened Aug 9, 2016 by cmacdonald Loading…
BerkeleyDB to Redis migration
#145 opened Jul 15, 2016 by christophe-pietquin Loading… 4.5.0
allow parsing script tag and other html tags
#114 opened Feb 10, 2016 by code-911 Loading…
Canonical URL meta tag handling and AJAX crawling
#82 opened Jul 15, 2015 by EgbertW Contributor Loading…
Better management of proxies
#80 opened Jul 9, 2015 by Bouki Loading…
Feature: seed tracking
#63 opened May 20, 2015 by EgbertW Contributor Loading…
Allow negative priorities
#61 opened May 20, 2015 by EgbertW Contributor Loading…
Allow to differentiate between queue sizes in Frontier
#60 opened May 20, 2015 by EgbertW Contributor Loading…
Also notify waiting lists when a single URL has been schedules, instead
#59 opened May 20, 2015 by EgbertW Contributor Loading…
Improved delay handling
#57 opened May 20, 2015 by EgbertW Contributor Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.