Skip to content

Conversation

@zejn
Copy link

@zejn zejn commented Oct 27, 2012

Sometimes a page has POST based navigation (yes, sadly this happens) and there isn't an URL where you could point pjscrape to go. In this case you need to start at a specific URL and navigate by issuing click events on certain DOM elements to get to the desired page. And this is something a scraping tool such as pjscrape, which runs in the browser, can really do well.

This pull request implements this functionality via two properties on pjs.suite. The "nextPage" is a function which determines and triggers an event, guiding browser to next page. It returns true when next page was requested and false otherwise. The other property is "maxDepth" which determines how many times can next page be requested (useful for example for scraping POST based pagination, but no more than N pages). Included test demonstrates the functionality.

Gasper Zejn added 4 commits March 27, 2012 09:13
@zejn
Copy link
Author

zejn commented Nov 15, 2012

Ping? Comments?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant