Skip to content

Latest commit

 

History

History
43 lines (30 loc) · 1.03 KB

File metadata and controls

43 lines (30 loc) · 1.03 KB

Welcome

POST a new crawl job

curl -X POST -H "Content-Type:application/json" http://localhost:9000 -d '{"seedurl":"http://example.com", "levels": 2}'
  • seedurl: You can specify the url where the crawler will start.
  • levels: You can specify how many levels of recursion the crawler will crawl

Returns JSON doc of the new job:

{
  "id":"2e07fb5e-0615-46f3-aace-0297ed946123",
  "seedurl":"http://carolbhansen.com",
  "levels":2,
  "status":"pending",
  "urls":{}
}

GET status of an existing job

curl http://localhost:9000/status/{jobid}

Returns JSON of job status and number of unique URLs crawled so far:

{
  "status":"complete",
  "unique urls crawled":8
}

GET results of an existing job

curl http://localhost:9000/result/{jobid}

Returns JSON of URLs crawled and count:

{ 
  "https://github.com/gretacb":2,
  "https://www.sram.com/en/sram":2,
  "https://www.mapbox.com/":2,
  "https://www.gatsbyjs.org":2,
  "https://www.netlify.com/":2
}