Show and tell: LinkedIn Jobs Scraper built with Crawlee — 45K results, no auth #3566
MarcinDudekDev started this conversation in Show and tell
Replies: 0 comments
Hey Crawlee folks 👋
Wanted to share a scraper I built on top of Crawlee that's been holding up surprisingly well: a LinkedIn Jobs search extractor that pulls full job listings at scale without requiring a LinkedIn API key or logged-in session.
What it does
Given a search query + location, it walks LinkedIn's public jobs search and returns structured results: title, company, location, posted date, description, apply URL, seniority, employment type. I've pulled ~45K results in a single run without tripping the usual guest-session walls.
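To make the output shape concrete, here is a rough TypeScript sketch of what one result record could look like. The field names, the nullable choices, and the `toJobListing` and `clean` helpers are all assumptions for illustration; the post only lists which fields are returned, not how they are named or typed.

```typescript
// Hypothetical record shape; field names are assumed, not taken from the scraper.
interface JobListing {
  title: string;
  company: string;
  location: string;
  postedDate: string | null;
  description: string;
  applyUrl: string | null;
  seniority: string | null;
  employmentType: string | null;
}

// Collapse runs of whitespace and turn empty or missing values into null.
function clean(value: string | undefined): string | null {
  const trimmed = (value ?? "").replace(/\s+/g, " ").trim();
  return trimmed === "" ? null : trimmed;
}

// Map a loosely typed scraped record onto the structured shape above.
function toJobListing(raw: Record<string, string | undefined>): JobListing {
  return {
    title: clean(raw["title"]) ?? "",
    company: clean(raw["company"]) ?? "",
    location: clean(raw["location"]) ?? "",
    postedDate: clean(raw["postedDate"]),
    description: clean(raw["description"]) ?? "",
    applyUrl: clean(raw["applyUrl"]),
    seniority: clean(raw["seniority"]),
    employmentType: clean(raw["employmentType"]),
  };
}
```

Fields that are often missing on a listing are kept as `null` here rather than empty strings, so downstream consumers can tell "not present" apart from "blank".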
How it's built
- CheerioCrawler for the listing pages (fast, no browser needed for the guest search endpoint)
- PuppeteerCrawler fallback for detail pages that gate behind JS
- RequestQueue for pagination through the offset-based result pages
- proxyConfiguration with rotating residential proxies + exponential backoff on 429s
- useSessionPool: true, so sessions get retired fast when LinkedIn starts shadow-blocking

Crawlee's session pool + request retry logic is what makes this viable; hand-rolling that retry state machine would be painful.
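Two of the ideas above, offset-based pagination and exponential backoff, can be sketched as plain functions with no Crawlee dependency. This is only an illustration under stated assumptions: the base URL is a placeholder (not LinkedIn's real endpoint), and the query parameter names, the page size of 25, and the backoff base and cap are all made up for the example. In the real scraper the generated URLs would presumably be fed into the request queue.

```typescript
// Placeholder endpoint and assumed page size; the post does not specify either.
const BASE_URL = "https://example.com/jobs/search";
const PAGE_SIZE = 25;

// Build one URL per result page using an offset parameter: start=0, 25, 50, ...
// The parameter names (keywords, location, start) are assumptions.
function buildPageUrls(keywords: string, location: string, maxResults: number): string[] {
  const urls: string[] = [];
  for (let start = 0; start < maxResults; start += PAGE_SIZE) {
    const url = new URL(BASE_URL);
    url.searchParams.set("keywords", keywords);
    url.searchParams.set("location", location);
    url.searchParams.set("start", String(start));
    urls.push(url.toString());
  }
  return urls;
}

// Exponential backoff: the delay doubles with each 0-based retry attempt,
// capped at maxMs so repeated 429s do not produce unbounded waits.
function backoffDelayMs(attempt: number, baseMs = 1000, maxMs = 60000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}
```

For example, `buildPageUrls("developer", "Berlin", 60)` yields three URLs with `start` set to 0, 25, and 50, and `backoffDelayMs(3)` waits 8 seconds with the default settings. Adding random jitter to the delay is a common refinement that is left out here to keep the sketch deterministic.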
Use cases
Trigger via Apify API:
Link: https://apify.com/cryptosignals/linkedin-jobs-scraper
Would love feedback on:
Happy to dig into any part of the implementation if it's useful.