mirror of https://github.com/Bunsly/JobSpy
196 lines
5.6 KiB
Markdown
196 lines
5.6 KiB
Markdown
<img src="https://github.com/cullenwatson/JobSpy/assets/78247585/ae185b7e-e444-4712-8bb9-fa97f53e896b" width="400">
|
||
|
||
**JobSpy** is a simple, yet comprehensive, job scraping library.
|
||
## Features
|
||
|
||
|
||
- Scrapes job postings from **LinkedIn**, **Indeed** & **ZipRecruiter** simultaneously
|
||
- Aggregates the job postings in a Pandas DataFrame
|
||
|
||

|
||
|
||
### Installation
|
||
`pip install python-jobspy`
|
||
|
||
_Python version >= [3.10](https://www.python.org/downloads/release/python-3100/) required_
|
||
|
||
### Usage
|
||
|
||
```python
|
||
from jobspy import scrape_jobs
|
||
import pandas as pd
|
||
|
||
jobs: pd.DataFrame = scrape_jobs(
|
||
site_name=["indeed", "linkedin", "zip_recruiter"],
|
||
search_term="software engineer",
|
||
location="Dallas, TX",
|
||
results_wanted=10,
|
||
|
||
# country: only needed for indeed
|
||
country='USA'
|
||
)
|
||
|
||
if jobs.empty:
|
||
print("No jobs found.")
|
||
else:
|
||
#1 print
|
||
pd.set_option('display.max_columns', None)
|
||
pd.set_option('display.max_rows', None)
|
||
pd.set_option('display.width', None)
|
||
pd.set_option('display.max_colwidth', 50) # set to 0 to see full job url / desc
|
||
print(jobs)
|
||
|
||
#2 display in Jupyter Notebook
|
||
#display(jobs)
|
||
|
||
#3 output to .csv
|
||
#jobs.to_csv('jobs.csv', index=False)
|
||
```
|
||
|
||
### Output
|
||
```
|
||
SITE TITLE COMPANY_NAME CITY STATE JOB_TYPE INTERVAL MIN_AMOUNT MAX_AMOUNT JOB_URL DESCRIPTION
|
||
indeed Software Engineer AMERICAN SYSTEMS Arlington VA None yearly 200000 150000 https://www.indeed.com/viewjob?jk=5e409e577046... THIS POSITION COMES WITH A 10K SIGNING BONUS!...
|
||
indeed Senior Software Engineer TherapyNotes.com Philadelphia PA fulltime yearly 135000 110000 https://www.indeed.com/viewjob?jk=da39574a40cb... About Us TherapyNotes is the national leader i...
|
||
linkedin Software Engineer - Early Career Lockheed Martin Sunnyvale CA fulltime yearly None None https://www.linkedin.com/jobs/view/3693012711 Description:By bringing together people that u...
|
||
linkedin Full-Stack Software Engineer Rain New York NY fulltime yearly None None https://www.linkedin.com/jobs/view/3696158877 Rain’s mission is to create the fastest and ea...
|
||
zip_recruiter Software Engineer - New Grad ZipRecruiter Santa Monica CA fulltime yearly 130000 150000 https://www.ziprecruiter.com/jobs/ziprecruiter... We offer a hybrid work environment. Most US-ba...
|
||
zip_recruiter Software Developer TEKsystems Phoenix AZ fulltime hourly 65 75 https://www.ziprecruiter.com/jobs/teksystems-0... Top Skills' Details• 6 years of Java developme...
|
||
```
|
||
### Parameters for `scrape_jobs()`
|
||
```plaintext
|
||
Required
|
||
├── site_type (List[enum]): linkedin, zip_recruiter, indeed
|
||
└── search_term (str)
|
||
Optional
|
||
├── location (int)
|
||
├── distance (int): in miles
|
||
├── job_type (enum): fulltime, parttime, internship, contract
|
||
├── is_remote (bool)
|
||
├── results_wanted (int): number of job results to retrieve for each site specified in 'site_type'
|
||
├── easy_apply (bool): filters for jobs on LinkedIn that have the 'Easy Apply' option
|
||
├── country (enum): uses the corresponding subdomain on Indeed (e.g. Canada on Indeed is ca.indeed.com
|
||
```
|
||
|
||
|
||
### JobPost Schema
|
||
```plaintext
|
||
JobPost
|
||
├── title (str)
|
||
├── company_name (str)
|
||
├── job_url (str)
|
||
├── location (object)
|
||
│ ├── country (str)
|
||
│ ├── city (str)
|
||
│ ├── state (str)
|
||
├── description (str)
|
||
├── job_type (enum)
|
||
├── compensation (object)
|
||
│ ├── interval (CompensationInterval): yearly, monthly, weekly, daily, hourly
|
||
│ ├── min_amount (int)
|
||
│ ├── max_amount (int)
|
||
│ └── currency (str)
|
||
└── date_posted (datetime)
|
||
```
|
||
|
||
## Supported Countries for Job Searching
|
||
|
||
|
||
### **LinkedIn**
|
||
|
||
LinkedIn searches globally. Use the `location` parameter
|
||
|
||
### **ZipRecruiter**
|
||
|
||
ZipRecruiter searches for jobs in US/Canada. Use the `location` parameter
|
||
|
||
|
||
### **Indeed**
|
||
For Indeed, you `location` along with `country` param
|
||
|
||
You can specify the following countries when searching on Indeed (use the exact name):
|
||
|
||
- Argentina
|
||
- Australia
|
||
- Austria
|
||
- Bahrain
|
||
- Belgium
|
||
- Brazil
|
||
- Canada
|
||
- Chile
|
||
- China
|
||
- Colombia
|
||
- Costa Rica
|
||
- Czech Republic
|
||
- Denmark
|
||
- Ecuador
|
||
- Egypt
|
||
- Finland
|
||
- France
|
||
- Germany
|
||
- Greece
|
||
- Hong Kong
|
||
- Hungary
|
||
- India
|
||
- Indonesia
|
||
- Ireland
|
||
- Israel
|
||
- Italy
|
||
- Japan
|
||
- Kuwait
|
||
- Luxembourg
|
||
- Malaysia
|
||
- Mexico
|
||
- Morocco
|
||
- Netherlands
|
||
- New Zealand
|
||
- Nigeria
|
||
- Norway
|
||
- Oman
|
||
- Pakistan
|
||
- Panama
|
||
- Peru
|
||
- Philippines
|
||
- Poland
|
||
- Portugal
|
||
- Qatar
|
||
- Romania
|
||
- Saudi Arabia
|
||
- Singapore
|
||
- South Africa
|
||
- South Korea
|
||
- Spain
|
||
- Sweden
|
||
- Switzerland
|
||
- Taiwan
|
||
- Thailand
|
||
- Turkey
|
||
- Ukraine
|
||
- United Arab Emirates
|
||
- UK
|
||
- USA
|
||
- Uruguay
|
||
- Venezuela
|
||
- Vietnam
|
||
|
||
## Frequently Asked Questions
|
||
|
||
---
|
||
|
||
**Q: Encountering issues with your queries?**
|
||
**A:** Try reducing the number of `results_wanted` and/or broadening the filters. If problems persist, [submit an issue](#).
|
||
|
||
---
|
||
|
||
**Q: Received a response code 429?**
|
||
**A:** This indicates that you have been blocked by the job board site for sending too many requests. Currently, **ZipRecruiter** is particularly aggressive with blocking. We recommend:
|
||
|
||
- Waiting a few seconds between requests.
|
||
- Trying a VPN to change your IP address.
|
||
|
||
**Note:** Proxy support is in development and coming soon!
|
||
|
||
---
|
||
|
||
|