mirror of https://github.com/Bunsly/JobSpy
docs: update vid
parent
ca5155f234
commit
4406f9350f
37
README.md
37
README.md
|
@ -8,15 +8,13 @@
|
||||||
- Aggregates the job postings in a Pandas DataFrame
|
- Aggregates the job postings in a Pandas DataFrame
|
||||||
- Proxy support (HTTP/S, SOCKS)
|
- Proxy support (HTTP/S, SOCKS)
|
||||||
|
|
||||||
[Video Guide for JobSpy](https://www.youtube.com/watch?v=-yS3mgI5H-4) (outdated, updating within couple hours)
|
[Video Guide for JobSpy](https://www.youtube.com/watch?v=RuP1HrAZnxs&pp=ygUgam9icyBzY3JhcGVyIGJvdCBsaW5rZWRpbiBpbmRlZWQ%3D) - Updated for release v1.1.3
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
![jobspy](https://github.com/cullenwatson/JobSpy/assets/78247585/ec7ef355-05f6-4fd3-8161-a817e31c5c57)
|
![jobspy](https://github.com/cullenwatson/JobSpy/assets/78247585/ec7ef355-05f6-4fd3-8161-a817e31c5c57)
|
||||||
|
|
||||||
### Installation
|
### Installation
|
||||||
```
|
```
|
||||||
pip install python-jobspy
|
pip install --upgrade python-jobspy
|
||||||
```
|
```
|
||||||
|
|
||||||
_Python version >= [3.10](https://www.python.org/downloads/release/python-3100/) required_
|
_Python version >= [3.10](https://www.python.org/downloads/release/python-3100/) required_
|
||||||
|
@ -28,33 +26,33 @@ from jobspy import scrape_jobs
|
||||||
import pandas as pd
|
import pandas as pd
|
||||||
|
|
||||||
jobs: pd.DataFrame = scrape_jobs(
|
jobs: pd.DataFrame = scrape_jobs(
|
||||||
site_name=['indeed', 'linkedin', 'zip_recruiter'],
|
site_name=["indeed", "linkedin", "zip_recruiter"],
|
||||||
search_term='software engineer',
|
search_term="software engineer",
|
||||||
location='Dallas, TX',
|
location="Dallas, TX",
|
||||||
results_wanted=10,
|
results_wanted=10,
|
||||||
|
|
||||||
country_indeed='USA' # only needed for indeed
|
country_indeed='USA' # only needed for indeed
|
||||||
|
|
||||||
# use if you want to use a proxy
|
# use if you want to use a proxy (3 types)
|
||||||
# proxy="socks5://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
|
# proxy="socks5://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
|
||||||
# proxy="http://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
|
# proxy="http://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
|
||||||
# proxy="https://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
|
# proxy="https://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
|
||||||
)
|
)
|
||||||
|
|
||||||
|
# formatting for pandas
|
||||||
pd.set_option('display.max_columns', None)
|
pd.set_option('display.max_columns', None)
|
||||||
pd.set_option('display.max_rows', None)
|
pd.set_option('display.max_rows', None)
|
||||||
pd.set_option('display.width', None)
|
pd.set_option('display.width', None)
|
||||||
pd.set_option('display.max_colwidth', 50) # set to 0 to see full job url / desc
|
pd.set_option('display.max_colwidth', 50) # set to 0 to see full job url / desc
|
||||||
|
|
||||||
#1 output
|
#1 display in Jupyter Notebook (1. pip install jupyter 2. jupyter notebook)
|
||||||
|
display(jobs)
|
||||||
|
|
||||||
|
#2 output to console
|
||||||
print(jobs)
|
print(jobs)
|
||||||
|
|
||||||
#2 display in Jupyter Notebook
|
|
||||||
#display(jobs)
|
|
||||||
#display(errors)
|
|
||||||
|
|
||||||
#3 output to .csv
|
#3 output to .csv
|
||||||
#result.jobs.to_csv('result.jobs.csv', index=False)
|
#jobs.jobs.to_csv('result.jobs.csv', index=False)
|
||||||
```
|
```
|
||||||
|
|
||||||
### Output
|
### Output
|
||||||
|
@ -76,10 +74,11 @@ Optional
|
||||||
├── location (int)
|
├── location (int)
|
||||||
├── distance (int): in miles
|
├── distance (int): in miles
|
||||||
├── job_type (enum): fulltime, parttime, internship, contract
|
├── job_type (enum): fulltime, parttime, internship, contract
|
||||||
|
├── proxy (str): in format 'http://user:pass@host:port' or [https, socks]
|
||||||
├── is_remote (bool)
|
├── is_remote (bool)
|
||||||
├── results_wanted (int): number of job results to retrieve for each site specified in 'site_type'
|
├── results_wanted (int): number of job results to retrieve for each site specified in 'site_type'
|
||||||
├── easy_apply (bool): filters for jobs that are hosted on LinkedIn
|
├── easy_apply (bool): filters for jobs that are hosted on LinkedIn
|
||||||
├── country_indeed (enum): filters the country on Indeed
|
├── country_indeed (enum): filters the country on Indeed (see below for correct spelling)
|
||||||
```
|
```
|
||||||
|
|
||||||
|
|
||||||
|
@ -103,6 +102,12 @@ JobPost
|
||||||
└── date_posted (date)
|
└── date_posted (date)
|
||||||
```
|
```
|
||||||
|
|
||||||
|
### Exceptions
|
||||||
|
The following exceptions may be raised when using JobSpy:
|
||||||
|
* `LinkedInException`
|
||||||
|
* `IndeedException`
|
||||||
|
* `ZipRecruiterException`
|
||||||
|
|
||||||
## Supported Countries for Job Searching
|
## Supported Countries for Job Searching
|
||||||
|
|
||||||
|
|
||||||
|
@ -145,7 +150,7 @@ You can specify the following countries when searching on Indeed (use the exact
|
||||||
---
|
---
|
||||||
|
|
||||||
**Q: Encountering issues with your queries?**
|
**Q: Encountering issues with your queries?**
|
||||||
**A:** Try reducing the number of `results_wanted` and/or broadening the filters. If problems persist, [submit an issue](#).
|
**A:** Try reducing the number of `results_wanted` and/or broadening the filters. If problems persist, [submit an issue](https://github.com/cullenwatson/JobSpy/issues).
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue