JobSpy/examples/JobSpy_AllSites.py

from jobspy import scrape_jobs
import pandas as pd

jobs: pd.DataFrame = scrape_jobs(
    site_name=["indeed", "linkedin", "zip_recruiter", "glassdoor"],
    search_term="software engineer",
    location="Dallas, TX",
    results_wanted=25,  # be wary the higher it is, the more likey you'll get blocked (rotating proxy can help tho)
    country_indeed="USA",
    # proxy="http://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",
)

# formatting for pandas
pd.set_option("display.max_columns", None)
pd.set_option("display.max_rows", None)
pd.set_option("display.width", None)
pd.set_option("display.max_colwidth", 50)  # set to 0 to see full job url / desc

# 1: output to console
print(jobs)

# 2: output to .csv
jobs.to_csv("./jobs.csv", index=False)
print("outputted to jobs.csv")

# 3: output to .xlsx
# jobs.to_xlsx('jobs.xlsx', index=False)

# 4: display in Jupyter Notebook (1. pip install jupyter 2. jupyter notebook)
# display(jobs)
add offset param & email extraction (#51) * add offset param * [enh]: extract emails 2023-09-28 16:11:28 -07:00			`from jobspy import scrape_jobs`
			`import pandas as pd`

			`jobs: pd.DataFrame = scrape_jobs(`
add long scrape example (#81) 2024-01-12 10:24:00 -08:00			`site_name=["indeed", "linkedin", "zip_recruiter", "glassdoor"],`
add offset param & email extraction (#51) * add offset param * [enh]: extract emails 2023-09-28 16:11:28 -07:00			`search_term="software engineer",`
			`location="Dallas, TX",`
add long scrape example (#81) 2024-01-12 10:24:00 -08:00			`results_wanted=25, # be wary the higher it is, the more likey you'll get blocked (rotating proxy can help tho)`
Multiple job types for Indeed, urgent keywords column (#56) * enh(indeed): mult job types * feat(jobs): urgent kws * fix(indeed): use new session obj per request * fix: emails as comma separated in output * fix: put num urgent words in output * chore: readme 2023-10-10 09:23:04 -07:00			`country_indeed="USA",`
add offset param & email extraction (#51) * add offset param * [enh]: extract emails 2023-09-28 16:11:28 -07:00			`# proxy="http://jobspy:5a4vpWtj8EeJ2hoYzk@ca.smartproxy.com:20001",`
			`)`

			`# formatting for pandas`
Multiple job types for Indeed, urgent keywords column (#56) * enh(indeed): mult job types * feat(jobs): urgent kws * fix(indeed): use new session obj per request * fix: emails as comma separated in output * fix: put num urgent words in output * chore: readme 2023-10-10 09:23:04 -07:00			`pd.set_option("display.max_columns", None)`
			`pd.set_option("display.max_rows", None)`
			`pd.set_option("display.width", None)`
			`pd.set_option("display.max_colwidth", 50) # set to 0 to see full job url / desc`
add offset param & email extraction (#51) * add offset param * [enh]: extract emails 2023-09-28 16:11:28 -07:00
			`# 1: output to console`
			`print(jobs)`

			`# 2: output to .csv`
Multiple job types for Indeed, urgent keywords column (#56) * enh(indeed): mult job types * feat(jobs): urgent kws * fix(indeed): use new session obj per request * fix: emails as comma separated in output * fix: put num urgent words in output * chore: readme 2023-10-10 09:23:04 -07:00			`jobs.to_csv("./jobs.csv", index=False)`
			`print("outputted to jobs.csv")`
add offset param & email extraction (#51) * add offset param * [enh]: extract emails 2023-09-28 16:11:28 -07:00
			`# 3: output to .xlsx`
			`# jobs.to_xlsx('jobs.xlsx', index=False)`

			`# 4: display in Jupyter Notebook (1. pip install jupyter 2. jupyter notebook)`
add long scrape example (#81) 2024-01-12 10:24:00 -08:00			`# display(jobs)`