Commit Graph

146 Commits (f010ce51b1a05554778a358945a4e6aa34bf0fe1)

Author SHA1 Message Date
Yariv Menachem 7c002ae76e fixed structure 2024-12-09 16:13:07 +02:00
Yariv Menachem 53a840e53e created main
fixed israel search on glasdoor and added to gitignore the csv and debug json
2024-12-08 17:30:06 +02:00
Jason Geffner 4e7ac9a583
Fix Google job search (#223)
The previous regex did not capture all expected matches in the returned content
2024-12-04 16:45:59 -06:00
Cullen Watson 338d854b96
fix(google): search (#216) 2024-10-25 14:54:14 -05:00
Cullen Watson 10a3592a0f docs:file 2024-10-24 15:26:49 -05:00
Cullen Watson b7905cc756 docs:file 2024-10-24 15:24:18 -05:00
Cullen Watson f6248c8386
enh: google jobs (#214) 2024-10-24 15:19:40 -05:00
Cullen Watson f395597fdd fix(indeed): offset 2024-10-22 19:25:07 -05:00
Cullen Watson 9f4083380d
indeed:remove tpe (#210) 2024-10-19 18:01:59 -05:00
Olzhas Arystanov 9207ab56f6
fix: extract tests out of src (#209) 2024-10-19 16:56:38 -05:00
Marcel Gozalbo Baró 6bc191d5c7
FEATURE: Add the "ca_cert" setting for providing a Certification Authority certificate in order to use proxies requiring it. (#204) 2024-10-08 17:46:46 -05:00
Cullen Watson 0cc34287f7 fix:turkey 2024-10-02 01:31:00 -05:00
Anton Pikhteryev 923979093b
Add Malta for linkedin country support (#198) 2024-09-19 20:41:22 -05:00
Cullen Watson f7b29d43a2
fix(indeed):sort relevance not date (#197) 2024-09-18 18:42:25 -05:00
Cullen Watson 6f1490458c
fix key error (#186) 2024-08-14 02:54:40 -05:00
Cullen Watson 6bb7d81ba8
change linkedin ep (#185) 2024-08-14 02:39:43 -05:00
Cullen Watson 0e046432d1
fix:variable bug (#181) 2024-08-05 12:47:55 -05:00
Cullen Watson 209e0e65b6
fix:malaysia indeed (#180) 2024-08-03 22:48:53 -05:00
Cullen Watson 8570c0651e
fix:key error (#176) 2024-07-21 13:05:18 -05:00
Cullen Watson 8678b0bbe4
enh: test on pr (#174) 2024-07-19 14:25:25 -05:00
Cullen Watson 60d4d911c9
lock file (#173) 2024-07-17 21:21:22 -05:00
Lluís Salord Quetglas 2a0cba8c7e
FEAT: Optional convertion to annual and know salary source (#170) 2024-07-17 21:05:33 -05:00
Cullen Watson 88c95c4ad5
enh: estimated salary (#169) 2024-07-16 19:20:34 -05:00
Cullen Watson 6330c14879 minor fix 2024-07-15 21:19:01 -05:00
Ali Bakhshi Ilani 48631ea271
Add company industry and job level to linkedin scraper (#166) 2024-07-15 21:07:39 -05:00
Cullen Watson edffe18e65
enh: listing source (#168) 2024-07-15 20:30:04 -05:00
Lluís Salord Quetglas 0988230a24
FEAT: Add Glassdoor logo data if available (#167) 2024-07-15 20:25:18 -05:00
Cullen Watson d000a81eb3
Salary parse (#163) 2024-06-09 17:45:38 -05:00
Cullen Watson ccb0c17660
enh: ziprecruiter full description (#162) 2024-06-09 16:21:01 -05:00
Cullen Watson 89a3ee231c
enh(li): job function (#160) 2024-05-28 16:01:29 -05:00
Cullen 6439f71433 chore: version 2024-05-28 15:39:24 -05:00
adamagassi 7f6271b2e0
LinkedIn scraper fixes: (#159)
Correct initial page offset calculation
Separate page variable from request counter
Fix job offset starting value
Increment offset by number of jobs returned instead of expected value
2024-05-28 15:38:13 -05:00
Cullen Watson 5cb7ffe5fd
enh: proxies (#157)
* enh: proxies

* enh: proxies
2024-05-25 14:04:09 -05:00
fasih hussain 08d63a87a2
chore: id added for JobPost schema (#152) 2024-05-20 11:45:52 -05:00
Cullen 1ffdb1756f fix: dup line 2024-04-30 12:11:48 -05:00
Lluís Salord Quetglas dcd7144318
FIX: Allow Indeed search term with complex syntax (#139) 2024-04-30 12:05:43 -05:00
Cullen Watson bf73c061bd
enh: linkedin company logo (#141) 2024-04-30 12:03:10 -05:00
Lluís Salord Quetglas 8dd08ed9fd
FEAT: Allow LinkedIn scraper to get external job apply url (#140) 2024-04-30 11:36:01 -05:00
Cullen 3e93454738 fix(indeed): readd param 2024-03-11 21:23:20 -05:00
VitaminB16 4b7bdb9313
feat: Adjust log verbosity via verbose arg (#128) 2024-03-11 14:38:44 -05:00
Cullen Watson ada38532c3 fix: indeed empty location term 2024-03-11 09:42:43 -05:00
Cullen Watson 3b0017964c fix: indeed empty search term 2024-03-11 09:21:11 -05:00
VitaminB16 94d8f555fd
format: Apply Black formatter to the codebase (#127) 2024-03-10 23:36:27 -05:00
Cullen Watson 0a669e9ba8
enh: indeed more fields (#126) 2024-03-09 01:40:01 -06:00
gigaSec a4f6851c32
Fix GlassDoor Country Vietnam(#122) 2024-03-04 17:35:57 -06:00
troy-conte db01bc6bbb
log search updates, fix glassdoor (#120) 2024-03-04 16:39:38 -06:00
Cullen Watson f8a4eccc6b
Remove pandas warning (#118) 2024-02-29 21:30:56 -06:00
Cullen Watson ba3a16b228
Description format (#107) 2024-02-14 16:04:23 -06:00
Cullen Watson aeb1a50d2c
fix job type search (#106) 2024-02-12 11:02:48 -06:00
VitaminB16 91b137ef86
feat: Ability to query by time posted for linkedin, indeed, glassdoor, ziprecruiter (#103) 2024-02-09 14:02:03 -06:00
Cullen Watson 2563c5ca08
enh: Indeed company url (#104) 2024-02-09 12:05:10 -06:00
Cullen Watson 6ec7c24f7f
enh(linkedin): search by company ids (#99) 2024-02-04 09:21:45 -06:00
Cullen Watson 02caf1b38d
fix(zr): date posted (#98) 2024-02-03 07:20:53 -06:00
Cullen Watson 8e2ab277da
fix(ziprecruiter): pagination (#97)
* fix(ziprecruiter): pagination

* chore: version
2024-02-02 20:48:28 -06:00
Cullen Watson ce3bd84ee5
fix: indeed parse description bug (#96)
* fix(indeed): full descr

* chore: version
2024-02-02 18:21:55 -06:00
Cullen Watson 13c7694474
Easy apply (#95)
* enh(glassdoor): easy apply filter

* enh(ziprecruiter): easy apply

* enh(indeed): use mobile headers

* chore: version
2024-02-02 17:47:15 -06:00
Cullen Watson bbe46fe3f4
enh(glassdoor): easy apply filter (#92) 2024-02-01 19:42:24 -06:00
Cullen Watson b97c73ffd6
fix: clean description (#88) 2024-01-28 21:50:41 -06:00
Cullen Watson 5b3627b244
enh: full description param (#85) 2024-01-22 20:22:32 -06:00
Cullen Watson 2ec3b04777
fix(ziprecruiter): init cookies (#82) 2024-01-12 12:28:35 -06:00
Cullen Watson a7ad616567
fix: linkedin no results (#80) 2024-01-10 14:01:10 -06:00
Cullen Watson 22870438c7
linkedin fix delays (#79) 2024-01-09 19:32:51 -06:00
Cullen Watson a5916edcdd
fix(glassdoor): add retry adapter (#77) 2024-01-03 12:04:32 -06:00
Augusto Gunsch 33d442bf1e
Add czech to Indeed (#72) 2023-12-02 02:42:54 -06:00
Vincent Yan eed7fca300
Get full indeed description (#70) 2023-11-27 15:00:36 -06:00
Faraz Khan dfb8c18c51
include location with 3 parts (#69) 2023-11-10 16:59:42 -06:00
Faraz Khan 81f70ff8a5
added salary data for linkedin (#68) 2023-11-09 14:57:15 -06:00
Cullen Watson cc9e7866b7
fix linkedin bug & add linkedin company url (#67) 2023-11-08 15:51:07 -06:00
Cullen Watson 2b7fea40a5 [fix] glassdoor duplicates 2023-10-30 20:29:55 -05:00
Cullen Watson d37f86e1b9 [fix] glassdoor location 2023-10-30 20:19:56 -05:00
Cullen Watson 0302ab14f5 glassdoor keywords 2023-10-30 20:07:31 -05:00
Cullen Watson 3f2b582445
add glassdoor (#66) 2023-10-30 19:57:36 -05:00
Cullen Watson 93223b6a38 bug fix 2023-10-30 13:57:23 -05:00
Cullen Watson e3fc222eb5
readd proxy support for zip (#64) 2023-10-29 08:54:56 -05:00
Cullen e2f6885d61 chore: format 2023-10-28 16:52:05 -05:00
Cullen 216d3fd39f ziprecruiter: 5s delay 2023-10-28 16:41:32 -05:00
Cullen Watson d3bfdc0a6e
ziprecruiter api (#63) 2023-10-28 16:17:28 -05:00
Cullen Watson ba5ed803ca
use ziprecuriter api (#62) 2023-10-28 15:51:29 -05:00
Cullen Watson f2cc74b7f2
Fix Indeed exceptions on parsing description 2023-10-18 14:25:53 -05:00
Cullen Watson 90fa4a4c4f feat: utils.py 2023-10-10 11:29:29 -05:00
Cullen Watson e5353e604d
Multiple job types for Indeed, urgent keywords column (#56)
* enh(indeed): mult job types

* feat(jobs):  urgent kws

* fix(indeed): use new session obj per request

* fix: emails as comma separated in output

* fix: put num urgent words in output

* chore: readme
2023-10-10 11:23:04 -05:00
Cullen Watson 628f4dee9c
[fix] indeed - min & max values swapped (#54) 2023-10-03 09:22:18 -05:00
Cullen Watson 008ca61e12 [fix] readd hyperlink param 2023-09-28 18:53:21 -05:00
Cullen Watson bff39a2625 [fix] util func 2023-09-28 18:33:14 -05:00
Cullen Watson c676050dc0 [fix] util func 2023-09-28 18:33:02 -05:00
Cullen Watson 9fb2fdd80f [fix] add utils.py 2023-09-28 18:25:56 -05:00
Cullen Watson af07c1ecbd
add offset param & email extraction (#51)
* add offset param

* [enh]: extract emails
2023-09-28 18:11:28 -05:00
Cullen Watson 558e352939 fix: job type param bug 2023-09-21 17:42:24 -05:00
Cullen Watson 59f739018a
Proxy support (#44)
* add proxy support

* return as data frame
2023-09-07 11:28:17 -05:00
Zachary Hampton 690739e858 - refactor & #41 bug fix 2023-09-06 16:32:51 -07:00
Cullen Watson fd883178be
Thread sites (#40) 2023-09-06 09:47:11 -05:00
Cullen Watson 1c264b8c58
Indeed country support (#38) 2023-09-05 12:17:22 -05:00
Cullen Watson 7ae7ecdee8
Validation error (#35) 2023-09-03 20:05:31 -05:00
Cullen Watson 7cc8f4864c move /tests to /src 2023-09-03 15:40:44 -05:00
Cullen Watson 24faa258df dir structure 2023-09-03 12:30:13 -05:00
Cullen Watson 8579c8e985 proj structure 2023-09-03 12:05:50 -05:00