Skip to content

Fixed PTC and Cargill scraper#661

Merged
lalalaurentiu merged 2 commits intopeviitor-ro:mainfrom
lalalaurentiu:main
Nov 7, 2025
Merged

Fixed PTC and Cargill scraper#661
lalalaurentiu merged 2 commits intopeviitor-ro:mainfrom
lalalaurentiu:main

Conversation

@lalalaurentiu
Copy link
Collaborator

This pull request updates the job scraping logic for the sites/ptc.py and sites/cargill.py scripts to improve data accuracy and compatibility with changes in the external API. The most important changes include switching to a new jobs API endpoint for PTC, updating the data extraction logic to match the new API response format, and standardizing the location fields for jobs in Bucharest.

PTC API integration and data extraction updates:

  • Switched the jobs API endpoint in sites/ptc.py to https://ptc.eightfold.ai/api/pcsx/search, which supports new filtering options and returns job data in a different structure.
  • Updated the logic for calculating the number of pages and extracting job positions to match the new API response format, specifically accessing job data from the nested data field.

Location data standardization:

  • Standardized the city and county fields for PTC jobs to always be "Bucuresti", ensuring consistency in location reporting.
  • Added a hardcoded "country": "Romania" field to each job entry in both sites/ptc.py and sites/cargill.py to ensure country information is always present. [1] [2]

@lalalaurentiu lalalaurentiu merged commit a2bcba1 into peviitor-ro:main Nov 7, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant