An automation project for scraping job data from the peviitor.ro platform using OpenCode AI agents.
This project is developed and maintained by Asociația Oportunități și Cariere, a Romanian NGO dedicated to simplifying the job search process in Romania.
Our mission is to help people find jobs from hundreds of companies through our free platform peviitor.ro.
Features:
- Automated Job Scraping: scrape jobs from 25+ company career pages
- Solr Integration: push job data to the Solr search engine
- URL Validation: validate job URLs and remove 404/inactive jobs
- Company Management: add and manage company scraping prompts
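
The URL validation feature can be pictured with a small sketch. This is not the project's implementation (the real work is driven by OpenCode agents and prompts). The function names `check_url`, `is_inactive`, and `split_jobs`, the `"url"` key on each job, and the choice of 404 and 410 as the "inactive" statuses are all assumptions made for illustration.

```python
import urllib.error
import urllib.request

# Assumption: a job counts as inactive when its page is gone (404 or 410).
INACTIVE_STATUSES = {404, 410}


def is_inactive(status: int) -> bool:
    """Return True if an HTTP status code marks the job page as gone."""
    return status in INACTIVE_STATUSES


def check_url(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code for url, using a HEAD request."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises on 4xx/5xx responses; the status code is still available.
        return error.code


def split_jobs(jobs, statuses):
    """Split jobs into (active, inactive) given a url -> status mapping."""
    active, inactive = [], []
    for job in jobs:
        target = inactive if is_inactive(statuses[job["url"]]) else active
        target.append(job)
    return active, inactive
```

In this sketch, `split_jobs` is kept separate from `check_url` so the classification can be exercised without network access. Connection failures (`urllib.error.URLError`) are deliberately not caught, since a timeout does not prove that a job is gone.
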
Requirements:
- Windows PowerShell 5.1 or PowerShell 7+
- Google Chrome (for automation)
- Docker Desktop (for Solr)
- Node.js 18+ (for Playwright tests)
Setup:
- Start Chrome with debugging: `powershell -ExecutionPolicy Bypass -File start-chrome.ps1`
- Start Solr: `docker start peviitor-solr`
- Install test dependencies: `cd tests && npm install`
| Command | Description |
|---|---|
| `/scrape [company]` | Scrape jobs from a company |
| `/add-website` | Add new company to websites.md |
| `/remove-404` | Validate job URLs and remove inactive |
| `/update-solr` | Update Solr with new data |
| `/delete-solr` | Delete jobs from Solr |
| `/clean-project` | Clean temp files and update docs |
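
To give a rough idea of what `/update-solr` and `/delete-solr` have to send, here is a minimal sketch of Solr's JSON update requests. It is an illustration under assumptions, not the commands' actual implementation. The core name `job`, the host and port, and the helper names are hypothetical, and the real field names are whatever SCHEMAS.md defines.

```python
import json
import urllib.request

# Assumptions: Solr listens on its default port and the core is named "job".
SOLR_UPDATE_URL = "http://localhost:8983/solr/job/update?commit=true"


def build_add_payload(jobs) -> str:
    """Solr's JSON update handler accepts a JSON array of documents to add."""
    return json.dumps(list(jobs))


def build_delete_payload(ids) -> str:
    """Delete documents by their uniqueKey values: {"delete": [id, ...]}."""
    return json.dumps({"delete": list(ids)})


def post_to_solr(body: str, url: str = SOLR_UPDATE_URL) -> int:
    """POST a JSON body to the update handler and return the HTTP status."""
    request = urllib.request.Request(
        url,
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.status
```

With the Solr container running, `post_to_solr(build_delete_payload(ids))` would remove the listed documents, where each entry in `ids` is a value of the schema's unique key field.
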
peviitor_opencode_AI_scrapers/
├── docs/ # HTML documentation
├── .opencode/commands/ # OpenCode commands
├── webscraper/ # Company scraping prompts
├── tests/ # Playwright tests
├── start-chrome.ps1 # Chrome startup script
├── SCHEMAS.md # Data schemas
├── AGENTS.md # Agent instructions
└── INSTRUCTIONS.md # Workflow instructions
Contributions are welcome! Please read our Contributing Guidelines and Code of Conduct.
This project is licensed under the MIT License; see the LICENSE file for details.
See SECURITY.md for reporting vulnerabilities.