The 6thStreet Selenium Link Scraper automates the discovery of product listing URLs from 6thstreet.com using a headless browser. It handles dynamically rendered pages and pagination to reliably collect all product links for further scraping or catalog building.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for 6thStreet Selenium Link Scraper, you've just found your team. Let's Chat.
This actor uses Selenium (or a headless browser) to load category, search result, or collection pages on 6thstreet.com, including pages that rely on JavaScript for rendering. It extracts all individual product page URLs and outputs them as a structured JSON array. This makes it a useful first-step tool for larger scraping workflows, where you first gather URLs, then feed them into a detailed product scraper.
- Accepts start URLs (category pages or search pages) as input
- Uses headless browser automation to load page content and ensure JS is executed
- Handles pagination when there are multiple result pages
- Extracts all product listing URLs from each page
- Returns structured output with a list of product URLs
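The extraction step in the list above can be sketched with the Python standard library alone. This is a minimal illustration, not the actor's actual code: it assumes product pages live under a `/product/` path (as in the sample output in this README), and `ProductLinkParser` and `extract_product_links` are hypothetical names. In a real run, the HTML would come from the browser after rendering (for example Selenium's `driver.page_source`).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

BASE_URL = "https://www.6thstreet.com"
# Assumption: product pages live under /product/, as in this README's sample output.
PRODUCT_PATH_MARKER = "/product/"


class ProductLinkParser(HTMLParser):
    """Collects href values of <a> tags that look like product links."""

    def __init__(self, base_url=BASE_URL):
        super().__init__()
        self.base_url = base_url
        self.links = []  # keeps first-seen order, without duplicates

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the site root.
        absolute = urljoin(self.base_url, href)
        is_product = PRODUCT_PATH_MARKER in urlparse(absolute).path
        if is_product and absolute not in self.links:
            self.links.append(absolute)


def extract_product_links(html, base_url=BASE_URL):
    """Return the unique product URLs found in a rendered HTML string."""
    parser = ProductLinkParser(base_url)
    parser.feed(html)
    return parser.links
```

Parsing the rendered HTML separately from driving the browser keeps the extraction logic easy to test without launching Chrome.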
| Feature | Description |
|---|---|
| Selenium-Based Rendering | Uses a headless browser to handle JS-heavy site behavior. |
| Link Extraction | Captures all product URLs from listing pages. |
| Pagination Support | Automatically iterates through paginated result pages. |
| Proxy & Delay Handling | Supports proxy configuration and delays to reduce the risk of detection. |
| Structured Output | Returns a clean JSON list of URLs for further scraping workflows. |
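As a rough sketch of how the rendering, proxy, and delay features in the table might fit together, the helpers below build Chrome flags and add a randomized pause between page loads. The function names, default values, and the choice of Chrome are assumptions for illustration; the actor's real configuration keys are not documented here. `launch_browser` needs the `selenium` package and a local Chrome install, so it imports Selenium lazily and the other helpers work without it.

```python
import random
import time


def build_chrome_arguments(proxy=None, headless=True):
    """Return Chrome command-line flags for a scraping session."""
    args = ["--disable-gpu", "--window-size=1920,1080"]
    if headless:
        args.append("--headless=new")
    if proxy:
        # Example value: "http://127.0.0.1:8080" (hypothetical proxy address).
        args.append(f"--proxy-server={proxy}")
    return args


def polite_delay(min_seconds=1.0, max_seconds=3.0, sleep=time.sleep):
    """Sleep for a random interval and return the chosen duration."""
    duration = random.uniform(min_seconds, max_seconds)
    sleep(duration)
    return duration


def launch_browser(proxy=None, headless=True):
    """Start Chrome through Selenium (requires `pip install selenium`)."""
    # Imported lazily so the helpers above can be used without Selenium.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    for arg in build_chrome_arguments(proxy, headless):
        options.add_argument(arg)
    return webdriver.Chrome(options=options)
```

A typical loop would call `driver.get(url)`, read `driver.page_source`, then call `polite_delay()` before the next page, and finish with `driver.quit()`.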
| Field Name | Field Description |
|---|---|
| url | A product listing URL found on 6thstreet.com. |
[
  {
    "url": "https://www.6thstreet.com/product/womens-shoes/12345678"
  },
  {
    "url": "https://www.6thstreet.com/product/mens-jacket/87654321"
  }
]
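For illustration, a list of collected URLs could be turned into the output shape above with a few lines of Python. `to_output_records` and `dump_output` are hypothetical helper names, and dropping duplicates and blank entries is an assumption about sensible behavior rather than something this README specifies.

```python
import json


def to_output_records(urls):
    """Turn an iterable of URLs into [{"url": ...}, ...], dropping duplicates."""
    seen = set()
    records = []
    for url in urls:
        cleaned = url.strip()
        if cleaned and cleaned not in seen:
            seen.add(cleaned)
            records.append({"url": cleaned})
    return records


def dump_output(urls, indent=2):
    """Serialize the records as a JSON array string."""
    return json.dumps(to_output_records(urls), indent=indent)
```

A downstream product scraper can then read the array and iterate over each record's `url` field.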
6thstreet-selenium-link-scraper/
├── src/
│   ├── main.js (or main.py depending on implementation)
│   ├── scraper/
│   │   ├── browser_launcher.js
│   │   ├── link_extractor.js
│   │   └── pagination_handler.js
│   └── config/
│       └── input_schema.json
├── package.json (or requirements.txt)
└── README.md
- Catalog creators collect all product links from 6thstreet for feed generation or bulk analysis.
- Market researchers gather product lists to monitor pricing, availability, and assortment changes.
- E-commerce developers use link output as input for detailed product scraping pipelines.
- Data engineers build datasets containing all product URLs for downstream ETL or data enrichment tasks.
Does this scraper fetch product details?
No. This actor only collects product page URLs. Use a separate scraper to fetch detailed product data such as price, images, or descriptions.
Will it work on pages that load content dynamically (e.g., infinite scroll, JS rendering)?
Yes. Because it uses headless browser automation, it handles JavaScript rendering and dynamic content loading.
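One common way to handle infinite scroll with a Selenium-style driver is to keep scrolling until the page height stops growing. The sketch below shows that general technique under stated assumptions; it is not confirmed to be what this actor does. `scroll_until_stable` is a hypothetical name, the pause and round limits are arbitrary defaults, and `driver` can be any object exposing `execute_script`, as Selenium's WebDriver does.

```python
import time


def scroll_until_stable(driver, pause_seconds=1.5, max_rounds=20, sleep=time.sleep):
    """Scroll to the bottom until the page height stops growing.

    Returns the number of scrolls that loaded new content.
    """
    get_height = "return document.body.scrollHeight"
    last_height = driver.execute_script(get_height)
    grew = 0
    for _ in range(max_rounds):
        driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
        sleep(pause_seconds)  # give lazy-loaded items time to render
        new_height = driver.execute_script(get_height)
        if new_height == last_height:
            break  # nothing new appeared, so assume the list is complete
        last_height = new_height
        grew += 1
    return grew
```

The `max_rounds` cap guards against pages that keep loading content indefinitely.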
Does it support pagination?
Yes. The scraper iterates through subsequent pages automatically when pagination exists.
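As a hedged sketch of what such iteration could look like, the code below assumes result pages are addressed by a `page` query parameter. That parameter name is an assumption for illustration, not verified against 6thstreet.com; a site may instead use "next" links or load-more buttons. `page_url` and `crawl_pages` are hypothetical names, and `fetch_links` stands in for whatever loads a page and returns its product URLs.

```python
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse


def page_url(start_url, page_number, param="page"):
    """Return start_url with the page query parameter set to page_number."""
    parts = urlparse(start_url)
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query[param] = str(page_number)
    return urlunparse(parts._replace(query=urlencode(query)))


def crawl_pages(start_url, fetch_links, max_pages=50):
    """Visit pages 1, 2, ... until a page adds no new links or max_pages is hit.

    fetch_links is any callable that takes a URL and returns a list of product URLs.
    """
    collected = []
    seen = set()
    for page_number in range(1, max_pages + 1):
        added = 0
        for link in fetch_links(page_url(start_url, page_number)):
            if link not in seen:
                seen.add(link)
                collected.append(link)
                added += 1
        if added == 0:
            break  # an empty or fully repeated page is treated as the end
    return collected
```

Stopping when a page contributes no new links also covers sites that silently repeat the last page for out-of-range page numbers.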
What format does the output come in?
A JSON array where each element contains a url field with a product page link.
