Skip to content

ercedes-owe/6thStreet-Selenium-Link-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 

Repository files navigation

6thStreet Selenium Link Scraper

The 6thStreet Selenium Link Scraper automates the discovery of product listing URLs from 6thstreet.com using a headless browser. It handles dynamically rendered pages and pagination to reliably collect all product links for further scraping or catalog building.

Bitbash Banner

Telegram Β  WhatsApp Β  Gmail Β  Website

Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for 6thStreet Selenium Link Scraper you've just found your team β€” Let's Chat. πŸ‘†πŸ‘†

Introduction

This actor uses Selenium (or a headless browser) to load category, search result, or collection pages on 6thstreet.com β€” including pages that rely on JavaScript for rendering. It extracts all individual product page URLs and outputs them as a structured JSON array. This makes it a useful first-step tool for larger scraping workflows, where you first gather URLs, then feed them into a detailed product scraper.

How It Works

  • Accepts start URLs (category pages or search pages) as input
  • Uses headless browser automation to load page content and ensure JS is executed
  • Handles pagination when there are multiple result pages
  • Extracts all product listing URLs from each page
  • Returns structured output with a list of product URLs

Features

Feature Description
Selenium-Based Rendering Uses a headless browser to handle JS-heavy site behavior.
Link Extraction Captures all product URLs from listing pages.
Pagination Support Automatically iterates through paginated result pages.
Proxy & Delay Handling Supports proxy configuration and delays for lower detection risk.
Structured Output Returns a clean JSON list of URLs for further scraping workflows.

What Data This Scraper Extracts

Field Name Field Description
url A product listing URL found on 6thstreet.com.

Example Output

[
  {
    "url": "https://www.6thstreet.com/product/womens-shoes/12345678"
  },
  {
    "url": "https://www.6thstreet.com/product/mens-jacket/87654321"
  }
]

Directory Structure Tree

6thstreet-selenium-link-scraper/
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ main.js  (or main.py depending on implementation)  
β”‚   β”œβ”€β”€ scraper/  
β”‚   β”‚   β”œβ”€β”€ browser_launcher.js  
β”‚   β”‚   β”œβ”€β”€ link_extractor.js  
β”‚   β”‚   └── pagination_handler.js  
β”‚   └── config/  
β”‚       └── input_schema.json  
β”œβ”€β”€ package.json (or requirements.txt)  
└── README.md

Use Cases

  • Catalog creators collect all product links from 6thstreet for feed generation or bulk analysis.
  • Market researchers gather product lists to monitor pricing, availability, and assortment changes.
  • E-commerce developers use link output as input for detailed product scraping pipelines.
  • Data engineers build datasets containing all product URLs for downstream ETL or data enrichment tasks.

FAQs

Does this Scraper fetch product details?
No β€” this actor only collects product page URLs. Use a separate scraper to fetch detailed product data like price, images, or descriptions.

Will it work on pages that load content dynamically (e.g., infinite scroll, JS render)?
Yes β€” since it uses headless browser automation, it handles JavaScript rendering and dynamic content loading.

Does it support pagination?
Yes β€” the scraper iterates through subsequent pages automatically when pagination exists.

What format does the output come in?
A JSON array where each element contains a url field with a product page link.


Book a Call Watch on YouTube

Review 1

"Bitbash is a top-tier automation partner, innovative, reliable, and dedicated to delivering real results every time."

Nathan Pennington
Marketer
β˜…β˜…β˜…β˜…β˜…

Review 2

"Bitbash delivers outstanding quality, speed, and professionalism, truly a team you can rely on."

Eliza
SEO Affiliate Expert
β˜…β˜…β˜…β˜…β˜…

Review 3

"Exceptional results, clear communication, and flawless delivery.
Bitbash nailed it."

Syed
Digital Strategist
β˜…β˜…β˜…β˜…β˜…

Releases

No releases published

Packages

No packages published