Skip to content

[Archive.org] Not grabbing all URLs onscreen #109

@Tenome

Description

@Tenome

Site Link

https://archive.org/details/software?tab=collection&query=-subject%3A%28ps2%29+-subject%3A%28ps1%29+-subject%3A%28sega+saturn%29+-subject%3A%28ps3%29&page=7&sort=-publicdate&and%5B%5D=subject%3A%22PC+Game%22&and%5B%5D=subject%3A%22PC-98%22&and%5B%5D=subject%3A%22IBM+PC%22&and%5B%5D=subject%3A%22macintosh%22&and%5B%5D=subject%3A%22IBM+PC+Compatible%22&and%5B%5D=subject%3A%22mac%22&and%5B%5D=subject%3A%22Doujin%22&and%5B%5D=subject%3A%22Doujin+Games%22&and%5B%5D=subject%3A%22doujin+soft%22&and%5B%5D=subject%3A%22Doujin+games%22&and%5B%5D=subject%3A%22doujin+games%22&and%5B%5D=subject%3A%22Doujin+Game%22&and%5B%5D=subject%3A%22Doujin+game%22&and%5B%5D=subject%3A%22doujin+game%22&and%5B%5D=subject%3A%22doujin%22&and%5B%5D=mediatype%3A%22software%22&and%5B%5D=language%3A%22Japanese%22

Details

I'm trying to grab all the URLs in this search result, but the extension only grabs some of the links despite me loading way more than that into the browser. The filter used is https://archive.org/details/. I scrolled down and loaded the subsequent pages that way, but it seems to only grab what is roughly on screen instead of the entire list (around 177 URLs). You can test this yourself by scrolling down and letting it load more pages, and then searching for a URL from the top of the search results (which won't appear in the extracted list).

Support Information

Link Extractor - 0.7.8
Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:133.0) Gecko/20100101 Firefox/133.0
permissions.origins: []
options: {"linksDisplay":-1,"flags":"ig","lazyLoad":true,"removeDuplicates":true,"defaultFilter":true,"saveState":true,"linksTruncate":false,"linksNoWrap":false,"contextMenu":true,"showUpdate":false}
links-table: {"time":1732033215784,"start":0,"length":-1,"order":[[0,"asc"]],"search":{"caseInsensitive":true,"search":"","regex":true,"smart":true,"return":false,"_hungarianMap":{}},"columns":[{"visible":true,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}},{"visible":false,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}},{"visible":false,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}},{"visible":false,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}},{"visible":false,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}},{"visible":false,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}}],"childRows":[]}
domains-table: {"time":1732033215788,"start":0,"length":-1,"order":[[0,"asc"]],"search":{"caseInsensitive":true,"search":"","regex":true,"smart":true,"return":false,"_hungarianMap":{}},"columns":[{"visible":true,"search":{"caseInsensitive":true,"search":"","regex":false,"smart":true,"return":false}}],"childRows":[]}

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions