Skip to content
This repository was archived by the owner on Apr 12, 2019. It is now read-only.

Conversation

@TimHenkelmann
Copy link
Contributor

Worldcat only displays the first 5000 entries (https://www.worldcat.org/search?q=dt%3Asco&fq=yr%3A1800&dblist=638&start=4991 gives the same results as https://www.worldcat.org/search?q=dt%3Asco&fq=yr%3A1800&dblist=638&start=10000). This resulted in tons of duplicates, therefore a hard limit of 5000 entries was implemented.
... sorry @kordianbruck you should not have trusted me! ;)

if (linkCounter > total_links) {
// worldcat only shows the first 5000 pages
console.log(linkCounter)
if (linkCounter == 500) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Theres a 0 missing here no?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nope, that's right as we get 10 entries per link, so thats 500*10=5000 in total.
But I just removed an unnecessary comment from the cli.js that is no longer needed (because earlier the script ran out of RAM )

@kordianbruck
Copy link
Contributor

@TimHenkelmann can we still merge this?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants