Skip to content

Conversation

@adelavega
Copy link
Contributor

@adelavega adelavega commented Oct 17, 2025

They can already be processed SpringerSource with minor tweaks. Will add ~2k articles

Also Adding these sources:

  • AmerPsych
  • MDPI
  • Second Sage source

This PR also saves HTML in database and outputs it optionally to a directory

@adelavega adelavega changed the title Add Nat and BMC Add new sources for: Nature, BMC, AmerPsych, MDPI and Sage2 Oct 18, 2025
@adelavega
Copy link
Contributor Author

Tests are failing because HTML can't be legal fetched using the tests

@adelavega adelavega merged commit 02aa72d into master Oct 23, 2025
1 check failed
@jdkent jdkent mentioned this pull request Oct 23, 2025
@jdkent
Copy link
Collaborator

jdkent commented Oct 23, 2025

I fetched the text, sue me. We can move cassettes to a private source and download them at test time, just seems more of a pain than it's worth.

@jdkent jdkent added this to Planning Dec 19, 2025
@jdkent jdkent moved this to Done in Planning Dec 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants