
PDF Trio - GitHub
We address these challenges with an ensemble of classifiers that use confidence values to cover all the cases. There are still some edge cases, but the incidence rate is at most a few percent.
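The snippet above does not say how the ensemble combines its classifiers, so the following is only a minimal sketch of one common pattern: try classifiers in order, accept the first answer whose confidence clears a threshold, and otherwise fall back to the most confident answer seen. The function names, the labels, the toy heuristics, and the 0.8 threshold are all assumptions made for illustration and are not taken from pdf_trio.

```python
from typing import Callable, List, Optional, Tuple

# Assumption: each classifier returns (label, confidence in [0, 1]).
Classifier = Callable[[str], Tuple[str, float]]


def classify_with_ensemble(
    doc: str,
    classifiers: List[Classifier],
    threshold: float = 0.8,  # hypothetical cut-off, not from the source
) -> Tuple[Optional[str], float]:
    """Return the first confident answer, else the most confident one."""
    best_label: Optional[str] = None
    best_conf = 0.0
    for clf in classifiers:
        label, conf = clf(doc)
        if conf >= threshold:
            # A confident classifier settles the case immediately.
            return label, conf
        if conf > best_conf:
            # Remember the strongest low-confidence answer as a fallback.
            best_label, best_conf = label, conf
    return best_label, best_conf


# Hypothetical toy classifiers, for demonstration only.
def url_heuristic(doc: str) -> Tuple[str, float]:
    return ("research", 0.95) if "arxiv" in doc else ("unknown", 0.3)


def text_heuristic(doc: str) -> Tuple[str, float]:
    return ("research", 0.6) if "abstract" in doc.lower() else ("other", 0.55)


if __name__ == "__main__":
    ensemble = [url_heuristic, text_heuristic]
    print(classify_with_ensemble("arxiv.org/abs/1", ensemble))
    print(classify_with_ensemble("Abstract: we study...", ensemble))
```

Ordering the classifiers from cheapest to most expensive lets a cheap, confident answer skip the costlier checks, while the fallback keeps edge cases from going unlabeled.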
Synchronize prod's edition.json with repo's config_edition.page #2062
Challenges Note: the changes are committed in json format but the wiki only seems to support this page being edited as yml by an admin. 😖
Index normalized author name in solr #178 - GitHub
Mar 26, 2013 · One of the current challenges is that solr takes a while to update, and it's becoming increasingly difficult to keep our tiny solr instance sync'd with IA's borrow availability data.
Stage ISBNdb Imports & Enable JIT Importing · Issue #7658 ... - GitHub
Mar 15, 2023 · Challenges: Slow: the dataset is too large for us to batch_import in a reasonable amount of time. Performance: importing 30M records may impact solr + db + site performance. Noise: …
Import 3.5k Open Access Programming Books #10519 - GitHub
Mar 1, 2025 · You'll notice we may have several challenges to think through, because the metadata we have likely doesn't include publication date, book cover, or several other attributes we likely want!
Prototype: Enter a URL -> Book Import (Preview) #9405 - GitHub
Jun 6, 2024 · mekarpeles opened this issue Jun 6, 2024 · 4 comments …
dweb-archive/docs/archive_architecture_ipfs.md at master ...
Files added via urlstore aren't advertised to the DHT and can't be added due to scaling challenges on IPFS gateways. Streams retrieved always return true even when not working, so can't fall back
Book Donations Flow · Issue #4398 · internetarchive/openlibrary
One cost, however, which has become especially evident due to additional challenges posed by the pandemic, is that sponsorship poses logistical challenges and constraints which do not exist within our …
Replace `docker-compose` with `docker compose` · Issue #7594
Mar 3, 2023 · If together we can solve the problem, that would help us better understand challenges people run into as well and we can update the documentation to reflect what we've learned.