do not store/render images for bots #2046

Open
bwsevelyn opened this issue Sep 4, 2024 · 8 comments
Labels
bug (Something isn't working), enhancement (New feature or request)

Comments

@bwsevelyn

When a generally benign bot makes requests to the Aspen server, the medium and small cover images it triggers are saved locally. Normally this caching is great because it makes loading faster for patrons, but over time it eats up space on Aspen servers, and not necessarily for covers that actual patrons search for often.

If crawling by bots only returns data and not images, this could help save space on servers while at the same time ensuring that real patron searches are treated as originally intended.
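As a rough illustration of the idea, the sketch below serves a cover to any requester but only writes it to the local cache when the request does not look like it came from a bot. It is written in Python purely for illustration and is not Aspen's actual code; `BOT_MARKERS`, `is_probable_bot`, `get_cover`, and the `generate` callback are all hypothetical names.

```python
from pathlib import Path
from typing import Callable

# Hypothetical substrings used to spot crawlers; a real deployment
# would maintain its own list of user agents.
BOT_MARKERS = ("bot", "crawler", "spider")


def is_probable_bot(user_agent: str) -> bool:
    """Return True if the user agent contains a known crawler marker."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)


def get_cover(
    record_id: str,
    size: str,
    user_agent: str,
    cache_dir: Path,
    generate: Callable[[str, str], bytes],
) -> bytes:
    """Return cover bytes, storing them on disk only for non-bot requests."""
    cached = cache_dir / f"{record_id}_{size}.png"
    if cached.exists():
        # Anything already cached is served to everyone, bots included.
        return cached.read_bytes()

    data = generate(record_id, size)
    if not is_probable_bot(user_agent):
        # Only requests that look like real patrons populate the cache.
        cache_dir.mkdir(parents=True, exist_ok=True)
        cached.write_bytes(data)
    return data
```

With this shape, a crawler still receives the image, but a crawl alone never grows the cover cache.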

@kylemhall added the bug and enhancement labels Sep 4, 2024
@mdnoble73
Member

Some bots are specifically looking for images. Is the intention to not serve those bots, or to treat them differently?

@kylemhall
Member

What do these bots do? Do they benefit the libraries?

@mdnoble73
Member

mdnoble73 commented Sep 4, 2024

Bing has an image search, as does Google, I believe. I'm not sure about the benefit to libraries, although libraries are always in favor of getting their catalog into Google. Perhaps a switch to enable or disable this would be appropriate. I'd say a simple switch would be the MVP; the best version would be a configurable normal / don't cache / don't serve option on a per-bot basis, with a default.
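A minimal sketch of how such a per-bot setting might be resolved is shown below. It is Python for illustration only and does not reflect Aspen's real configuration; the `CoverPolicy` enum, the `BOT_POLICIES` table (including the made-up `examplebot` entry), the generic markers, and the chosen default are all assumptions.

```python
from enum import Enum


class CoverPolicy(Enum):
    NORMAL = "normal"          # serve and cache covers as usual
    DONT_CACHE = "dont_cache"  # serve covers but do not store them
    DONT_SERVE = "dont_serve"  # do not return covers at all


# Hypothetical default applied to bots that have no explicit entry.
DEFAULT_BOT_POLICY = CoverPolicy.DONT_CACHE

# Hypothetical per-bot overrides, keyed by a lowercase user agent substring.
BOT_POLICIES = {
    "googlebot": CoverPolicy.NORMAL,
    "bingbot": CoverPolicy.NORMAL,
    "examplebot": CoverPolicy.DONT_SERVE,
}

# Hypothetical generic markers for bots that are not listed above.
GENERIC_BOT_MARKERS = ("bot", "crawler", "spider")


def cover_policy_for(user_agent: str) -> CoverPolicy:
    """Pick the cover policy for a request based on its user agent."""
    ua = user_agent.lower()
    # Explicit per-bot settings take priority.
    for marker, policy in BOT_POLICIES.items():
        if marker in ua:
            return policy
    # Unlisted bots fall back to the default bot policy.
    if any(marker in ua for marker in GENERIC_BOT_MARKERS):
        return DEFAULT_BOT_POLICY
    # Everything else is treated as a regular patron.
    return CoverPolicy.NORMAL
```

The MVP switch would correspond to having only the default and no per-bot table; the table is what turns it into the fuller per-bot version.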

@kylemhall
Member

Considering those images are from other sources I cannot imagine they create much value as opposed to the record data.

@mdnoble73
Member

Libraries can already block content by user agent, so perhaps we could extend that with an option to block covers separately from content, or to choose what is blocked for a given user agent?

@kylemhall
Member

Yes

@lizrea
Contributor

lizrea commented Sep 5, 2024

100% by user agent would be amazing

@bwsevelyn
Author

thank you, yes, by user agent would be great!

4 participants