has someone already written the javascript to scrape the hi-res tiles from the minnesota institute of art? they are rudely claiming they provide hi-res downloads which, um, no, not *close* to what they have in the viewer.

(I Need a print of this)

collections.artsmia.org/art/13

Follow

scraping procedure, long 

@maya Not in the brainspace to write code right now, but the size is encoded in the image's metadata at iiif.dx.artsmia.org/139987.jpg (number matches the item number in the original URL), in the width/height properties.

Each tile is then at iiif.dx.artsmia.org/139987.jpg, where first number is the item ID again, the URL segment after that is startX,startY,endX,endY and the URl segment after *that* is outputWidth,outputHeight (latter can be omitted like it is here, will default to the former).

If outputWidth is equal to endX-startX (and same for height/y), then you get the tile at the original resolution, if not then it gets scaled to the specified output size.

512 pixels seems to be maximum output dimensions per tile, but a loop to fetch all the tiles based on the width/height metadata + something like imagemagick/graphicsmagick should be able to stitch it back together

· · Web · 2 · 0 · 2

scraping procedure, long 

@joepie91 yeah, that's about what i figured :( I found their own tool github.com/artsmia/collection- and the relevant metadata github.com/artsmia/collection/ but the tool, interestingly, doesn't 403 – just gives an image with an error... not giving up yet :)

re: scraping procedure, long 

@joepie91 @maya here is the spec for this; it’s an open standard iiif.io/api/image/3.0/#21-imag

but keywords like `full` and `max` don’t seem to be working

re: scraping procedure, long 

@Lady they seem to use a different API for their internal use, and the README suggests they *mean* this only to be internal github.com/artsmia/collection-

still, I'll give 'em an email to see if they can hook me up. the blood collages are public domain (in letter and spirit) so I'm hopeful :)

re: scraping procedure, long 

@maya @Lady cc @huertanix museum-archival-digital tech

re: scraping procedure, long 

@maya yeah if you just email like “hey could you hook me up with the original for X public domain resource” i would be very surprised if they were like “no”

re: scraping procedure, long 

@maya i’m wondering if some of the resources in their collections have licensing restrictions which allow them to display them publicly but forbid duplication and that’s why their IIIF throws an error above a certain size

re: scraping procedure, long 

@Lady I have written this email! Thank you for giving me the courage to do so lol.

They have a policy where they don't provide hi res versions of material not in the public domain so maybe it's something there getting twisted

Sign in to participate in the conversation
Pixietown

Small server part of the pixie.town infrastructure. Registration is closed.