Simple Image Presentation Interface (SIPI) - Introduction

What is SIPI?

1. A IIIF Image API V3 level 2 conformant image server

  • SIPI is a full multithreaded, high performance, level2 conformant IIIF written in C++. For the JPEG2000 implementation, it relies on the commercial kakadu-library, but otherwise it is completely open source on github. It offers special support for multipage PDF's (through a SIPI-specific extensions to the IIIF Image API).
  • SIPI has been designed with long term preservation for images regarded as culutral heritage in mind. Thus it offers some unique features fpr this purpose:
  • all file format conversions try to preserve all metadata (EXIF, XMP, IPTC etc.). These functionality is based on the open source exiv2 library.
  • SIPI can deal with and convert ICC color profiles based on the littlecms library.
  • SIPI can embed important preservation data such as the checksum of the pixel values, original filename etc. in the file headers.
  • it supports SSL (https://…)
  • SIPI embeds the scripting language LUA that allows a very flexible, highly customizable deployment that can be adapted to the enviroment SIPI is being used in. Before serving any request, a configurable LUA script ("pre flight script") is being executed that can check access rights, restrictions or other stuff. SIPI LUA has been extended with many SIPI-specific functions (including image conversion, HTTP-client etc.)

2. An ordinary HTTP webserver

  • SIPI is also a normal webserver that is able to deliver arbitrary files. It also implements LUA embedded into HTML pages.
  • Using SIPI LUA scripts and routing, RESTful interfaces may be implemented. E.g. image upload and conversions may by supported.

3. An image format conversion tool

Generic format conversions

  • image format conversion are supported between TIFF, JPEG2000, JPG, PNG and PDF (PDF with some limitations). SIPI can be used either as standalone command line tool or in server mode using LUA scripting. SIPI preserves most embedded metadata (EXIF, IPTC, TIFF, XMP) and is preserving and/or converting ICC color profiles.

Preservation metadata (SIPI specific)

  • SIPI is able to add SIPI specific metadata to most file formats. These metadata are relevant for long-term preservation and include the following information:
    • original filename: The original file name before conversion
    • original mimetype: The mimetype of the original image before conversion
    • pixel checksum: A checksum (e.g. SH256) of the original pixel values. This checksum can be used to verify that a format conversion didn't alter the image content.
    • icc profile: (optional) The raw ICC profile as binary string. This field is added if the fileformat has no standard way to embed ICC color profiles (e.g. JPEG).

4. Integrated sqlite3 Database

SIPI has an integrated sqlite3 database that can be used with special LUA extensions. Thus, SIPI can be used as a standalone media server with extended functionality. The sqlite3 database may be used to store metadata about images, user data etc.

Who is behind SIPI?

SIPI is developed and maintained by the "Data and Service Center for the Humanities" (DaSCH), a Swiss national research infrastructure financed by the Swiss National Science Foundation with contributions by the Universities of Basel and Lausanne.

How to get SIPI?

  • The easiest way is to use the docker image provided on dockerhup daschswiss/sipi. The dockerized version has the binary kakadu library compiled in.
  • You can compile SIPI from the sources on github. Since SIPI uses many third-party open source libraries, compiling Yourself is tedious and my be frustrating (but possible). You have to provide the licensed source of kakadu by Yourself. See kakadu software on how to get a licensed version of the kakadu code. SIPI should compile on Linux (Ubuntu) and (with some hand-work) OS X.

SIPI as IIIF-Server

Extensions to the IIIF-Standard

Access to PDF Pages

SIPI is able to deliver PDF's either as full file or as images using an extended IIIF-URL to access a specific page of a multipage PDF as image using the usual IIIF syntax with small extensions: - In case of a PDF, info.json includes a field numpages that indicates the total number of pages the PDF has - the image-ID given in the IIIF URL must incude a pagenumber specificer @pagenum with an integer between 1 and the maximum number of pages, e.g.,1000/default.jpg The given URL would return page #12 of the PDF test.pdf with a height of 1000 pixels. Thus, all IIIF URL parts will work as expected.

Access to non-image files

Sometimes it would be helpful to deliver non-image files such as XML, CSV etc. from the same directory tree as the IIIF-conformant images: - if the url has the form http(s)://{server}/{prefix}/{fileid}/info.json, SIPI returns a JSON containing information about the file. The JSON has the from - @context: "" - id: "http(s)://{server}/{prefix}/{fileid}" - mimeType: {mimetype} . Please note that SIPI determines the mimetype using the magic number. Due to the limitations thereof the mimetype cannot be determined exactly.