Scheduled DB Maintenance: January 21st - 8:00 AM to 10:00 AM. Confluence will be unavailable during this time.

Navigation:
Documentation
Archive



Page Tree:

Child pages
  • CMIS Types and Paths Map for Book Model -- WORKING DRAFT

This wiki space contains archival documentation of Project Bamboo, April 2008 - March 2013.

Skip to end of metadata
Go to start of metadata

Folder Structure

"The book's title or source repository identifier"  - bamboo:book

  • "book" - bamboo:folder
    • "pages" - bamboo:folder
      • "item identifier-page ordinal" - bamboo:page
      • "item identifier-page ordinal" - bamboo:page
      • ...
    • "source" - bamboo:folder

Summary of Types

These are the possible values for the cmis:objectTypeId CMIS property.

See Book Model (Draft) for a more detailed description of the book model.

Book Information

bamboo:book

Folder containing all content relating to a specific book or text

bamboo:contents

Atom document describing a view of the organization of the book's text

bamboo:book-tei

TEI transcription of the book

bamboo:volume-plaintext

Plain text transcription of the book

bamboo:folder

Structural folder within the Book model, such as for 'book', 'pages', and 'source' folders

Page Information

bamboo:page

Folder containing the representations for the given page (image, XHTML, TEI, plain text, etc.)

     This folder should have the following properties:

        bamboo:seq
        The sequence number of the page.

        bamboo:page
        The page number of the page.

        bamboo:label
        The label to be displayed for the page.

        bamboo:owner
        The account name for the owner of this page

Page representations Information

bamboo:page-image

A high quality jpg image of a book page

bamboo:page-thumb150

Thumbnail for page as a jpeg image with width=150 pixels. This may be a reduced version of the page-image (if available) or a rendered version of page text.

bamboo:page-image-jp2

JPeg2000 image of the page, such as provided from HathiTrust. 

bamboo:page-xhtml

Semantic XHTML (that is without formatting tags) of a book's page text. CSS classes will be mappable to TEI elements.

bamboo:page-tei

TEI markup of the page..

bamboo:page-morphadorned

TEI markup of the page with word-level morphological markup...

bamboo:page-plaintext

Plain text transcript of the page, such as from OCR process..

bamboo:page-djvuxml

DjVu markup transcript of the page..

Source Content -- Downloaded from contributing repository

CMIS path bookobject cmis:path/source.

bamboo:source-mets

METS document describing the Book as supplied by contributing repository

bamboo:source-page-image

A scanned image of a page (any image format) as supplied by contributing repository

bamboo:source-page-ocr

Raw OCR text of a book page as supplied by a contributing repository

bamboo:source-page-xml

XML text and/or metadata for a book page as supplied by a contributing repository

CMIS Type XML (Preliminary)

See attached files.

Bamboo Type support by Repository and CI Connector

Below is a matrix showing connector/repository support for each bamboo type. "Repository" means that particular type is obtained from the repository. "Connector" means that particular type is manufactured by the connector from other content obtained from the repository. For example, the Perseus connector creates plaintext, xhtml, and TEI xml pages from the Perseus source TEI transcript. "Connector" in italics indicates a feature planned for the connector that hasn't yet been implemented. A type that is underlined for a given repository indicates that  type will be used on a demonstrator this December. Blank cells indicate that type is not supported by the specific connector at this time (for example, page-morphadorned from Hathi). This usually means we don't have a use-case that requires that feature or haven't discussed the need for it.

cmis:objectTypeId property

mime-type

TCP (UIUC)

Hathi

Perseus

bamboo:page-plaintext

text/plain

repository

connector

connector

bamboo:page-xhtml

text/html

repository

 

connector

bamboo:page-tei

text/xml

repository

connector

connector

bamboo:page-morphadorned

text/xml

repository

 

 

bamboo:page-image

image/jpeg

repository

connector

 


 

bamboo:page-thumb150

image/jpeg

repository

connector

 

 

 

bamboo:book-tei

text/xml

repository

 

repository

bamboo:book-plaintext

text/plain

connector

connector

connector

bamboo:source-mets

text/xml

 

 

 

bamboo:source-aggregate

application/zip

 

repository

 

bamboo:source-bib-marc

application/json

 

repository

 

bamboo:source-page-image-jp2

image/jp2

 

repository

 

bamboo:source-page-xml

text/xml

 

 

 

bamboo:source-page-ocr

text/plain

 

 

 

  • No labels