It looks like you're offline.
Open Library logo
additional options menu
Last edited by AgentSapphire
November 21, 2009 | History

Open Library Data


[API][1] | [Docs][2] | Data | [Hub][4] | [Terms of Use][5]  •  [Bugs][6] | [Code][7]

Open Library is a repository for a vast wealth of data, and we encourage new sources and are always eager to form new partnerships. Since the project's inception, we have developed a process for working with new data feeds, and merging it into our current system.

Our Process

Aside from some special cases (e.g. lists of ISBNs, book covers, holdings data), we take each data sources, write a processor for it, and output Python dictionaries. As records are added, an algorithm detects whether the book is already represented in the database. In that case, some new fields from the incoming record may be added to the record in the database, such as additional identifiers, new subjects, and tables of contents. The success of determining duplicates depends on the quality and accuracy of the data in the records. We hope to make it easy to merge duplicates manually through the user interface so that Open Library users can do what the algorithm cannot.

We are currently analyzing relationships between works (example: all of these editions of Tom Sawyer are all editions the same conceptual work). From this we can add relationships to each object and create new objects (like works). This process is known in the library world as "FRBRization". See http://frbr.org for more information.

Bulk Upload

If you want to send your bibliographic records to the Open Library, please follow these few suggestions:

How to upload

BookServer

BookServer is an Internet Archive initiative intended to enable content creators and distributors to distribute digital books via a simple catalog format. At Open Library, we are excited about BookServer as it enables anyone to set up their own shingle and bring attention to a subset of books they specialize in. The Internet Archive is providing an open source OPDS aggregator as part of the bookserver project.

For More Information

Example Catalogs

BookServer is a useful mechanism for aggregating feeds and identifying books that are classified with different identifiers. If you create a catalog, there are a few rules-of-thumb to follow to ensure your catalogs will be included and your books properly identified.

  1. Provide crawlable feeds
  2. Provide identifiers that can be used for de-duping (ISBN, etc)
  3. Provide additional metadata aggregators can often understand a 'fuzzy' match - throw those obscure identifiers out there!

BookServer is a work in progress. We invite you to read the spec and get involved in the mailing list here: http://code.google.com/p/openpub/wiki/OPDS

History

October 2, 2023 Edited by AgentSapphire update frbr url
May 19, 2022 Edited by raybb simpler signup link
May 2, 2022 Edited by Mek Edited without comment.
May 2, 2022 Edited by Mek Edited without comment.
November 21, 2009 Created by George Added new Data page