Legacy Lakesuperior code.

Stefano Cossu 9dd67c86f5 Query for all graphs if default graph is given. 7 tahun lalu
data dc50b0a51d Make extract_imr compatible with bdb back end; add RDF types for resource graphs. 7 tahun lalu
doc 426da2e198 Add stats page and stub admin tools page. 7 tahun lalu
etc.skeleton 0db6d2b584 Remove deadlock by using async workers in GUnicorn. 7 tahun lalu
lakesuperior 9dd67c86f5 Query for all graphs if default graph is given. 7 tahun lalu
static 4090e51570 SPARQL query UI and API. 7 tahun lalu
tests 3844bbfad1 Place lookup_1bound and lookup_2bound in main class; merge default graph. 7 tahun lalu
util f6cceb8da1 Close store outside of store layout bootstrap() method 7 tahun lalu
.gitignore 2fdc1b902e Initial commit: some boilerplate borrowed from Combine, basic folder structure and documentation. 7 tahun lalu
LICENSE 2fdc1b902e Initial commit: some boilerplate borrowed from Combine, basic folder structure and documentation. 7 tahun lalu
README.md 00e4c33cc4 Remove SPARQL connector; update documentation. 7 tahun lalu
conftest.py 73f0d191e0 Put test_delete_from_ctx on hold (inconsistent results). 7 tahun lalu
fcrepo d49241b86b GUnicorn. 7 tahun lalu
profiler.py 854b6e4296 Update performance doc. 7 tahun lalu
requirements.txt de3e8dbc03 Build minimal requirements list with pipdeptree. 7 tahun lalu
server.py 653a629afc Bare-bones Web UI to browse resources. 7 tahun lalu

README.md

LAKEsuperior

LAKEsuperior is an experimental Fedora Repository implementation.

Guiding Principles

LAKEsuperior aims at being an uncomplicated, efficient Fedora 4 implementation.

Its main goals are:

  • Simplicity of design: LAKEsuperior relies on LMDB, an embedded, high-performance key-value store, for storing metadata and on the filesystem to store binaries.
  • Efficiency: while raw speed is important, LAKEsuperior also aims at being conservative with resources. Its memory and CPU footprint are small. Python C extensions are used where possible to improve performance.
  • Reliability: fully ACID-compliant writes guarantee consistency of data.
  • Ease of management: Contents can be queried directly via term search or SPARQL without the aid of external indices. Scripts and interfaces for repository administration and monitoring are shipped with the standard release.
  • Portability: aims at maintaining a minimal set of dependencies.

Key features

  • Drop-in replacement for Fedora4 (with some caveats: see Delta document)—currently being tested with Hyrax 2
  • Term-based search (planned) and SPARQL Query API + UI
  • No performance penalty for storing many resources under the same container; no kudzu pairtree segmentation 1
  • Constant performance writing to a resource with many children or members; option to omit children in retrieval
  • Migration tools (planned)
  • Python API (planned): Authors of Python clients can use LAKEsuperior as an embedded repository with no HTTP traffic or interim RDF serialization & de-serialization involved.

Implementation of the official Fedora API specs (Fedora 5.x and beyond) is not foreseen in the short term, however it would be a natural evolution of this project if it gains support.

Please make sure you read the Delta document for divergences with the official Fedora4 implementation.

Installation

Dependencies

  1. Python 3.5 or greater.
  2. The LMDB database library. It should be included in most Linux distributions' standard package repositories.
  3. A message broker supporting the STOMP protocol. For testing and evaluation purposes, Coilmq is included in the dependencies and should be automatically installed.

Installation steps

  1. Install dependencies as indicated above
  2. Create a virtualenv in a project folder: virtualenv -p <python 3.5+ exec path> <virtualenv folder>
  3. Initialize the virtualenv: source <path_to_virtualenv>/bin/activate
  4. Clone this repo
  5. cd into repo folder
  6. Install dependencies: pip install -r requirements.txt
  7. Copy the etc.skeleton folder to a separate location
  8. Set the configuration folder location in the environment: export FCREPO_CONFIG_DIR=<your config dir location> (alternatively you can add this line to your virtualenv activate script)
  9. Configure the application
  10. Start your STOMP broker
  11. Run util/bootstrap.py to initialize the binary and graph stores
  12. Run ./fcrepo for a multi-threaded server or flask run for a single-threaded development server

Production deployment

If you like fried repositories for lunch, deploy before 11AM.

Status and development

LAKEsuperior is in alpha status. Please see the TODO list for a rudimentary road map and status.

Further documentation

Miscellaneous documents are in the doc folder. They will be organized and linked better some day.


1 However if your client splits pairtrees upstream, such as Hyrax does, that obviously needs to change to get rid of the path segments.