Legacy Lakesuperior code.

Stefano Cossu 695beb3847 PATCH fragments. 7 gadi atpakaļ
data 78dd52b09a Change `urn:fc*` namespaces to `info:fc*`. 7 gadi atpakaļ
doc 06608acbab Resurrect resource; set extract_imr strict by default. 7 gadi atpakaļ
etc.skeleton 544dc691e9 Connector and layout overhaul: configurable back ends (SPARQL + BerkeleyDB) 7 gadi atpakaļ
lakesuperior 695beb3847 PATCH fragments. 7 gadi atpakaļ
static 653a629afc Bare-bones Web UI to browse resources. 7 gadi atpakaļ
tests 695beb3847 PATCH fragments. 7 gadi atpakaļ
util 544dc691e9 Connector and layout overhaul: configurable back ends (SPARQL + BerkeleyDB) 7 gadi atpakaļ
.gitignore 2fdc1b902e Initial commit: some boilerplate borrowed from Combine, basic folder structure and documentation. 7 gadi atpakaļ
LICENSE 2fdc1b902e Initial commit: some boilerplate borrowed from Combine, basic folder structure and documentation. 7 gadi atpakaļ
README.md deec682ea6 Legacy pairtree split option (unsure if it will be necessary to 7 gadi atpakaļ
conftest.py 544dc691e9 Connector and layout overhaul: configurable back ends (SPARQL + BerkeleyDB) 7 gadi atpakaļ
fcrepo d49241b86b GUnicorn. 7 gadi atpakaļ
profiler.py 5ad49c7dd8 Profiling and testing tool. 7 gadi atpakaļ
requirements.txt de3e8dbc03 Build minimal requirements list with pipdeptree. 7 gadi atpakaļ
server.py 653a629afc Bare-bones Web UI to browse resources. 7 gadi atpakaļ

README.md

LAKEsuperior

LAKEsuperior is an experimental Fedora Repository implementation.

Basic concepts

LAKEsuperior aims at being an uncomplicated, efficient Fedora 4 implementation.

Key features:

  • Drop-in replacement for Fedora4 (with some caveats: see Delta document)—currently being tested with Hyrax 2
  • Stores metadata in a graph store, binaries in filesystem
  • Strives to be faster than Modeshape implementation (benchmarks TBD)
  • Simple search and SPARQL Query API via back-end triplestore (alpha 2)
  • No performance issues storing many resources under the same container; no kudzu pairtree segmentation [1]
  • Mitigates "many member" issue: constant performance writing to a resource with many children or members; option to omit children in retrieval
  • Flexible back-end layouts: options to organize information in back end
  • Migration tool (in alpha2)

Implementation of the official Fedora API specs (Fedora 5.x and beyond) is not foreseen in the short term, however it would be a natural evolution of this project if it gains support.

Please make sure you read the Delta document for divergences with the official Fedora4 implementation.

Alpha 1 application code consists of less than 2200 lines and strives to maintain a linear, intuitive code structure to foster collaboration. TODO link to tech overview and approach

Installation

Dependencies

  1. A triplestore. Fuseki is the benchmark used so far in development. Other implementations are possible as long as they support RDF 1.1 and SPARQL over HTTP
  2. A message broker supporting the STOMP protocol. If you have a separate instance of official Fedora listening to port 61613, that will do the job
  3. Python 3.5 or greater

Installation steps

  1. Install dependencies as indicated above
  2. Create a virtualenv in a project folder: virtualenv -p <python 3.5+ exec path> <virtualenv folder>
  3. Initialize the virtualenv: source <path_to_virtualenv>/bin/activate
  4. Clone this repo
  5. cd into repo folder
  6. Install dependencies: pip install -r requirements.txt
  7. Copy the etc.skeleton folder to a separate location
  8. Set the configuration folder location in the environment: export FCREPO_CONFIG_DIR=<your config dir location> (alternatively you can add this line to your virtualenv activate script)
  9. Configure the application
  10. Start your triplestore and STOMP broker
  11. Run util/bootstrap.py to initialize the binary and graph stores
  12. Run ./fcrepo for a multi-threaded server or flask run for a single-threaded development server

Production deployment

If you like fried repositories for lunch, deploy before 11AM.

Status and development

LAKEsuperior is in alpha status. Please see the TODO list for a rudimentary road map and status.

Further documentation

The design documents are in the doc/pdf folder. @TODO needs update

[1]: However if your client splits pairtrees upstream, such as Hyrax does, that obviously needs to change to get rid of the path segments.