Legacy Lakesuperior code.

Stefano Cossu 9389677e46 Sphinx setup. 7 years ago
.github b4e058bf84 Contributing docs for Github. (#12) 7 years ago
data df6d039f9b Add timestamps to root node on bootstrap. 7 years ago
docker 3d7962ead4 Fix protocol in Docker config as well. 7 years ago
docs 9389677e46 Sphinx setup. 7 years ago
etc.defaults 36b40204d2 Stand up messaging. 7 years ago
lakesuperior a813ac161f Docstring conversion #3. 7 years ago
static 4090e51570 SPARQL query UI and API. 7 years ago
tests 341a29bc23 Merge development branch. 7 years ago
util d554979eee Several improvements to benchmark tool: 7 years ago
.gitignore f56960605f Zero-config startup; default locations in data directory. 7 years ago
.travis.yml 0c858807c4 Travis fiddling (#30) 7 years ago
Dockerfile 561784bf08 Add Dockerfile and support files (#13) 7 years ago
LICENSE 2fdc1b902e Initial commit: some boilerplate borrowed from Combine, basic folder structure and documentation. 7 years ago
README.rst af458f1604 Convert all .md docs to .rst. 7 years ago
code_of_conduct.md 84e744b073 Doc update. 7 years ago
conftest.py 6980366c72 Separate environments between inside and outside app context. 7 years ago
docker-compose.yml 82d8112911 Add reference to automated build (#28) 7 years ago
fcrepo 64e4d4fcc2 Add preload option to gunicorn. 7 years ago
lsup-admin 0c244601f7 Docstring conversion #1. 7 years ago
profiler.py 8554f845a3 Adapt profiler script to multi-modal access. 7 years ago
requirements.txt 11601d8b0b Merge branch 'master' into migration_tool 7 years ago
server.py b5c922a8fb Use a global variable rather than thread-local storage for env; 7 years ago

README.rst

LAKEsuperior
============

|build status|

LAKEsuperior is an alternative `Fedora
Repository `__ implementation.

Guiding Principles
------------------

LAKEsuperior aims at being an uncomplicated, efficient Fedora 4
implementation.

Its main goals are:

- **Reliability:** Based on solid technologies with stability in mind.
- **Efficiency:** Small memory and CPU footprint, high scalability.
- **Ease of management:** Tools to perform monitoring and maintenance
included.
- **Simplicity of design:** Straight-forward architecture, robustness
over features.

Key features
------------

- Drop-in replacement for Fedora4 (with some
`caveats `__); currently being tested
with Hyrax 2
- Very stable persistence layer based on
`LMDB `__ and filesystem. Fully
ACID-compliant writes guarantee consistency of data.
- Term-based search (*planned*) and SPARQL Query API + UI
- No performance penalty for storing many resources under the same
container; no
`kudzu `__
pairtree segmentation \ `1 <#f1>`__\
- Extensible `provenance metadata `__ tracking
- `Multi-modal
access `__: HTTP
(REST), command line interface and native Python API.
- Fits in a pocket: you can carry 50M triples in an 8Gb memory stick.

Implementation of the official `Fedora API
specs `__ (Fedora 5.x and beyond) is not
foreseen in the short term, however it would be a natural evolution of
this project if it gains support.

Please make sure you read the `Delta
document `__ for divergences with the
official Fedora4 implementation.

Target Audience
---------------

LAKEsuperior is for anybody who cares about preserving data in the long
term.

Less vaguely, LAKEsuperior is targeted at who needs to store large
quantities of highly linked metadata and documents.

Its Python/C environment and API make it particularly well suited for
academic and scientific environments who would be able to embed it in a
Python application as a library or extend it via plug-ins.

LAKEsuperior is able to be exposed to the Web as a `Linked Data
Platform `__ server. It also acts as
a SPARQL query (read-only) endpoint, however it is not meant to be used
as a full-fledged triplestore at the moment.

In its current status, LAKEsuperior is aimed at developers and hands-on
managers who are interested in evaluating this project.

Quick Install: Running in Docker
--------------------------------

You can run LAKEsuperior in Docker for a hands-off quickstart.

`Docker `__ is a containerization platform that
allows you to run services in lightweight virtual machine environments
without having to worry about installing all of the prerequisites on
your host machine.

1. Install the correct `Docker Community
Edition `__ for your
operating system.
2. Clone this repo:
``git clone https://github.com/scossu/lakesuperior.git``
3. ``cd`` into repo folder
4. Run ``docker-compose up``

LAKEsuperior should now be available at ``http://localhost:8000/``.

The provided Docker configuration includes persistent storage as a
self-container Docker volume, meaning your data will persist between
runs. If you want to clear the decks, simply run
``docker-compose down -v``.

Manual Install (a bit less quick, a bit more power)
---------------------------------------------------

**Note:** These instructions have been tested on Linux. They may work on
Darwin with little modification, and possibly on Windows with some
modifications. Feedback is welcome.

Dependencies
~~~~~~~~~~~~

1. Python 3.5 or greater.
2. A message broker supporting the STOMP protocol. For testing and
evaluation purposes, `CoilMQ `__ is
included with the dependencies and should be automatically installed.

Installation steps
~~~~~~~~~~~~~~~~~~

1. Create a virtualenv in a project folder:
``virtualenv -p ``
2. Activate the virtualenv: ``source /bin/activate``
3. Clone this repo:
``git clone https://github.com/scossu/lakesuperior.git``
4. ``cd`` into repo folder
5. Install dependencies: ``pip install -r requirements.txt``
6. Start your STOMP broker, e.g.: ``coilmq &``. If you have another
queue manager listening to port 61613 you can either configure a
different port on the application configuration, or use the existing
message queue.
7. Run ``./lsup-admin bootstrap`` to initialize the binary and graph
stores
8. Run ``./fcrepo``.

Configuration
~~~~~~~~~~~~~

The app should run for testing and evaluation purposes without any
further configuration. All the application data are stored by default in
the ``data`` directory.

To change the default configuration you should:

1. Copy the ``etc.skeleton`` folder to a separate location
2. Set the configuration folder location in the environment:
``export FCREPO_CONFIG_DIR=`` (you can add
this line at the end of your virtualenv ``activate`` script)
3. Configure the application
4. Bootstrap the app or copy the original data folders to the new
location if any loction options changed
5. (Re)start the server: ``./fcrepo``

The configuration options are documented in the files.

**Note:** ``test.yml`` must specify a different location for the graph
and for the binary stores than the default one, otherwise running a test
suite will destroy your main data store. The application will issue an
error message and refuse to start if these locations overlap.

Production deployment
~~~~~~~~~~~~~~~~~~~~~

If you like fried repositories for lunch, deploy before 11AM.

Status and development
----------------------

LAKEsuperior is in **alpha** status. Please see the `project
issues `__ list for a
rudimentary road map.

Contributing
------------

This has been so far a single person’s off-hours project (with much
input from several sides). In order to turn into anything close to a
Beta release and eventually to a production-ready implementation, it
needs some community love.

Contributions are welcome in all forms, including ideas, issue reports,
or even just spinning up the software and providing some feedback.
LAKEsuperior is meant to live as a community project.

Documentation
-----------------------

The documenation is maintained in `Read The Docs
`__.

--------------

1 However if your client splits pairtrees upstream, such as Hyrax does,
that obviously needs to change to get rid of the path segments.
`↩ <#a1>`__

.. |build status| image:: http://img.shields.io/travis/scossu/lakesuperior/master.svg?style=flat
:target: https://travis-ci.org/username/repo