%!s(int64=7) %!d(string=hai) anos · 97ea394c2b
--- a/README.rst
+++ b/README.rst
@@ -6,102 +6,18 @@ LAKEsuperior
 
															 LAKEsuperior is an alternative `Fedora
														
 
															 Repository <http://fedorarepository.org>`__ implementation.
														
 
															-Guiding Principles
														
 
															-------------------
														
 
															-
														
 
															-LAKEsuperior aims at being an uncomplicated, efficient Fedora 4
														
 
															-implementation.
														
 
															-
														
 
															-Its main goals are:
														
 
															+Documentation
														
 
															+-------------
														
 
															--  **Reliability:** Based on solid technologies with stability in mind.
														
 
															--  **Efficiency:** Small memory and CPU footprint, high scalability.
														
 
															--  **Ease of management:** Tools to perform monitoring and maintenance
														
 
															-   included.
														
 
															--  **Simplicity of design:** Straight-forward architecture, robustness
														
 
															-   over features.
														
 
															+The full documentation is maintained in `Read The Docs
														
 
															+<http://lakesuperior.readthedocs.io/>`__. Please refer to that for more info.
														
 
															-Key features
														
 
															+Installation
														
 
															 ------------
														
 
															--  Drop-in replacement for Fedora4 (with some
														
 
															-   `caveats <docs/fcrepo4_deltas.md>`__); currently being tested
														
 
															-   with Hyrax 2
														
 
															--  Very stable persistence layer based on
														
 
															-   `LMDB <https://symas.com/lmdb/>`__ and filesystem. Fully
														
 
															-   ACID-compliant writes guarantee consistency of data.
														
 
															--  Term-based search (*planned*) and SPARQL Query API + UI
														
 
															--  No performance penalty for storing many resources under the same
														
 
															-   container; no
														
 
															-   `kudzu <https://www.nature.org/ourinitiatives/urgentissues/land-conservation/forests/kudzu.xml>`__
														
 
															-   pairtree segmentation \ `1 <#f1>`__\ 
														
 
															--  Extensible `provenance metadata <docs/model.md>`__ tracking
														
 
															--  `Multi-modal
														
 
															-   access <docs/architecture.md#multi-modal-access>`__: HTTP
														
 
															-   (REST), command line interface and native Python API.
														
 
															--  Fits in a pocket: you can carry 50M triples in an 8Gb memory stick.
														
 
															-
														
 
															-Implementation of the official `Fedora API
														
 
															-specs <https://fedora.info/spec/>`__ (Fedora 5.x and beyond) is not
														
 
															-foreseen in the short term, however it would be a natural evolution of
														
 
															-this project if it gains support.
														
 
															-
														
 
															-Please make sure you read the `Delta
														
 
															-document <docs/fcrepo4_deltas.md>`__ for divergences with the
														
 
															-official Fedora4 implementation.
														
 
															-
														
 
															-Target Audience
														
 
															----------------
														
 
															-
														
 
															-LAKEsuperior is for anybody who cares about preserving data in the long
														
 
															-term.
														
 
															-
														
 
															-Less vaguely, LAKEsuperior is targeted at who needs to store large
														
 
															-quantities of highly linked metadata and documents.
														
 
															-
														
 
															-Its Python/C environment and API make it particularly well suited for
														
 
															-academic and scientific environments who would be able to embed it in a
														
 
															-Python application as a library or extend it via plug-ins.
														
 
															-
														
 
															-LAKEsuperior is able to be exposed to the Web as a `Linked Data
														
 
															-Platform <https://www.w3.org/TR/ldp-primer/>`__ server. It also acts as
														
 
															-a SPARQL query (read-only) endpoint, however it is not meant to be used
														
 
															-as a full-fledged triplestore at the moment.
														
 
															-
														
 
															-In its current status, LAKEsuperior is aimed at developers and hands-on
														
 
															-managers who are interested in evaluating this project.
														
 
															-
														
 
															-Quick Install: Running in Docker
														
 
															---------------------------------
														
 
															-
														
 
															-You can run LAKEsuperior in Docker for a hands-off quickstart.
														
 
															-
														
 
															-`Docker <http://docker.com/>`__ is a containerization platform that
														
 
															-allows you to run services in lightweight virtual machine environments
														
 
															-without having to worry about installing all of the prerequisites on
														
 
															-your host machine.
														
 
															-
														
 
															-1. Install the correct `Docker Community
														
 
															-   Edition <https://www.docker.com/community-edition>`__ for your
														
 
															-   operating system.
														
 
															-2. Clone this repo:
														
 
															-   ``git clone https://github.com/scossu/lakesuperior.git``
														
 
															-3. ``cd`` into repo folder
														
 
															-4. Run ``docker-compose up``
														
 
															-
														
 
															-LAKEsuperior should now be available at ``http://localhost:8000/``.
														
 
															-
														
 
															-The provided Docker configuration includes persistent storage as a
														
 
															-self-container Docker volume, meaning your data will persist between
														
 
															-runs. If you want to clear the decks, simply run
														
 
															-``docker-compose down -v``.
														
 
															-
														
 
															-Manual Install (a bit less quick, a bit more power)
														
 
															----------------------------------------------------
														
 
															-
														
 
															-**Note:** These instructions have been tested on Linux. They may work on
														
 
															-Darwin with little modification, and possibly on Windows with some
														
 
															-modifications. Feedback is welcome.
														
 
															+The following instructions are aimed at a manual install using this git
														
 
															+repository. For a hands-off install using Docker, see
														
 
															+:doc:`the setup documentation <setup>`.
														
 
															 Dependencies
														
 
															 ~~~~~~~~~~~~
														
@@ -129,43 +45,6 @@ Installation steps
 
															    stores
														
 
															 8. Run ``./fcrepo``.
														
 
															-Configuration
														
 
															-~~~~~~~~~~~~~
														
 
															-
														
 
															-The app should run for testing and evaluation purposes without any
														
 
															-further configuration. All the application data are stored by default in
														
 
															-the ``data`` directory.
														
 
															-
														
 
															-To change the default configuration you should:
														
 
															-
														
 
															-1. Copy the ``etc.skeleton`` folder to a separate location
														
 
															-2. Set the configuration folder location in the environment:
														
 
															-   ``export FCREPO_CONFIG_DIR=<your config dir location>`` (you can add
														
 
															-   this line at the end of your virtualenv ``activate`` script)
														
 
															-3. Configure the application
														
 
															-4. Bootstrap the app or copy the original data folders to the new
														
 
															-   location if any loction options changed
														
 
															-5. (Re)start the server: ``./fcrepo``
														
 
															-
														
 
															-The configuration options are documented in the files.
														
 
															-
														
 
															-**Note:** ``test.yml`` must specify a different location for the graph
														
 
															-and for the binary stores than the default one, otherwise running a test
														
 
															-suite will destroy your main data store. The application will issue an
														
 
															-error message and refuse to start if these locations overlap.
														
 
															-
														
 
															-Production deployment
														
 
															-~~~~~~~~~~~~~~~~~~~~~
														
 
															-
														
 
															-If you like fried repositories for lunch, deploy before 11AM.
														
 
															-
														
 
															-Status and development
														
 
															-----------------------
														
 
															-
														
 
															-LAKEsuperior is in **alpha** status. Please see the `project
														
 
															-issues <https://github.com/scossu/lakesuperior/issues>`__ list for a
														
 
															-rudimentary road map.
														
 
															-
														
 
															 Contributing
														
 
															 ------------
														
@@ -178,17 +57,8 @@ Contributions are welcome in all forms, including ideas, issue reports,
 
															 or even just spinning up the software and providing some feedback.
														
 
															 LAKEsuperior is meant to live as a community project.
														
 
															-Documentation
														
 
															------------------------
														
 
															-
														
 
															-The documenation is maintained in `Read The Docs
														
 
															-<http://lakesuperior.readthedocs.io/en/latest/>`__.
														
 
															-
														
 
															---------------
														
 
															-
														
 
															-1 However if your client splits pairtrees upstream, such as Hyrax does,
														
 
															-that obviously needs to change to get rid of the path segments.
														
 
															-`↩ <#a1>`__
														
 
															+See :doc:`related document <contributing>` for further details onhow to fork,
														
 
															+improve, document and test the project.
														
 
															 .. |build status| image:: http://img.shields.io/travis/scossu/lakesuperior/master.svg?style=flat
														
 
															    :target: https://travis-ci.org/username/repo
														
--- a/docs/architecture.rst
+++ b/docs/architecture.rst
@@ -39,10 +39,10 @@ jobs for example.
 
															 The Python API is divided in three main areas:
														
 
															--  `Resource API <../../lakesuperior/api/resource.py>`__. This API is in
														
 
															-   charge of all the resource CRUD operations and implements the
														
 
															-   majority of the Fedora specs.
														
 
															--  `Admin API <../../lakesuperior/api/admin.py>`__. This exposes utility
														
 
															-   methods, mostly long-running maintenance jobs.
														
 
															--  `Query API <../../lakesuperior/api/query.py>`__. This provides
														
 
															-   several facilities for querying repository data.
														
 
															+-  Resource API: this API in charge of all the resource CRUD operations and
														
 
															+   implements the majority of the Fedora specs.
														
 
															+-  Admin API: exposes utility methods, mostly long-running maintenance jobs.
														
 
															+-  Query API: provides several facilities for querying repository data.
														
 
															+
														
 
															+
														
 
															+See :doc:`API documentation<api>` for more details.
														
--- a/docs/conf.py
+++ b/docs/conf.py
@@ -53,7 +53,7 @@ master_doc = 'index'
 
															 # General information about the project.
														
 
															 project = 'lakesuperior'
														
 
															-copyright = '2018, Stefano Cossu'
														
 
															+copyright = '2018, Everybody & Nobody'
														
 
															 author = 'Stefano Cossu'
														
 
															 # The version info for the project you're documenting, acts as replacement for
														
@@ -61,7 +61,7 @@ author = 'Stefano Cossu'
 
															 # built documents.
														
 
															 #
														
 
															 # The short X.Y version.
														
 
															-version = '1.0.alpha'
														
 
															+version = '1.0-alpha'
														
 
															 # The full version, including alpha/beta/rc tags.
														
 
															 release = '1.0.0-alpha.8'
														
@@ -89,7 +89,7 @@ todo_include_todos = True
 
															 # The theme to use for HTML and HTML Help pages.  See the documentation for
														
 
															 # a list of builtin themes.
														
 
															 #
														
 
															-html_theme = 'alabaster'
														
 
															+html_theme = 'sphinx_rtd_theme'
														
 
															 # Theme options are theme-specific and customize the look and feel of a theme
														
 
															 # further.  For a list of options available for each theme, see the
														
--- a/docs/fcrepo4_deltas.rst
+++ b/docs/fcrepo4_deltas.rst
@@ -123,9 +123,9 @@ treated as a fully qualified identifier. The ``fcrepo:hasVersionLabel``
 
															 predicate, however ambiguous in this context, will be kept until the
														
 
															 adoption of Memento, which will change the retrieval mechanisms.
														
 
															-Also, if a POST is issued on the same resource ``fcr:versions`` location
														
 
															-using a version ID that already exists, LAKEsuperior will just mint a
														
 
															-random identifier rather than returning an error.
														
 
															+Another notable difference is that if a POST is issued on the same resource
														
 
															+``fcr:versions`` location using a version ID that already exists, LAKEsuperior
														
 
															+will just mint a random identifier rather than returning an error.
														
 
															 Deprecation track
														
 
															 -----------------
														
@@ -144,9 +144,6 @@ This should not pose a problem if a client does not have ``rest``
 
															 hard-coded in its code, but in any event, the ``/rest`` endpoint is
														
 
															 provided for backwards compatibility.
														
 
															-LAKEsuperior adds the (currently stub) ``query`` endpoint. Other
														
 
															-endpoints for non-LDP services may be opened in the future.
														
 
															-
														
 
															 Automatic LDP class assignment
														
 
															 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
														
--- a/docs/index.rst
+++ b/docs/index.rst
@@ -1,35 +1,133 @@
 
															-.. lakesuperior documentation master file, created by
														
 
															-   sphinx-quickstart on Sat Mar 24 23:05:46 2018.
														
 
															-   You can adapt this file completely to your liking, but it should at least
														
 
															-   contain the root `toctree` directive.
														
 
															-
														
 
															 LAKEsuperior
														
 
															-========================================
														
 
															+============
														
 
															+
														
 
															+|build status|
														
 
															+
														
 
															+LAKEsuperior is an alternative `Fedora
														
 
															+Repository <http://fedorarepository.org>`__ implementation.
														
 
															+
														
 
															+Fedora is a mature repository software system historically adopted by
														
 
															+major cultural heritage institutions. It exposes an
														
 
															+`LDP <https://www.w3.org/TR/ldp-primer/>`__ endpoint to manage
														
 
															+any type of binary files and their metadata in Linked Data format.
														
 
															+
														
 
															+Guiding Principles
														
 
															+------------------
														
 
															+
														
 
															+LAKEsuperior aims at being an uncomplicated, efficient Fedora 4
														
 
															+implementation.
														
 
															+
														
 
															+Its main goals are:
														
 
															+
														
 
															+-  **Reliability:** Based on solid technologies with stability in mind.
														
 
															+-  **Efficiency:** Small memory and CPU footprint, high scalability.
														
 
															+-  **Ease of management:** Tools to perform monitoring and maintenance
														
 
															+   included.
														
 
															+-  **Simplicity of design:** Straight-forward architecture, robustness
														
 
															+   over features.
														
 
															+
														
 
															+Key features
														
 
															+------------
														
 
															+
														
 
															+-  Drop-in replacement for Fedora4 (with some
														
 
															+   :doc:`caveats <fcrepo4_deltas>`); currently being tested
														
 
															+   with Hyrax 2
														
 
															+-  Very stable persistence layer based on
														
 
															+   `LMDB <https://symas.com/lmdb/>`__ and filesystem. Fully
														
 
															+   ACID-compliant writes guarantee consistency of data.
														
 
															+-  Term-based search (*planned*) and SPARQL Query API + UI
														
 
															+-  No performance penalty for storing many resources under the same
														
 
															+   container; no
														
 
															+   `kudzu <https://www.nature.org/ourinitiatives/urgentissues/land-conservation/forests/kudzu.xml>`__
														
 
															+   pairtree segmentation \ `1 <#f1>`__\ 
														
 
															+-  Extensible :doc:`provenance metadata <model>` tracking
														
 
															+-  :doc:`Multi-modal access <architecture>`: HTTP
														
 
															+   (REST), command line interface and native Python API.
														
 
															+-  Fits in a pocket: you can carry 50M triples in an 8Gb memory stick.
														
 
															+
														
 
															+Implementation of the official `Fedora API
														
 
															+specs <https://fedora.info/spec/>`__ (Fedora 5.x and beyond) is not
														
 
															+foreseen in the short term, however it would be a natural evolution of
														
 
															+this project if it gains support.
														
 
															+
														
 
															+Please make sure you read the :doc:`Delta
														
 
															+document <fcrepo4_deltas>` for divergences with the
														
 
															+official Fedora4 implementation.
														
 
															+
														
 
															+Target Audience
														
 
															+---------------
														
 
															+
														
 
															+LAKEsuperior is for anybody who cares about preserving data in the long
														
 
															+term.
														
 
															+
														
 
															+Less vaguely, LAKEsuperior is targeted at who needs to store large
														
 
															+quantities of highly linked metadata and documents.
														
 
															+
														
 
															+Its Python/C environment and API make it particularly well suited for
														
 
															+academic and scientific environments who would be able to embed it in a
														
 
															+Python application as a library or extend it via plug-ins.
														
 
															+
														
 
															+LAKEsuperior is able to be exposed to the Web as a `Linked Data
														
 
															+Platform <https://www.w3.org/TR/ldp-primer/>`__ server. It also acts as
														
 
															+a SPARQL query (read-only) endpoint, however it is not meant to be used
														
 
															+as a full-fledged triplestore at the moment.
														
 
															+
														
 
															+In its current status, LAKEsuperior is aimed at developers and hands-on
														
 
															+managers who are interested in evaluating this project.
														
 
															+
														
 
															+Status and development
														
 
															+----------------------
														
 
															+
														
 
															+LAKEsuperior is in **alpha** status. Please see the `project
														
 
															+issues <https://github.com/scossu/lakesuperior/issues>`__ list for a
														
 
															+rudimentary road map.
														
 
															+
														
 
															+Contributing
														
 
															+------------
														
 
															+
														
 
															+This has been so far a single person’s off-hours project (with much
														
 
															+input from several sides). In order to turn into anything close to a
														
 
															+Beta release and eventually to a production-ready implementation, it
														
 
															+needs some community love.
														
 
															+
														
 
															+Contributions are welcome in all forms, including ideas, issue reports,
														
 
															+or even just spinning up the software and providing some feedback.
														
 
															+LAKEsuperior is meant to live as a community project.
														
 
															+
														
 
															+--------------
														
 
															+
														
 
															+1 However if your client splits pairtrees upstream, such as Hyrax does,
														
 
															+that obviously needs to change to get rid of the path segments.
														
 
															+`↩ <#a1>`__
														
 
															+
														
 
															+.. |build status| image:: http://img.shields.io/travis/scossu/lakesuperior/master.svg?style=flat
														
 
															+   :target: https://travis-ci.org/username/repo
														
 
															+
														
 
															+Indices and tables
														
 
															+------------------
														
 
															+
														
 
															+* :ref:`genindex`
														
 
															+* :ref:`modindex`
														
 
															+* :ref:`search`
														
 
															 .. toctree::
														
 
															    :maxdepth: 2
														
 
															-   :caption: Contents:
														
 
															+   :caption: Contents
														
 
															+    Installation and Configuration <setup>
														
 
															     Architecture Overview <architecture>
														
 
															     Divergences from Fedora 4 <fcrepo4_deltas>
														
 
															-    Content Model <model>
														
 
															-    Messaging SPI <messaging>
														
 
															+    Messaging <messaging>
														
 
															     Migration Guide <migration>
														
 
															     Command Line Reference <cli>
														
 
															-    Storage Implementation <storage>
														
 
															     Performance Benchmarks <performance>
														
 
															-    API documentation <api>
														
 
															 .. toctree::
														
 
															-   :maxdepth: 3
														
 
															-   :caption: Technical notes:
														
 
															-
														
 
															-    notes/indexing_strategy
														
 
															-
														
 
															+   :maxdepth: 1
														
 
															+   :caption: In-depth tech & design
														
 
															-Indices and tables
														
 
															-==================
														
 
															-
														
 
															-* :ref:`genindex`
														
 
															-* :ref:`modindex`
														
 
															-* :ref:`search`
														
 
															+    Contributing <contributing>
														
 
															+    API documentation <api>
														
 
															+    Indexing Strategy <indexing_strategy>
														
 
															+    Storage Implementation <storage>
														
 
															+    Content Model <model>
														
--- a/docs/notes/indexing_strategy.rst
+++ b/docs/notes/indexing_strategy.rst
--- a/docs/setup.rst
+++ b/docs/setup.rst
@@ -0,0 +1,90 @@
 
															+Installation & Configuration
														
 
															+============================
														
 
															+
														
 
															+Quick Install: Running in Docker
														
 
															+--------------------------------
														
 
															+
														
 
															+You can run LAKEsuperior in Docker for a hands-off quickstart.
														
 
															+
														
 
															+`Docker <http://docker.com/>`__ is a containerization platform that
														
 
															+allows you to run services in lightweight virtual machine environments
														
 
															+without having to worry about installing all of the prerequisites on
														
 
															+your host machine.
														
 
															+
														
 
															+1. Install the correct `Docker Community
														
 
															+   Edition <https://www.docker.com/community-edition>`__ for your
														
 
															+   operating system.
														
 
															+2. Clone the LAKEsuperior git repository:
														
 
															+   ``git clone https://github.com/scossu/lakesuperior.git``
														
 
															+3. ``cd`` into repo folder
														
 
															+4. Run ``docker-compose up``
														
 
															+
														
 
															+LAKEsuperior should now be available at ``http://localhost:8000/``.
														
 
															+
														
 
															+The provided Docker configuration includes persistent storage as a
														
 
															+self-container Docker volume, meaning your data will persist between
														
 
															+runs. If you want to clear the decks, simply run
														
 
															+``docker-compose down -v``.
														
 
															+
														
 
															+Manual Install (a bit less quick, a bit more power)
														
 
															+---------------------------------------------------
														
 
															+
														
 
															+**Note:** These instructions have been tested on Linux. They may work on
														
 
															+Darwin with little modification, and possibly on Windows with some
														
 
															+modifications. Feedback is welcome.
														
 
															+
														
 
															+Dependencies
														
 
															+~~~~~~~~~~~~
														
 
															+
														
 
															+1. Python 3.5 or greater.
														
 
															+2. A message broker supporting the STOMP protocol. For testing and
														
 
															+   evaluation purposes, `CoilMQ <https://github.com/hozn/coilmq>`__ is
														
 
															+   included with the dependencies and should be automatically installed.
														
 
															+
														
 
															+Installation steps
														
 
															+~~~~~~~~~~~~~~~~~~
														
 
															+
														
 
															+1. Create a virtualenv in a project folder:
														
 
															+   ``virtualenv -p <python 3.5+ exec path> <virtualenv folder>``
														
 
															+2. Activate the virtualenv: ``source <path_to_virtualenv>/bin/activate``
														
 
															+3. Clone this repo:
														
 
															+   ``git clone https://github.com/scossu/lakesuperior.git``
														
 
															+4. ``cd`` into repo folder
														
 
															+5. Install dependencies: ``pip install -r requirements.txt``
														
 
															+6. Start your STOMP broker, e.g.: ``coilmq &``. If you have another
														
 
															+   queue manager listening to port 61613 you can either configure a
														
 
															+   different port on the application configuration, or use the existing
														
 
															+   message queue.
														
 
															+7. Run ``./lsup-admin bootstrap`` to initialize the binary and graph
														
 
															+   stores
														
 
															+8. Run ``./fcrepo``.
														
 
															+
														
 
															+Configuration
														
 
															+-------------
														
 
															+
														
 
															+The app should run for testing and evaluation purposes without any
														
 
															+further configuration. All the application data are stored by default in
														
 
															+the ``data`` directory.
														
 
															+
														
 
															+To change the default configuration you should:
														
 
															+
														
 
															+1. Copy the ``etc.skeleton`` folder to a separate location
														
 
															+2. Set the configuration folder location in the environment:
														
 
															+   ``export FCREPO_CONFIG_DIR=<your config dir location>`` (you can add
														
 
															+   this line at the end of your virtualenv ``activate`` script)
														
 
															+3. Configure the application
														
 
															+4. Bootstrap the app or copy the original data folders to the new
														
 
															+   location if any loction options changed
														
 
															+5. (Re)start the server: ``./fcrepo``
														
 
															+
														
 
															+The configuration options are documented in the files.
														
 
															+
														
 
															+**Note:** ``test.yml`` must specify a different location for the graph
														
 
															+and for the binary stores than the default one, otherwise running a test
														
 
															+suite will destroy your main data store. The application will issue an
														
 
															+error message and refuse to start if these locations overlap.
														
 
															+
														
 
															+Production deployment
														
 
															+---------------------
														
 
															+
														
 
															+If you like fried repositories for lunch, deploy before 11AM.