README.rst 6.9 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194
  1. LAKEsuperior
  2. ============
  3. |build status|
  4. LAKEsuperior is an alternative `Fedora
  5. Repository <http://fedorarepository.org>`__ implementation.
  6. Guiding Principles
  7. ------------------
  8. LAKEsuperior aims at being an uncomplicated, efficient Fedora 4
  9. implementation.
  10. Its main goals are:
  11. - **Reliability:** Based on solid technologies with stability in mind.
  12. - **Efficiency:** Small memory and CPU footprint, high scalability.
  13. - **Ease of management:** Tools to perform monitoring and maintenance
  14. included.
  15. - **Simplicity of design:** Straight-forward architecture, robustness
  16. over features.
  17. Key features
  18. ------------
  19. - Drop-in replacement for Fedora4 (with some
  20. `caveats <docs/fcrepo4_deltas.md>`__); currently being tested
  21. with Hyrax 2
  22. - Very stable persistence layer based on
  23. `LMDB <https://symas.com/lmdb/>`__ and filesystem. Fully
  24. ACID-compliant writes guarantee consistency of data.
  25. - Term-based search (*planned*) and SPARQL Query API + UI
  26. - No performance penalty for storing many resources under the same
  27. container; no
  28. `kudzu <https://www.nature.org/ourinitiatives/urgentissues/land-conservation/forests/kudzu.xml>`__
  29. pairtree segmentation \ `1 <#f1>`__\
  30. - Extensible `provenance metadata <docs/model.md>`__ tracking
  31. - `Multi-modal
  32. access <docs/architecture.md#multi-modal-access>`__: HTTP
  33. (REST), command line interface and native Python API.
  34. - Fits in a pocket: you can carry 50M triples in an 8Gb memory stick.
  35. Implementation of the official `Fedora API
  36. specs <https://fedora.info/spec/>`__ (Fedora 5.x and beyond) is not
  37. foreseen in the short term, however it would be a natural evolution of
  38. this project if it gains support.
  39. Please make sure you read the `Delta
  40. document <docs/fcrepo4_deltas.md>`__ for divergences with the
  41. official Fedora4 implementation.
  42. Target Audience
  43. ---------------
  44. LAKEsuperior is for anybody who cares about preserving data in the long
  45. term.
  46. Less vaguely, LAKEsuperior is targeted at who needs to store large
  47. quantities of highly linked metadata and documents.
  48. Its Python/C environment and API make it particularly well suited for
  49. academic and scientific environments who would be able to embed it in a
  50. Python application as a library or extend it via plug-ins.
  51. LAKEsuperior is able to be exposed to the Web as a `Linked Data
  52. Platform <https://www.w3.org/TR/ldp-primer/>`__ server. It also acts as
  53. a SPARQL query (read-only) endpoint, however it is not meant to be used
  54. as a full-fledged triplestore at the moment.
  55. In its current status, LAKEsuperior is aimed at developers and hands-on
  56. managers who are interested in evaluating this project.
  57. Quick Install: Running in Docker
  58. --------------------------------
  59. You can run LAKEsuperior in Docker for a hands-off quickstart.
  60. `Docker <http://docker.com/>`__ is a containerization platform that
  61. allows you to run services in lightweight virtual machine environments
  62. without having to worry about installing all of the prerequisites on
  63. your host machine.
  64. 1. Install the correct `Docker Community
  65. Edition <https://www.docker.com/community-edition>`__ for your
  66. operating system.
  67. 2. Clone this repo:
  68. ``git clone https://github.com/scossu/lakesuperior.git``
  69. 3. ``cd`` into repo folder
  70. 4. Run ``docker-compose up``
  71. LAKEsuperior should now be available at ``http://localhost:8000/``.
  72. The provided Docker configuration includes persistent storage as a
  73. self-container Docker volume, meaning your data will persist between
  74. runs. If you want to clear the decks, simply run
  75. ``docker-compose down -v``.
  76. Manual Install (a bit less quick, a bit more power)
  77. ---------------------------------------------------
  78. **Note:** These instructions have been tested on Linux. They may work on
  79. Darwin with little modification, and possibly on Windows with some
  80. modifications. Feedback is welcome.
  81. Dependencies
  82. ~~~~~~~~~~~~
  83. 1. Python 3.5 or greater.
  84. 2. A message broker supporting the STOMP protocol. For testing and
  85. evaluation purposes, `CoilMQ <https://github.com/hozn/coilmq>`__ is
  86. included with the dependencies and should be automatically installed.
  87. Installation steps
  88. ~~~~~~~~~~~~~~~~~~
  89. 1. Create a virtualenv in a project folder:
  90. ``virtualenv -p <python 3.5+ exec path> <virtualenv folder>``
  91. 2. Activate the virtualenv: ``source <path_to_virtualenv>/bin/activate``
  92. 3. Clone this repo:
  93. ``git clone https://github.com/scossu/lakesuperior.git``
  94. 4. ``cd`` into repo folder
  95. 5. Install dependencies: ``pip install -r requirements.txt``
  96. 6. Start your STOMP broker, e.g.: ``coilmq &``. If you have another
  97. queue manager listening to port 61613 you can either configure a
  98. different port on the application configuration, or use the existing
  99. message queue.
  100. 7. Run ``./lsup-admin bootstrap`` to initialize the binary and graph
  101. stores
  102. 8. Run ``./fcrepo``.
  103. Configuration
  104. ~~~~~~~~~~~~~
  105. The app should run for testing and evaluation purposes without any
  106. further configuration. All the application data are stored by default in
  107. the ``data`` directory.
  108. To change the default configuration you should:
  109. 1. Copy the ``etc.skeleton`` folder to a separate location
  110. 2. Set the configuration folder location in the environment:
  111. ``export FCREPO_CONFIG_DIR=<your config dir location>`` (you can add
  112. this line at the end of your virtualenv ``activate`` script)
  113. 3. Configure the application
  114. 4. Bootstrap the app or copy the original data folders to the new
  115. location if any loction options changed
  116. 5. (Re)start the server: ``./fcrepo``
  117. The configuration options are documented in the files.
  118. **Note:** ``test.yml`` must specify a different location for the graph
  119. and for the binary stores than the default one, otherwise running a test
  120. suite will destroy your main data store. The application will issue an
  121. error message and refuse to start if these locations overlap.
  122. Production deployment
  123. ~~~~~~~~~~~~~~~~~~~~~
  124. If you like fried repositories for lunch, deploy before 11AM.
  125. Status and development
  126. ----------------------
  127. LAKEsuperior is in **alpha** status. Please see the `project
  128. issues <https://github.com/scossu/lakesuperior/issues>`__ list for a
  129. rudimentary road map.
  130. Contributing
  131. ------------
  132. This has been so far a single person’s off-hours project (with much
  133. input from several sides). In order to turn into anything close to a
  134. Beta release and eventually to a production-ready implementation, it
  135. needs some community love.
  136. Contributions are welcome in all forms, including ideas, issue reports,
  137. or even just spinning up the software and providing some feedback.
  138. LAKEsuperior is meant to live as a community project.
  139. Documentation
  140. -----------------------
  141. The documenation is maintained in `Read The Docs
  142. <http://lakesuperior.readthedocs.io/en/latest/>`__.
  143. --------------
  144. 1 However if your client splits pairtrees upstream, such as Hyrax does,
  145. that obviously needs to change to get rid of the path segments.
  146. `↩ <#a1>`__
  147. .. |build status| image:: http://img.shields.io/travis/scossu/lakesuperior/master.svg?style=flat
  148. :target: https://travis-ci.org/username/repo