
Update README.

Stefano Cossu 4 years ago
commit 4b192334a2
1 changed file with 43 additions and 22 deletions:

README.md  (+43 −22)

@@ -1,28 +1,45 @@
-# LSUP_RDF
+# `lsup_rdf`
 
 **This project is work in progress.**
 
-This C library was initially meant to replace RDFLib dependency and Cython code
+Embedded RDF (and maybe later, generic graph) store and manipulation library.
+
+## Purpose
+
+The goal of this library is to provide extremely efficient and compact
+handling of RDF data. At least a C API and Python bindings are planned.
+
+This library can be thought of as SQLite or BerkeleyDB for graphs. It can be
+embedded directly in a program and store persistent data without the need to
+run a server.
+
+Two graph back ends are available: an in-memory one based on hash maps and a
+disk-based one built on [LMDB](https://symas.com/lmdb/), an extremely fast and
+compact embedded key-value store. Graphs can be created independently with
+either back end within the same program. Triples in the persistent back end are
+fully indexed and optimized for a balance of lookup speed, data compactness,
+and write performance (in order of importance).
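+
+For readers unfamiliar with LMDB, the sketch below shows the kind of embedded,
+serverless storage model it provides. Note that this is plain `liblmdb` usage,
+**not** the `lsup_rdf` API (which is still undefined); error checking and the
+creation of the `./testdb` directory are omitted for brevity.
+
+```c
+/* Minimal LMDB sketch: open an environment in-process, write one record,
+ * read it back. Plain liblmdb, not lsup_rdf; purely illustrative. */
+#include <stdio.h>
+#include <lmdb.h>
+
+int main(void)
+{
+    MDB_env *env;
+    MDB_txn *txn;
+    MDB_dbi dbi;
+    MDB_val key = {3, "spo"}, data = {5, "hello"}, found;
+
+    mdb_env_create(&env);
+    mdb_env_open(env, "./testdb", 0, 0664);      /* directory must exist */
+
+    mdb_txn_begin(env, NULL, 0, &txn);           /* write transaction    */
+    mdb_dbi_open(txn, NULL, 0, &dbi);
+    mdb_put(txn, dbi, &key, &data, 0);           /* keys are sorted bytes */
+    mdb_txn_commit(txn);
+
+    mdb_txn_begin(env, NULL, MDB_RDONLY, &txn);  /* read transaction     */
+    mdb_get(txn, dbi, &key, &found);
+    printf("%.*s\n", (int) found.mv_size, (char *) found.mv_data);
+    mdb_txn_abort(txn);
+
+    mdb_env_close(env);
+    return 0;
+}
+```
+
+LMDB keeps its data in a single memory-mapped file and returns pointers
+directly into the map on reads, which is what makes it both very fast and
+compact compared to a client/server store.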
+
+This library initially aimed to replace the RDFLib dependency and Cython code
 in [Lakesuperior](https://notabug.org/scossu/lakesuperior) in an effort to
-reduce code clutter and speed up RDF handling; it later became a project for an
-independent RDF library. Initial experiments with Redland as a more efficient
-replacement for RDFLib were not successful due to the complexity of the API and
-the lack of an adequate persistent back end.
+reduce code clutter and speed up RDF handling; it is now a project for an
+independent RDF library, but unless the contributor base expands, it will
+remain focused on serving Lakesuperior.
+
 
-This library makes two graph back ends available: a memory one based on hash
-maps and a disk-based one based on LMDB, an extremely fast and compact embedded
-key-store value. Graphs can be created independently with either back end
-within the  same program. Triples in the persistent back end are fully indexed
-and optimized for a balance of lookup speed, data compactness, and write
-performance (in order of importance).
+## Development Status
 
-The API is not yet defined and at the moment this project is only intended for
-curious developers and researchers.
+**Pre-alpha.** The API is not yet defined and may change radically. The code
+may not compile, or throw a fit when run. At the moment this project is only
+intended for curious developers and researchers.
 
 This is also my first stab at writing a C library (coming from Python) and an
 unpaid fun project, so don't be surprised if you find some gross stuff.
 
-## In Scope – Short Term
+
+## Road Map
+
+### In Scope – Short Term
 
 The short-term goal is to support usage in Lakesuperior and a workable set
 of features as a standalone library:
@@ -35,18 +52,22 @@ of features as a standalone library:
 - Validation of RDF triples
 - Fast graph lookup using matching patterns
 - Graph boolean operations
-- Serialization and de-serialization to/from N-Triples and Turtle
-- Compile-time configuration of max graph size (efficiency vs. huge capacity)
+- Serialization and de-serialization to/from N-Triples and N-Quads
+- Serialization and de-serialization to/from Turtle and TriG
+- Compile-time configuration of max graph size (efficiency vs. capacity)
 - Python bindings
+- Basic command line utilities
 
-## Possibly In scope – Long Term
+### Possibly In scope – Long Term
 
 - Binary serialization and hashing of graphs
 - Binary protocol for synchronizing remote replicas
 - Lua bindings
-- SPARQL queries (We'll see... Will definitely need help)
-- C++ compatibility (requires external contribution and use cases)
 
-## Likely Out of Scope
+### Likely Out of Scope
 
-- JSON-LD serialization
+(Unless provided and maintained by external contributors)
+
+- C++ bindings
+- JSON-LD de/serialization
+- SPARQL queries (We'll see... Will definitely need help)