Library of Congress script transliterator.
|
2 سال پیش | |
---|---|---|
data | 2 سال پیش | |
transliterator | 2 سال پیش | |
.gitignore | 2 سال پیش | |
Dockerfile | 2 سال پیش | |
LICENSE | 2 سال پیش | |
NOTES.md | 2 سال پیش | |
README.md | 2 سال پیش | |
TODO.md | 2 سال پیش | |
entrypoint.sh | 2 سال پیش | |
requirements.txt | 2 سال پیش | |
uwsgi.ini | 2 سال پیش | |
wsgi.py | 2 سال پیش |
REST API service to convert non-Latin scripts to Latin, and vice versa.
Build container in current dir:
docker build -t transliterator:latest .
Start container:
docker run -e TXL_FLASK_SECRET=changeme -p 8000:8000 transliterator:latest
/
renders a simple HTML form to test the transliteration service.
GET /health
Useful endpoint for health checks.
200 OK
if the service is running.
GET /languages
List all the languages supported.
200 OK
MIME type: application/json
Content: a JSON object of the supported language tables. Keys are the keywords
used throughout the API, e.g. for /transliterate
. Each key is paired with an
object that contains some basic metadata about the language features. At the
moment, only the human-readable name is available.
GET /table/<lang>
Dump a language table.
<lang>
: Language code as given by the /languages
endpoint.200 OK
MIME type: application/json
Content: language configuration as a JSON object with all the transliteration rules as they are read by the application. If the table inherits from a parent, the computed values from the merged tables are shown.
POST /transliterate/<lang>[/r2s]
Transliterate an input string in a given language.
<lang>
: Language code as given by the /languages
endpoint.r2s
: if appended to the URI, the transliteration is intended to be
Roman-to-script, and the input string should be Latin text. If not, the
default behavior is followed, which is interpreting the input as a script
in the given language, and returning the Romanized text.text
: Input text to be transliterated.200 OK
on successful operation.400 Bad Request
for an invalid request. The reason for the failure is
normally printed in the response body.MIME Type: text/plain
Content: transliterated string. Characters not found in the mapping are copied verbatim (see "Configuration files" section for more information).
TODO