Stanford CoreNLP Server - Dockerized Alternative

This repository is dedicated to exposing and tracking a customized Docker Image of the traditional Stanford CoreNLP server. This build intends to be used as a dedicated service to provide for personal or small research teams.

A ready-to-use image from Docker Hub is provided, along with the deploy instructions and the possibility of downloading and customizing the image through the Dockerfile using simple build instructions. The current supported version is 4.5.2; however, it is possible to modify the VERSION variable via the Dockerfile.

Also, deployment testing is provided using plain HTTP via the curl command using the official python library stanza.

DISCLAIMER: this is not an official documentation guide.

Deploy Alternatives

To deploy the prebuilt docker image, two options are provided: use the docker-compose tool and the docker command from the CLI utility.

Deploy using Docker CLI

Directly run the 'docker' command like the following example. e.g., changing two variables. For more information, check the Official Documentation

docker run -e JAVA_XMX=12g -e ANNOTATORS=tokenize,ssplit,parse -p 9000:9000 d1egoprog/stanford-corenlp:4.5.2

Deploy using docker-compose

Download the prepared 'docker-compose.yaml' file from the repository via wget and execute the command using the utility.

wget https://raw.githubusercontent.com/d1egoprog/docker-stanford-corenlp/main/docker-compose.yaml
docker-compose up -d

Happy hacking!! 🖖🖖.

Testing the Installation

To check the functionality, you can open a web browser window to your docker-engine IP and the chosen service, e.g., PORT=9000, generally on localhost:9000.

Consuming by HTTP request

Also, to test your local or remote service, just send the following curl request into your preferred CLI.

curl --data 'The quick brown fox jumped over the lazy dog.' 'http://localhost:9000/?properties={%22annotators%22%3A%22tokenize%2Cssplit%2Cpos%22%2C%22outputFormat%22%3A%22json%22}' -o -

Consuming by Library

To use the official python library stanza a small example has been prepared in a jupyter notebook, stanza-example

Build Alternatives

For more information on the configuration and functionality of the Stanford OpenNLP Server use the official documentation.

Build using Docker CLI

To build the image locally, clone the repository:

git clone https://github.com/d1egoprog/stanford-corenlp-docker.git

Use the docker command CLI tool to build and run:

docker build -t stanford-corenlp:4.5.2 stanford-corenlp/.
docker run -p 9000:9000 stanford-corenlp:4.5.2

All the JVM parameters can be accessed by editing the Dockerfile and rebuilding the image by default; the parameters configured are:

ENV JAVA_XMX 8G
ENV ANNOTATORS all
ENV TIMEOUT_MILLISECONDS 60000
ENV THREADS 5
ENV MAX_CHAR_LENGTH 100000
ENV PORT 9000

If you do not want to edit the Dockerfile, environment variables can be overwritten from the docker run command, e.g., changing the JVM memory parameter JAVA_XMX to reserve more memory.

docker run -e JAVA_XMX=12g -p 9000:9000 stanford-corenlp:4.5.2

Build using docker-compose

If preferred, a docker-compose file is also available with the standard build from the docker file and an override configuration (same parameters from Dockerfile); this should be changed to set your desired annotators specific computing requirements. To run the service, run the command:

docker-compose -f build.yaml up -d

Or use the compose file to build the Docker image, storing the following into a new build.yaml file. Also is possible to override the variables, e.g., changing the JVM memory parameter JAVA_XMX to reserve more memory or changing ANNOTATORS task performed by the server.

version: '3.7'

services:
  stanford_corenlp:
    image: d1egoprog/stanford-corenlp:4.5.2
    ports:
      - "9000:9000"
    environment: 
      - JAVA_XMX=12G
      - ANNOTATORS=tokenize
    restart: always

Contact

If you have any questions in deployment or if any error is found, please contact me or open an issue. And contributing is always welcome. The Github repository Issues.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
stanford-corenlp		stanford-corenlp
test		test
LICENSE		LICENSE
README.md		README.md
build.yaml		build.yaml
docker-compose.yaml		docker-compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stanford CoreNLP Server - Dockerized Alternative

Deploy Alternatives

Deploy using Docker CLI

Deploy using docker-compose

Testing the Installation

Consuming by HTTP request

Consuming by Library

Build Alternatives

Build using Docker CLI

Build using docker-compose

Contact

About

Releases 2

Languages

License

d1egoprog/docker-stanford-corenlp

Folders and files

Latest commit

History

Repository files navigation

Stanford CoreNLP Server - Dockerized Alternative

Deploy Alternatives

Deploy using Docker CLI

Deploy using docker-compose

Testing the Installation

Consuming by HTTP request

Consuming by Library

Build Alternatives

Build using Docker CLI

Build using docker-compose

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Languages