Skip to content
This repository has been archived by the owner on Mar 19, 2021. It is now read-only.

Releases: stickeritis/sticker

0.11.1

10 Jun 12:27
Compare
Choose a tag to compare
Bump version to 0.11.1

0.11.0

28 Mar 18:37
Compare
Choose a tag to compare
Bump version to 0.11.0

0.10.0: Revert "Generate Rust files from protobuf files on each build"

18 Oct 12:19
Compare
Choose a tag to compare
This reverts commit f64a16af7ef2b77fa69019486720861fd42e38a4.

Add support for pipelines

09 Oct 13:56
Compare
Choose a tag to compare
  • Add Pipeline data structure to sticker-utils that allows loading of several models, that are then applied in sequence.
  • Update sticker-tag and sticker-server to support annotation pipelines. These commands now accept multiple model configuration files as arguments and will apply these models in sequence when tagging.
  • Add the --input and --output options to sticker-tag. Since we cannot disambiguate between model configuration arguments and input/output corpora, the input and output files are now specified through these two flags. If these flags are not used, stdin/stdout will be used as before.
  • The address/port to bind to is now specified using the addr option of sticker-server. By default, sticker-server will bind to localhost:4000.

Updated CoNLL-X dependency

09 Oct 13:52
Compare
Choose a tag to compare

Update the CoNLL-X dependency to version 0.12. Unfortunately, this requires bumping the version, since this is an API-breaking change.

Reduce model load time with quantized embeddings

09 Oct 13:51
Compare
Choose a tag to compare

This release contains one large change: the loading of quantized models is speeded up by computing the unknown word embedding as an avarage of the subquantizers, rather than an average of all in-vocab word embeddings.

0.6.1

17 Aug 18:58
Compare
Choose a tag to compare
Bump version to 0.6.1

Subword representations using byte RNNs

17 Aug 12:02
Compare
Choose a tag to compare

This release adds support for word representations using byte RNNs. These word representations can be enabled by simply adding

[input]
subwords = true

For consistency, the embeddings.word and embeddings.tag options have also been moved to the input section. Hopefully, this is one of the last changes to the configuration file format.

Various options have been added to the graph writing scripts to control the hyper parameters of byte RNNs.

0.5.1

13 Aug 08:16
Compare
Choose a tag to compare
Bump version to 0.5.1

TensorBoard summaries, sticker-pretrain, support for Tensorflow 1.14

17 Jul 08:59
Compare
Choose a tag to compare

Features

  • Add support for writing TensorBoard summaries.
  • Split out pretraining support into a separate sticker-pretrain utility.
  • Add the --residual option for RNN residual connections.
  • Add the --maxlen option to sticker-{pretrain,train} for discarding long sentences.
  • Write the model hyperparameters to the graph and stdout. Provide the sticker-graph-metadata utility to print the hyperparameters.
  • Add the gpu_allow_growth option to allocate video RAM on a by-need basis.

Maintenance/bugfixes

  • Restore the hidden_size option for dilated convolutions.
  • Remove the DefaultConfig class from sticker-graph.
  • Support Tensorflow 1.14.
  • Fix CRF support.
  • Factor out tagger construction boilerplate to TaggerWrapper.

Thanks to @twuebi for the many contributions to this release!