Skip to content

Latest commit

 

History

History
38 lines (32 loc) · 1.51 KB

CONTRIBUTORS.md

File metadata and controls

38 lines (32 loc) · 1.51 KB

Description of team member contribution

Group ρ

Azfar Imtiaz

  • Vectorization, one-hot encoding of vocabulary
  • Vectorization, load word vectors from Gensim
  • Design a basic working network with PyTorch nn module
  • Build the forward function with biases and weights
  • Use sparse matrices instead of dense matrices for combating memory issues
  • Clean up the code, create script for utility functions and remove extra scripts
  • Convert the two separate language and translation model scripts into single scripts with parameter to specify model type
  • Contributed to the report

Elin Hagman

  • Open and reading text files, preprocessing text and tokenization.
  • Building trigrams from vobabulary sentences.
  • Added options for GPU mode
  • Added zero grad before back propagation
  • Added log_softmax to help with vanishing gradients
  • Loading data in batches
  • Creating scripts for running training and testing in terminal
  • Contributed to the report

Sandra Derbring

  • Splitting vocabulary into training and test data sets by percentage.
  • Vectorization, removing out-of-vocabulary words
  • Vectorization, building trigrams of vectors
  • Writing cross entropy functionality for calculating loss
  • Building predict function in the neural network
  • Creating testing script
  • Breaking out training and testing initialization into separate files
  • Saving models to files with possibility to resume training if interrupted
  • Trained and tested models for the report
  • Implemented sanity checks for arguments
  • Contributed to the report