Training & Tuning CRNN Handwriting Models on Google Cloud

The code in this repository performs hyperparameter tuning and model training within Google Cloud's infrastructure. Model deployment is not yet implemented.

This code is built on Python 3.7.12.

Google Cloud Platform account requirements

Billable Google Cloud account (hyperparameter tuning, in particular, is expensive)
The following APIs enabled:
- Cloud Storage
- Vertex AI
- Artifact Registry
(Optional) GPU quota of at least 1
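The APIs listed above can also be enabled from the command line instead of the web console. The following is a minimal sketch, not part of this repository: it assumes the gcloud CLI is installed and authenticated, that the three APIs correspond to the service names shown, and that `my-project` is a placeholder for your own project ID. The helper name `enable_apis_command` is hypothetical. It only builds and prints the command so you can review it before running it yourself.

```python
import shlex

# Assumed mapping from the APIs named above to their gcloud service names.
REQUIRED_SERVICES = [
    "storage.googleapis.com",           # Cloud Storage
    "aiplatform.googleapis.com",        # Vertex AI
    "artifactregistry.googleapis.com",  # Artifact Registry
]


def enable_apis_command(project_id):
    """Return the argument list for a `gcloud services enable` call (hypothetical helper)."""
    return ["gcloud", "services", "enable"] + REQUIRED_SERVICES + ["--project", project_id]


if __name__ == "__main__":
    # "my-project" is a placeholder; substitute your own project ID.
    command = enable_apis_command("my-project")
    # Print a shell-safe version of the command for review.
    print(" ".join(shlex.quote(arg) for arg in command))
```

Paste the printed command into a terminal to run it. GPU quota is not covered here; it is requested separately through the Cloud console.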

Google Cloud Platform environment setup

In Cloud Storage, upload your image set (create a bucket if necessary, e.g. gs://fmnh_datasets/). Do this using the web UI, or using gsutil on your local machine (e.g. gsutil -m cp -r <source_folder> gs://fmnh_datasets/<dataset_name>).
In Vertex AI, navigate to Workbench --> Managed Notebooks, and click New Notebook near the top.
Name your new notebook, make sure it's in the same Region as everything else, change Permission to "Service account," and under "Advanced" be sure to check "Enable terminal." Adjust any other settings as needed.
While the notebook is being created, go to Artifact Registry and create a repository for your Docker image, making sure it's in the same Region as everything else. Navigate inside this new repo and click "Setup Instructions" near the top.
Copy the "Configure Docker" command. (It will look similar to gcloud auth configure-docker us-central1-docker.pkg.dev)
Return to Vertex AI. After a few minutes, you can click "Open JupyterLab" for the new notebook.
Clone this repo into your notebook using a Terminal window. (e.g. git clone https://github.com/emcdona1/handwriting_models_on_vertex_ai/)
Within your managed notebook, in a Terminal window, navigate to the repo folder. (e.g. cd handwriting_models_on_vertex_ai)
In the same Terminal window, paste and run your "Configure Docker" command.
Copy the image set (including metadata) into the workspace using a Terminal window. (e.g. gsutil -m cp -r gs://fmnh_datasets/IAM_Words ./)
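The terminal steps above (clone, configure Docker, copy the dataset) can be collected into one small script. This is a sketch under stated assumptions rather than a tool shipped with this repository: the function names `setup_commands` and `run_all` are hypothetical, the Artifact Registry host is assumed to follow the `<region>-docker.pkg.dev` pattern seen in the example command, and the region, bucket, and dataset values are the examples used in this README. By default it only prints the commands (dry run).

```python
import subprocess

REPO_URL = "https://github.com/emcdona1/handwriting_models_on_vertex_ai/"
REPO_DIR = "handwriting_models_on_vertex_ai"


def setup_commands(region, bucket, dataset):
    """Return (argument list, working directory) pairs for the terminal steps (hypothetical helper)."""
    # Assumed host pattern, based on the example us-central1-docker.pkg.dev.
    registry_host = "{}-docker.pkg.dev".format(region)
    dataset_uri = "gs://{}/{}".format(bucket, dataset)
    return [
        # Clone into the current directory.
        (["git", "clone", REPO_URL], None),
        # The remaining steps run from inside the cloned repo folder.
        (["gcloud", "auth", "configure-docker", registry_host], REPO_DIR),
        (["gsutil", "-m", "cp", "-r", dataset_uri, "./"], REPO_DIR),
    ]


def run_all(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for args, cwd in commands:
        print("[{}] {}".format(cwd or ".", " ".join(args)))
        if not dry_run:
            subprocess.run(args, cwd=cwd, check=True)


if __name__ == "__main__":
    # Example values taken from this README; replace them with your own.
    run_all(setup_commands("us-central1", "fmnh_datasets", "IAM_Words"))
```

Call `run_all(..., dry_run=False)` from a notebook Terminal session to execute the commands. Note that `gcloud auth configure-docker` may ask for confirmation before it updates your Docker configuration.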

See modules for specific steps after setup.

Contributors and licensing

This code has been developed by Beth McDonald (emcdona1, Field Museum).

This code was developed under the guidance of Dr. Matt von Konrat (Field Museum) and Dr. Rick Ree (Field Museum).

This project was made possible thanks to the Grainger Bioinformatics Center at the Field Museum.

Please contact Dr. von Konrat for licensing inquiries.
