Skip to content

Framework for hit identification in High Throughput Screening (HTS)

License

Notifications You must be signed in to change notification settings

zaritskylab/AnomalyDetectionScreening

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Anomaly detection for high-content image-based phenotypic cell profiling

High-content image-based phenotypic profiling combines automated microscopy and analysis to identify phenotypic alterations in cell morphology and provide insight into the cell's physiological state. Classical representations of the phenotypic profile can not capture the full underlying complexity in cell organization, while recent weakly machine-learning based representation-learning methods are hard to biologically interpret. We used the abundance of control wells to learn the in-distribution of control experiments and use it to formulate a self-supervised reconstruction anomaly-based representation that encodes the intricate morphological inter-feature dependencies while preserving the representation interpretability. The performance of our anomaly-based representations was evaluated for downstream tasks with respect to two classical representations across four public Cell Painting datasets. Anomaly-based representations improved reproducibility, Mechanism of Action classification, and complemented classical representations. Unsupervised explainability of autoencoder-based anomalies identified specific inter-feature dependencies causing anomalies. The general concept of anomaly-based representations can be adapted to other applications in cell biology.

Downloading data

All augmented per-well aggregated Cell Painting datasets were downloaded from the Cell Painting Gallery (CPG) (https://registry.opendata.aws/cellpainting-gallery/).

Project setup and run:

  1. Clone this repository.
  2. Open cmd/shell/terminal and go to project folder: cd AnomalyDetectionScreening
  3. Create a conda environment: conda create -n pytorch_anomaly python=3.10.9
  4. Activate the conda environment conda activate pytorch_anomaly
  5. Install the required packages: pip install -r requirements.txt
  6. Run python ads/main.py. This script will run the anomaly detection pipeline and subsequent analyses: train the anomaly detection model, calculate percent replicating, train mechanism of action (MoA) and generate SHAP-based anomaly explanations.
  7. Run additional notebooks under 'notebooks/' to generate figures and tables.

Citation

For a complete discussion of our findings please view our preprint:

TBD.

About

Framework for hit identification in High Throughput Screening (HTS)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published