Skip to content

This script automates the generation of tautomers from a CSV file containing SMILES and MoleculeID columns. It utilises the open-source Java library "ambit-tautomers-2.0.0-SNAPSHOT" for tautomer generation.

Notifications You must be signed in to change notification settings

nar-n/Tautomer-Generator-with-AMBIT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

Tautomer Generator

This script automates the generation of tautomers from a CSV file containing SMILES and MoleculeID columns. It utilises the open-source Java library "ambit-tautomers-2.0.0-SNAPSHOT" for tautomer generation.

Prerequisites

Before running this code, ensure you have the following dependencies installed:

  1. Java Development Kit (JDK): Ensure you have Java installed on your system.
  2. ambit-tautomers-2.0.0-SNAPSHOT.jar: Download the JAR file from the provided link.
  3. Pandas
  4. RDKit

Prepare Input:

  1. Place your input CSV file containing SMILES and MoleculeID. Ensure the CSV file follows the specified format.
  2. Download the ambit-tautomers-2.0.0-SNAPSHOT.jar from the provided link (https://sourceforge.net/projects/ambit/files/Ambit2/AMBIT%20applications/tautomers/ambit-tautomers-2.0.0-SNAPSHOT.jar) and ensure you have Java installed on your system.

Please note that:

  1. CSV and JAR files location: Ensure both the input CSV file and the ambit-tautomers-2.0.0-SNAPSHOT.jar file are in the same folder as the python script file.
  2. Input csv file name: Specify the input csv file name in the script.
  3. Tautomers Option: Choose between generating 'all' tautomers or selecting the 'best' tautomer for each molecule based on your requirements.

The script generates

  1. Tautomers for each molecule in the input CSV file. The tautomers will be saved in the same directory as the input CSV file, as separate SDF files.
  2. Additionally, a CSV file named tautomers_output.csv will be generated, summarising the tautomers.

About

This script automates the generation of tautomers from a CSV file containing SMILES and MoleculeID columns. It utilises the open-source Java library "ambit-tautomers-2.0.0-SNAPSHOT" for tautomer generation.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages