Skip to content

Latest commit

 

History

History
41 lines (12 loc) · 1.69 KB

File metadata and controls

41 lines (12 loc) · 1.69 KB

image

This project aims at predicting chemical structure of metabolites from LC-MS/MS spectra using Deep Canonical Correlation Analysis(DeepCCA). DeepCCA is a deep learning extension of CCA. This work is done in three phases as outlined below.

1. Data processing and Embeddings

In this notebook, we clean, intergrate and generate embeddings of structure and spectra dataset.

2. Model development

This notebook contains DeepCCA optimization codes.

3. Prediction and Evaluation for model development

Here we perfom a cross modal retrieval. It takes in the spectra embeddings and outputs the most likely structure of that spectra. Next we evaluate using Tanimoto scores whether the predicted structure is similar to the true structure.

Training the final model

After selecting best performing hyperparameters, we train the final model in this notebook

Final model predictions and Evaluation

The final model is used to predict the structures of query spectrum in this notebook