Skip to content

How to cluster structures based solely on 3Di #543

@luisbustom-oss

Description

@luisbustom-oss

Hi,

For a set of aminoacid sequences in a .fa file, I generated a db using Prost5.

foldseek createdb file.fa db --prostt5-model weights --gpu 1

I was also able to get the 3Di translations as .fasta file

foldseek lndb db_h db_ss_h
foldseek convert2fasta db_ss db_ss.fasta

However, when trying to cluster such structres...:
foldseek easy-cluster db cluster/cluster tmp --alignment-type 0

... it does not work:

Time for read in: 0h 0m 0s 210ms
tmp/2433386163849198049/clu_tmp/11610238211166517392/clustering.sh: line 124: 2435900 Segmentation fault      (core dumped) "$MMSEQS" clust "$INPUT" "${TMP_PATH}/pref_rescore1" "${TMP_PATH}/pre_clust" ${CLUSTER_PAR}
Error: Pre-clustering step died
Error: Search died

Could you please help me figure out how to proceed? I should say that I only want to cluster my structures on the basis of 3Di alignment, and not amino acid sequence.

Thank you in advance for your time

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions