Skip to content

New Featurizers (removing old, ineffective ones) #185

Open
@ardunn

Description

@ardunn

Our results depend entirely on the quality of our descriptors. We need to add better ones, if possible, and prune the "best" sets to keep only the best ones.

A few ideas for featurizers to add to matminer (some of which are already issues on the matminer repo):

  • SOAP thru dscribe or native implementation
  • MBTR thru dscribe or native
  • U-apI thru ABCNN or native
  • Matscholar word embeddings (for definite compositions, structure prototypes, and combinations of these via dot product)
  • A structure featurizer based on: A site featurizer that averages/combines composition properties, where the compositions are the bonds [determined via crystalnn], and the composition properties can be from any combination of composition featurizers (this is already done somewhat through Ward et al. LocalPropertyDifference)
  • Flat version of SineCoulombMatrix or CoulombMatrix

Adding these and then evaluating whether we should add them to automatminer (i.e., by evaluating them by themselves on some easy test problems using the same model) should be a priority.

Metadata

Metadata

Assignees

No one assigned

    Labels

    major enhancementv2.0Issues and enhancements for upcoming major release v2.0

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions