This page describes the conventions we aim to follow for all data in the model.

General

  • BioEntity
    • The identifier should always be filled, even if it just duplicates another identifier, e.g. if a Gene has an organismDbId but no other identifier, then the same value should be set as the identifier
    • The identifier of a BioEntity is always a Synonym where the source is the provider of the identifier
    • Additional identifiers, names, accessions, etc. should each be a Synonym in the same way
    • When a data source is considered a primary provider, the source Database should be evidence for the BioEntity
  • Synonym
    • Synonym.type should be one of "name", "identifier" or "accession"
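
The conventions above can be sketched in code. This is a minimal illustration, not the model's actual implementation: the Python classes, the field names organism_db_id, synonyms and evidence, the helper apply_conventions, and the values "ExampleDB" and "EX0001" are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# The three Synonym types allowed by the conventions.
ALLOWED_SYNONYM_TYPES = {"name", "identifier", "accession"}


@dataclass
class Synonym:
    value: str
    type: str
    source: str  # the provider of this value

    def __post_init__(self) -> None:
        # Reject any type outside the allowed set.
        if self.type not in ALLOWED_SYNONYM_TYPES:
            raise ValueError(f"invalid Synonym.type: {self.type!r}")


@dataclass
class Gene:  # stands in for any BioEntity (hypothetical layout)
    organism_db_id: Optional[str] = None
    identifier: Optional[str] = None
    synonyms: List[Synonym] = field(default_factory=list)
    evidence: List[str] = field(default_factory=list)


def apply_conventions(gene: Gene, source: str, primary_provider: bool = False) -> Gene:
    """Hypothetical helper that applies the BioEntity conventions to one gene."""
    # Always fill the identifier, duplicating organismDbId if nothing else exists.
    if gene.identifier is None:
        gene.identifier = gene.organism_db_id
    if gene.identifier is None:
        raise ValueError("no value available to use as identifier")

    # The identifier is always also a Synonym whose source is its provider.
    identifier_synonym = Synonym(gene.identifier, "identifier", source)
    if identifier_synonym not in gene.synonyms:
        gene.synonyms.append(identifier_synonym)

    # A primary provider's Database is recorded as evidence for the BioEntity.
    if primary_provider and source not in gene.evidence:
        gene.evidence.append(source)
    return gene
```

For example, calling apply_conventions(Gene(organism_db_id="EX0001"), "ExampleDB", primary_provider=True) sets the identifier to "EX0001", adds one Synonym of type "identifier" with source "ExampleDB", and records "ExampleDB" as evidence. Additional names or accessions would be appended as further Synonym objects in the same way.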