Skip to content

Incorrect mapping of CHEK2 #107

@jstone-dev

Description

@jstone-dev

In MaveDB, CHEK2 variants have been mapped to NP_001005735.1, which is a poor choice, rather than the MANE Select protein ID NP_009125.1. The cause appears to be the fact that UTA has an outdated MANE Select transcript version (NM_007194.3; NM_007194.4 is current). The mechanism by which this leads to poor transcript selection is discussed at https://github.com/bbi-lab/variant-annotation/blob/main/docs/map_variants.md#transcript-selection.

I'd suggest that this could be addressed by

  • Updating UTA (if it now contains the current version)
  • Adding an option to force transcript selection (as has been done in the standalone pipeline linked above)
  • Or (perhaps) loosening the UTA query so that we find older versions of MANE transcripts. This approach might involve assumptions we don't want to make, so it should probably be accompanied by flagging the mapping somehow.

Metadata

Metadata

Assignees

No one assigned

    Labels

    app: workerTask implementation touches the workertype: bugSomething isn't workingworkstream: clinicalTask relates to clinical features

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions