Skip to content

Unexpected behavior: Nextclade ignores CDS unlinked to gene if gene is in gff3 #1576

@anna-parker

Description

@anna-parker

I'm not sure if this is really a bug or just nextclade not behaving as I would expect from the documentation.

@chaoran-chen created gene maps for the H5N1 segment M as follows:

.	.	CDS	26	784	.	+	.	gene="M1"
.	.	gene	26	1007	.	+	.	gene=M2;ID=gene-M2
.	.	CDS	26	51	.	+	.	gene=M2;ID=cds-M2;Parent=gene-M2
.	.	CDS	740	1007	.	+	.	gene=M2;ID=cds-M2;Parent=gene-M2

From the documentation this looked accurate to me. However, I now realized that this leads nextclade to ignore the M1 gene:

Image

Removing the second line (with gene) resolved the issue:

.	.	CDS	26	784	.	+	.	gene="M1"
.	.	CDS	26	51	.	+	.	gene=M2;ID=cds-M2
.	.	CDS	740	1007	.	+	.	gene=M2;ID=cds-M2

Image

My naive assumption is that nextclade ignores the CDS that is not linked to a gene if a gene is present - however I would not expect this behavior from the docs.

This can be tested with examples here: nextstrain/nextclade_data#217

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions