Skip to content

Missing features in GFF3 data #108

@thobalose

Description

@thobalose

Hello,

I have a gff3 file with the following limits:

gff3

I am using the following to limit to the features of interest:

limit_info=dict(
            gff_source_type=[
                ('ena', 'transcript'), ('ena', 'CDS'), ('ena', 'gene'), ('ena', 'exon'),
                ('ena', 'tRNA_gene'), ('ena', 'ncRNA_gene'), ('ena', 'rRNA_gene'), 
                ('ena', 'pseudogene')
            ],
            gff_source=['ena']
)

However, I am having trouble retrieving data about the transcripts and CDS when parsing and looking for feature.type == 'transcript' or feature.type == 'CDS' in rec.features. It is as if these are not being captured.
What's interesting is that, when the limit_info dict is reduced to ('ena', 'transcript') or ('ena', 'CDS'), I do retrieve this data.

What might be the issue here?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions