[Feature request] give a header to each fasta #7

Louis-MG · 2023-04-05T17:32:15Z

I have been using your tool to gain more insight into kmer presence and absence in genomes, specifically for kmers unique to certain genomes. Because the output of UniqueKmer has the following form:

>sequence1
kmer1
kmer2
kmer3

downstream tools treat the kmers as one continuous sequence, kmer1kmer2kmer3. This is unintended and even creates kmers that do not exist in the original genome.
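
To illustrate the problem, here is a minimal Python sketch. The two 4-mers and the `kmers` helper are made up for the example and are not taken from UniqueKmer; the joining step only mimics what a typical FASTA parser does with multiple sequence lines under one header.

```python
def kmers(seq, k):
    """Return the set of all substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


# Hypothetical unique 4-mers listed under a single FASTA header.
unique = ["ACGT", "TTGA"]
k = 4

# A FASTA parser joins all sequence lines of one record into one string.
joined = "".join(unique)

# k-mers that appear only because the lines were concatenated.
spurious = kmers(joined, k) - set(unique)
print(sorted(spurious))
```

The k-mers that span the junction between the two lines are reported even though neither was in the list.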

Could you add an option to give a header to each kmer? The output would then look like this:

>sequence1:kmer:1
kmer1
>sequence1:kmer:2
kmer2
>sequence1:kmer:3
kmer3

I wrote an awk one-liner for this, but it would be convenient to have it as an option:

awk -i inplace '{if (/>/) {line=$0; sum=0} else {sum+=1; KMER=$0; print line ":kmer:" sum "\n" KMER} }' unique_kmers.fasta
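
For anyone without GNU awk (the `-i inplace` extension is gawk-specific), the same transformation could be sketched in Python roughly as follows. The function name `add_kmer_headers` is hypothetical and not part of UniqueKmer. Unlike the awk command, this sketch skips blank lines and ignores any sequence line that appears before the first header.

```python
def add_kmer_headers(lines):
    """Yield FASTA lines in which every kmer gets its own numbered header."""
    header = None
    count = 0
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # New record: remember its header and restart the counter.
            header = line
            count = 0
        elif header is not None:
            count += 1
            yield f"{header}:kmer:{count}"
            yield line


example = [">sequence1", "kmer1", "kmer2", "kmer3"]
result = list(add_kmer_headers(example))
print("\n".join(result))
```

To process a file, the lines of an open file handle can be passed in place of `example` and the result written to a new file.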
