forked from Sanofi-Public/CodonBERT
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathsample.fasta
10 lines (10 loc) · 8.5 KB
/
sample.fasta
1
2
3
4
5
6
7
8
9
10
> 4fd56f12-d10d-11ec-8ffd-f217b3d50617
AUGAAGACCAUCAUCGCCCUGUCUUACAUCCUGUGCCUGGUGUUCGCCCAGAAGAUCCCCGGCAACGACAAUAGCACCGCCACCCUGUGCCUGGGCCACCACGCCGUGCCUAAUGGCACCAUCGUGAAGACCAUCACCAACGACAGAAUCGAGGUGACCAAUGCCACCGAGCUGGUGCAGAAUAGCAGCAUCGGCGAGAUCUGCGACAGCCCACACCAGAUCCUGGAUGGCGGCAAUUGUACCCUGAUCGACGCCCUGCUGGGCGAUCCUCAGUGCGACGGCUUCCAGAAUAAGGAGUGGGACCUCUUCGUGGAGCGGAGCCGGGCAAAUAGCAACUGCUACCCAUACGACGUGCCUGACUACGCCAGCCUGAGAAGCCUGGUGGCCAGCAGCGGCACCCUGGAGUUCAAGAACGAGUCCUUCAACUGGACCGGCGUGAAGCAGAACGGCACCAGCUCCGCCUGCAUCCGGGGCUCCAGCAGCUCCUUCUUCAGCCGGCUGAAUUGGCUGACCCACCUGAACUACACCUACCCCGCCCUGAACGUGACCAUGCCUAACAAGGAGCAGUUCGACAAGCUGUACAUCUGGGGCGUGCACCACCCCAGCACCGAUAAAGACCAGAUCAGCCUGUUCGCCCAGCCUAGCGGCAGAAUCACCGUGAGCACCAAGAGAAGCCAGCAGGCCGUGAUCCCCAAUAUCGGCAGCAGACCCCGGAUCAGAGAUAUCCCAUCCCGGAUCAGCAUCUACUGGACCAUCGUGAAGCCCGGCGACAUCCUGCUGAUCAACAGCACCGGCAACCUGAUCGCCCCCCGGGGCUACUUCAAGAUCCGGAGCGGCAAGAGCAGCAUCAUGAGAAGCGACGCCCCCAUCGGCAAGUGCAAGUCCGAGUGCAUCACCCCCAACGGCAGCAUUCCCAACGACAAGCCCUUCCAGAACGUGAACAGAAUCACAUACGGCGCCUGCCCUAGAUACGUGAAGCAGAGCACCCUGAAGCUGGCCACCGGCAUGCGGAAUGUGCCUGAGAAACAGACCCGGGGCAUCUUUGGCGCCAUCGCCGGCUUCAUCGAGAAUGGCUGGGAGGGCAUGGUGGACGGCUGGUACGGCUUCCGGCACCAGAACAGCGAGGGCCGGGGCCAGGCCGCCGAUCUGAAGUCCACCCAGGCCGCCAUCGACCAGAUCAAUGGCAAGCUGAAUAGACUGAUCGGCAAGACCAAUGAGAAGUUCCACCAGAUCGAGAAGGAGUUCAGCGAGGUGGAGGGCAGAGUGCAGGAUCUGGAGAAGUACGUGGAGGACACCAAGAUCGACCUGUGGAGCUACAACGCCGAGCUGCUGGUGGCCCUGGAGAAUCAGCACACCAUCGACCUGACCGACUCUGAGAUGAACAAACUCUUCGAGAAGACCAAGAAGCAGCUGAGAGAGAAUGCCGAAGACAUGGGCAAUGGCUGUUUCAAGAUCUACCACAAGUGUGACAAUGCCUGCAUCGGCAGCAUCCGGAAUGAGACAUACGACCACAACGUGUACCGGGAUGAGGCCCUGAAUAAUAGAUUCCAGAUCAAGGGCGUGGAGCUGAAGUCCGGCUACAAGGACUGGAUCCUGUGGAUCAGCUUCGCCAUGAGCUGCUUCCUGCUGUGCAUCGCCCUGCUGGGCUUCAUCAUGUGGGCCUGCCAGAAGGGCAAUAUCAGAUGCAAUAUCUGCAUUUGAUGA
> 4fd56fe4-d10d-11ec-8ffd-f217b3d50617
AUGAAGACCAUUAUCGCCCUGAGCUACAUCCUGUGCCUGGUGUUUGCUCAGAAGAUCCCUGGCAACGACAAUUCCACCGCCACCCUGUGCCUGGGCCACCACGCCGUGCCAAAUGGCACAAUCGUGAAGACCAUCACCAAUGACAGAAUCGAGGUGACCAAUGCCACCGAGCUGGUGCAGAACAGCAGCAUCGGCGAGAUCUGCGACAGCCCUCACCAGAUUCUGGAUGGGGGCAAUUGCACCCUGAUUGAUGCCCUGCUGGGCGAUCCCCAGUGUGACGGAUUCCAGAAUAAGGAGUGGGACCUCUUCGUGGAGAGAAGCCGGGCCAAUAGCAAUUGCUACCCUUACGAUGUGCCUGACUACGCCAGCCUGAGAAGCCUGGUGGCCUCCAGCGGCACACUGGAGUUCAAGAAUGAGAGCUUCAAUUGGACCGGCGUGAAGCAGAAUGGCACCAGCAGCGCCUGCAUCAGAGGCUCCAGCAGCAGCUUCUUCUCUCGGCUGAAUUGGCUGACCCACCUGAAUUACACCUACCCCGCCCUGAAUGUGACCAUGCCAAACAAGGAGCAGUUCGACAAGCUGUACAUCUGGGGCGUGCACCACCCUAGCACCGACAAAGACCAGAUCAGCCUGUUCGCCCAGCCCUCCGGCAGAAUCACCGUGAGCACCAAGCGGUCCCAGCAGGCCGUGAUCCCUAAUAUCGGCUCUCGGCCCCGGAUCAGAGAUAUCCCCAGCAGAAUCAGCAUCUACUGGACCAUCGUGAAGCCAGGCGACAUCCUGCUGAUCAACAGCACCGGCAACCUGAUCGCCCCUAGAGGCUACUUCAAGAUCCGGUCUGGCAAGAGCAGCAUCAUGAGAAGCGACGCCCCCAUUGGCAAGUGCAAGAGCGAGUGCAUCACCCCUAAUGGCAGCAUCCCCAAUGAUAAGCCCUUCCAGAAUGUGAACCGGAUCACCUACGGCGCCUGUCCUCGGUACGUGAAGCAGAGCACCCUGAAGCUGGCCACCGGCAUGCGGAAUGUGCCUGAGAAGCAGACCAGAGGCAUUUUCGGCGCUAUCGCCGGCUUCAUCGAGAAUGGCUGGGAGGGCAUGGUGGAUGGCUGGUAUGGCUUCCGGCACCAGAAUAGCGAGGGCAGAGGCCAGGCCGCCGACCUGAAGUCCACACAGGCCGCCAUCGACCAGAUCAAUGGCAAGCUGAACCGGCUGAUCGGCAAGACCAAUGAGAAGUUCCACCAGAUCGAGAAGGAGUUCAGCGAGGUGGAGGGCAGAGUGCAGGAUCUGGAGAAGUACGUGGAGGACACCAAGAUCGACCUGUGGUCCUACAAUGCCGAGCUGCUGGUGGCUCUGGAGAAUCAGCACACCAUCGACCUGACAGAUAGCGAGAUGAACAAACUCUUCGAAAAGACCAAAAAGCAGCUGAGAGAGAAUGCUGAAGACAUGGGCAAUGGCUGCUUCAAAAUCUACCACAAGUGCGACAAUGCUUGCAUCGGCUCCAUCCGGAAUGAGACAUAUGAUCACAACGUCUACAGAGACGAAGCCCUGAAUAAUAGAUUCCAGAUCAAGGGCGUGGAGCUGAAGUCCGGCUACAAGGACUGGAUCCUGUGGAUCAGCUUCGCCAUGAGCUGCUUCCUGCUGUGCAUCGCCCUGCUGGGCUUCAUCAUGUGGGCCUGCCAGAAGGGCAAUAUCAGAUGCAACAUCUGCAUUUGAUGA
> 4fd57020-d10d-11ec-8ffd-f217b3d50617
AUGAAAACAAUAAUAGCCCUGUCCUACAUCCUGUGUCUGGUGUUCGCCCAGAAGAUCCCCGGCAACGAUAACAGUACAGCCACCCUGUGCCUUGGGCACCACGCCGUGCCCAACGGCACAAUCGUGAAGACCAUCACGAACGAUCGCAUCGAGGUCACCAAUGCUACUGAGCUGGUGCAGAACAGUAGCAUUGGUGAGAUCUGCGAUUCCCCCCACCAGAUCCUGGAUGGGGGGAACUGUACUCUGAUCGAUGCCCUUCUGGGCGAUCCACAGUGCGACGGGUUCCAGAACAAGGAGUGGGACCUGUUUGUGGAGCGGAGUAGGGCUAACAGCAACUGUUACCCCUACGACGUGCCAGACUACGCAAGCCUGCGUAGUCUGGUGGCUUCCUCCGGGACACUGGAGUUCAAGAACGAGAGCUUCAACUGGACCGGAGUGAAGCAGAACGGGACGAGCUCCGCCUGUAUCAGGGGGAGCUCGUCCAGCUUCUUCAGCCGGCUGAACUGGCUGACCCAUCUGAACUACACUUAUCCUGCUCUGAACGUGACCAUGCCCAACAAGGAGCAGUUCGAUAAGCUGUACAUUUGGGGGGUGCACCACCCCUCAACCGACAAGGACCAGAUCAGCCUGUUCGCCCAGCCCUCCGGCCGUAUCACCGUGAGCACCAAGCGCUCCCAGCAGGCCGUGAUCCCGAACAUCGGGUCACGGCCUCGGAUUAGGGACAUCCCUAGCCGGAUCAGCAUUUACUGGACUAUUGUGAAGCCAGGAGACAUCCUGCUGAUCAAUAGUACCGGUAACCUGAUCGCACCUCGCGGGUACUUCAAGAUCAGGAGCGGCAAGAGCAGCAUCAUGCGCUCUGACGCUCCUAUCGGGAAGUGCAAGAGCGAGUGCAUUACUCCCAACGGGAGUAUUCCUAAUGACAAGCCCUUCCAGAAUGUCAAUAGGAUCACCUACGGUGCCUGCCCACGGUACGUGAAGCAGAGCACCCUGAAGCUGGCGACUGGCAUGCGGAAUGUGCCCGAGAAGCAGACUCGGGGCAUAUUCGGCGCCAUCGCCGGCUUCAUCGAGAACGGCUGGGAGGGCAUGGUGGACGGGUGGUACGGCUUCAGGCACCAGAACAGCGAGGGACGGGGCCAGGCUGCCGAUCUGAAGUCUACUCAGGCUGCCAUUGAUCAGAUCAAUGGCAAGCUGAAUAGACUGAUCGGCAAGACCAACGAGAAGUUCCACCAGAUCGAGAAGGAGUUCUCCGAGGUGGAGGGCCGGGUCCAGGACCUGGAGAAGUACGUGGAGGACACCAAGAUCGACCUGUGGAGCUACAAUGCAGAGCUGCUGGUGGCUCUCGAGAACCAGCACACUAUUGAUCUCACUGACAGUGAGAUGAAUAAGCUGUUCGAGAAGACCAAGAAGCAGCUCCGCGAGAACGCGGAGGACAUGGGGAAUGGUUGCUUCAAGAUAUAUCACAAGUGUGACAACGCUUGUAUUGGAAGCAUCCGCAACGAAACCUAUGACCACAACGUGUACAGGGACGAGGCCCUGAACAAUCGGUUCCAGAUCAAGGGUGUGGAGCUGAAGAGUGGCUACAAGGACUGGAUCCUGUGGAUCUCCUUCGCCAUGAGUUGCUUCCUGCUGUGUAUUGCCCUUCUGGGGUUCAUUAUGUGGGCCUGCCAGAAGGGCAAUAUCAGGUGCAACAUCUGCAUU
> 4fd57048-d10d-11ec-8ffd-f217b3d50617
AUGAAAACCAUAAUCGCCCUGAGCUACAUCCUGUGCCUGGUGUUCGCCCAGAAGAUCCCCGGCAACGACAAUAGCACAGCCACCCUGUGCCUGGGGCACCACGCCGUGCCCAACGGCACCAUCGUGAAGACCAUCACCAACGAUCGGAUCGAGGUGACCAACGCCACCGAGCUGGUGCAGAACAGCAGCAUCGGGGAGAUAUGUGACAGCCCCCACCAGAUCCUGGAUGGGGGCAACUGCACCCUGAUCGACGCCCUGCUGGGGGACCCCCAGUGCGACGGGUUCCAGAACAAGGAGUGGGACCUGUUCGUGGAGCGGAGCAGGGCCAACAGCAACUGCUACCCCUACGACGUGCCAGACUACGCCAGCCUGCGGAGUCUGGUGGCUUCCUCCGGCACCCUGGAGUUCAAGAACGAGAGCUUCAACUGGACCGGAGUGAAGCAGAAUGGGACCAGUUCAGCCUGCAUCAGAGGGAGCAGCAGCUCCUUCUUCAGCAGGCUGAACUGGCUGACCCACCUGAACUACACCUACCCAGCCCUGAACGUGACCAUGCCCAACAAGGAGCAGUUCGACAAGCUGUACAUCUGGGGGGUGCACCACCCCUCCACCGACAAGGACCAGAUCAGCCUGUUCGCCCAGCCCAGCGGGCGGAUCACUGUGAGCACCAAGCGCUCACAGCAGGCCGUGAUCCCCAACAUCGGAUCACGGCCUCGGAUCCGGGACAUCCCCUCCCGGAUCUCCAUCUACUGGACUAUUGUGAAGCCCGGGGACAUCCUGCUGAUCAACAGCACCGGCAACCUGAUCGCCCCCCGGGGCUACUUCAAGAUCAGGUCCGGCAAGAGCAGCAUCAUGCGCUCUGAUGCUCCCAUUGGGAAGUGCAAGAGCGAGUGCAUCACCCCCAAUGGGAGCAUCCCCAACGACAAGCCCUUCCAGAACGUGAACAGGAUCACCUACGGGGCCUGCCCACGGUACGUGAAGCAGAGCACCCUGAAGCUGGCCACCGGCAUGCGCAACGUGCCUGAGAAGCAGACACGGGGCAUCUUCGGGGCCAUCGCCGGGUUCAUCGAGAACGGCUGGGAGGGCAUGGUGGACGGCUGGUACGGCUUCCGCCACCAGAACAGCGAGGGCCGGGGCCAGGCCGCCGAUCUGAAGUCUACCCAGGCCGCCAUUGAUCAGAUCAAUGGCAAGCUGAAUAGACUGAUCGGCAAGACCAACGAGAAGUUCCACCAGAUCGAGAAGGAGUUCUCCGAGGUGGAGGGCCGGGUGCAGGACCUGGAGAAGUACGUGGAGGACACCAAGAUCGACCUGUGGAGCUACAACGCCGAGCUGCUGGUGGCCCUGGAGAAUCAGCACACCAUUGAUCUGACCGACAGCGAGAUGAACAAGCUGUUCGAGAAGACCAAGAAGCAGCUGCGGGAGAACGCUGAGGACAUGGGCAACGGAUGCUUCAAGAUCUACCACAAGUGUGACAACGCUUGUAUCGGAAGCAUCCGGAACGAAACCUAUGACCACAACGUGUACAGGGACGAGGCCCUGAACAACCGGUUCCAGAUCAAGGGCGUGGAGCUGAAGUCCGGGUACAAGGACUGGAUCCUGUGGAUCUCCUUCGCCAUGUCCUGCUUCCUGCUGUGUAUUGCCCUGCUGGGGUUCAUCAUGUGGGCCUGCCAGAAGGGCAAUAUCAGAUGCAACAUCUGCAUC
> 4fd57070-d10d-11ec-8ffd-f217b3d50617
AUGAAAACAAUAAUCGCCCUGUCCUAUAUCCUGUGCCUGGUCUUCGCCCAGAAGAUCCCAGGCAACGACAACUCCACCGCCACCCUGUGUCUGGGUCAUCACGCGGUGCCUAAUGGCACCAUCGUGAAGACCAUCACCAACGAUAGGAUCGAGGUGACCAAUGCUACUGAGCUGGUGCAGAACAGUAGCAUUGGCGAGAUCUGCGACAGCCCCCACCAGAUCCUGGAUGGGGGCAACUGUACGCUGAUCGACGCCCUGCUGGGGGACCCCCAGUGCGAUGGCUUCCAGAACAAGGAGUGGGACCUCUUUGUGGAGCGGAGUAGGGCUAACAGCAACUGUUACCCCUACGACGUGCCAGACUACGCAAGCCUGCGUAGUCUGGUGGCUUCCUCCGGGACACUGGAGUUCAAGAACGAGAGCUUCAACUGGACCGGAGUGAAGCAGAACGGGACGAGCUCCGCCUGUAUCAGGGGGAGCUCGUCCAGCUUCUUCAGCCGGCUGAACUGGCUGACCCACCUGAACUACACUUAUCCUGCCCUGAACGUGACCAUGCCCAACAAGGAGCAGUUCGAUAAGCUGUACAUUUGGGGGGUGCACCACCCCUCAACUGAUAAGGAUCAGAUCUCACUGUUCGCCCAGCCCUCUGGGCGGAUCACAGUGAGCACAAAGAGGUCCCAGCAGGCCGUGAUCCCGAACAUCGGGUCACGGCCUCGGAUCCGGGACAUCCCGUCCCGGAUCUCCAUCUACUGGACCAUCGUGAAGCCUGGGGACAUUCUGCUGAUCAACUCCACUGGGAAUCUGAUUGCCCCCCGGGGGUACUUCAAGAUUCGCAGUGGAAAGAGCAGCAUCAUGCGCUCUGAUGCUCCCAUUGGGAAGUGCAAGAGCGAGUGCAUUACCCCCAAUGGGAGCAUCCCGAAUGACAAGCCAUUCCAGAAUGUCAACAGGAUCACGUACGGGGCCUGCCCCCGGUACGUGAAGCAGAGCACCCUCAAGCUGGCCACCGGCAUGAGGAACGUGCCUGAGAAGCAGACACGGGGUAUCUUCGGGGCCAUCGCCGGCUUCAUCGAAAACGGCUGGGAGGGGAUGGUGGACGGCUGGUACGGCUUCCGCCAUCAGAACUCCGAGGGCCGCGGCCAGGCCGCGGACCUCAAGUCUACUCAGGCUGCCAUUGAUCAGAUCAAUGGCAAGCUGAAUAGACUCAUUGGGAAGACCAAUGAGAAGUUCCACCAGAUUGAGAAGGAGUUCUCAGAGGUGGAGGGCCGGGUCCAGGACCUGGAGAAGUACGUUGAGGACACAAAGAUCGACCUUUGGUCCUACAACGCCGAGCUUCUGGUGGCCCUGGAGAAUCAGCACACCAUUGAUCUGACAGACAGCGAGAUGAACAAGCUGUUUGAGAAGACCAAGAAGCAGCUGAGGGAGAAUGCCGAGGACAUGGGCAACGGAUGCUUCAAGAUCUACCACAAGUGUGACAACGCUUGUAUCGGAAGCAUCCGGAAUGAAACCUAUGACCACAACGUGUACAGGGACGAGGCCCUGAACAAUAGGUUUCAGAUCAAGGGGGUGGAGUUGAAGUCAGGAUAUAAGGACUGGAUCCUGUGGAUCAGCUUUGCCAUGAGCUGCUUCCUGCUGUGUAUUGCCCUUCUGGGGUUCAUUAUGUGGGCCUGCCAGAAGGGCAAUAUCAGAUGCAAUAUUUGCAUC