SEARCH

Sequence

GENOMES

Enrichment analyses (GO, KEGG, KOG, NR, SwissProt) were conducted,
including CDS regions and upstream 2k sequences.

BH - BnaA01T001400BH

CDS Sequence (7074 bp)
ATGTGTGATACAAGATTACGTTCCAAGTGGCGGAGTGTGATTGGCAAGATGGCATCAACCTCACTCCTCA
AGGCATCTCCTGTGTTGGACAAATCGGAGTGGGTCAAGGGACAAAGCGTTCTCTTCCGTCAGCCATCTTC
CGCCGCAGTCGTCATACGTAACCGTGCCACCTCCCTTACCGTCCGTGCCGCTTCTTCCTACGCCGATGAG
CTTGTTAAGACAGCGAAAACAATTGCGTCTCCTGGACGAGGAATCTTGGCGATGGACGAGTCGAACGCGA
CTTGCGGGAAGCGTTTGGACTCGATAGGGCTAGAGAACACTGAGGCAAACCGACAAGCATACAGGACATT
GCTGGTCTCTGCACCAGGACTCGGACAGTACATCTCCGGTGCAATCCTGTTCGAGGAGACTCTGTATCAG
TCGACCACCGAAGGCAAGAAAATGGTCGACGTCCTCGTCGAGCAGAACATCGTCCCCGGTATCAAAGTCG
ACAAGGGTTTGGTCCCACTTGTTGGCTCTAACAATGAGTCATGGTGCCAAGGGCTCGATGGTCTATCCTC
CCGAACTGCTGCTTACTATCAACAGGGTGCTCGTTTCGCCAAATGGCGTACTGTCGTGAGCATTCCCAAT
GGTCCGTCTGCTCTAGCTGTAAAAGAAGCTGCTTGGGGCCTTGCCCGATACGCTGCCATTTCACAAGACA
GTGGATTGGTTCCAATAGTGGAGCCAGAGATATTGTTGGACGGAGAACACGACATCGACAGGACTTACGA
AGTAGCAGAGAAGGTCTGGGCTGAGGTTTTCTTTTACCTTGCTCAGAACAATGTCATGTTTGAAGGTATT
CTCCTGAAGCCGAGCATGGTGACTCCCGGAGCCGAGTCTAAAGACAGAGCTACTCCTGAACAAGTTGCCT
CCTACACCCTTAAACTCCTCCGCAACAGAATCCCTCCGGCCTTCTTGTCGGGAGGACAGTCTGAGTTGGA
GGCAACGTTGAACCTGAACGCGATGAACCAAGCACCAAACCCGTGGCATGTATCCTTCTCCTACGCACGT
GCCCTGCAGAACACATGTTTGAAAACATGGGGAGGCAGAGCTGAGAACGTGAACGCAGCTCAGACCACTC
TATTGGCTAGAGCCAAGGCCAATTCGCTGGCTCAGCTTGGAAAATACACAGGAGAAGGTGAGTCTGAAGA
GGCTAAGGAGGGCTCGCTGAATATAAGAGCCCAACGTAGGGACGTCGACGTGGCAGGCAAAAAGGTCATT
AACAAGATTTGTCACGTCACTGCCCCATCAAGCTCTCGGGAGAGAAAGAGAAGCCGGCGAATCTGTGATC
TGGGGGACCAAATCAATGCCGATCGGTCTAGTGGTCCCTCTAGACGGGGAGAGAGAGAGAGGAAATGCAA
ACTGAAGAAGAAGAAGAAGAAGAAGAACCAAATCAGTAGAGCAGAGATCCTCACCCGATTCCGAGCAAAA
TCCACAAGGAAGATGGAAAAGACACACATGTCTGAGAAGATACTGGTTCTCGTGAGACTGAGGCCCCCTA
ACCAGAAAGAGATTGCCTCTAACGAACCTACGGAGGATTGGGAATGTCTCAATGATACCACCATTTTGTA
CAGAAGAAACACCTTCCGCCAATCCTCCAACTTTCCTTCTGCTTATTCTTTTGATAGAGTATACGGAGGT
GAATGTTCCACCAGACAAGTCTACGAGAATGGAACCAAGGACATCGCTCTCTCTGTTGTCAAAGGAATCA
ATTGTAGTATTTTTGCGTACGGCCAGACGAGTAGCGGAAAGACTTACACCATGTCTGCTATCACTGAGTT
TGCTCATGAAGAAAGAGCATTTTCCGTTAAATTTTCAGCTATAGAGATCTACAATGAGGCCATCCGAGAT
TTGCTCAGCTCCGATGGTACATCCCTTAGGCTACGAGACGATCCTGAGAAAGGGACAGTGGTTGAAAAAG
CCACAGAGGAAACTCTGCGAGATTGGAACCATCTCAAGGACCTTCTATCTGTTTGTGAAGCACAACGTAA
GATTGGTGAAACCTCACTGAATGAGAGAAGTTCCAGATCTCATCAGATTATCAGACTGACGGTTGAAAGC
TCTGCTCGTGAGTTCTTAGACAAAGAAAACTCCACCACCCTCATGGCGAGTGTGAATTTCATAGATCTGG
CGGGAAGCGAGCGTGCATCACAGGCGATGTCAGCTGGTGCGAGGCTCAAGGAAGGCTGCCATATCAACCG
AAGTTTGCTTACTCTTGGAACTGTGATCCGTAAACTTAGTAAGGGAAGGCAAGGGCACATCAACTTTAGA
GACTCCAAGCTCACACGAATTCTACAGCCATGCTTGGGTGGTAATTCAAGAACCGCCATCATCTGCACTC
TGAGCCCAGCGAGGAGTCACGTCGAGCTAACGAGGAACACTCTCTTGTTTGCTTGCTGTGCAAAGGAAGT
TACCACAAAGGCCCGGATCAACGTCGTTATGTCAGACAAGGCCCTGCTGCAGCAACTACAGCGTGAGCTT
GCGAGGCTCGAGACCGAGCTGAGAACCCCTGCCCCGCCTGCCTCGAAATGTGATTGTGCGATGACGGTGA
GGAAGAAGAATCTTCAGATACAAAAGATGGAAAAGGAGATGGCAGAGTTGAGAGAAGAGAGAGATCTTGC
TCAATCCCGGCTTGAAGATATCATGAGAATGGTTGAACTCGATGAGGCCTCAAAGTGTGGAACTCCGCAG
CACATAGACAAGTGGGAGGATGGTTCAGTGTCACAGACATCAATAACAAGAGCTTATGTTGGGTCTCATT
CTGAGGATGATGATGATGAAGAGCTGCCTACGCGTTCTGAAGATCCATCAGAAGAATATTGCAGAGAAGT
TCAATGCATTGAGATTGAGACATCAGCTACAGTCAACCACAAAGACGAAAAAAGAGCAGAACCTAAGAAC
ATTTTAGGCCCTAGTGTAGGCCAAAACGTGAGGTTGAGAAGCTGGAACCGCAGAGAGACCGCGAGCACTC
CACCTGAAAACATAGGGACAGAAAGACCAGAAGAAGAGAGTCACAAGAAGATAGTGTTTTCTGGTTTAGA
GTTGGGTTCTAGTGTGTCAAGGAATGATTCGCTGTCTTCTTGCGGGAGCGATTCCACTGCGACTCAGAGC
ATCAGAACGCCCTTGGGAGAAGAGGGAGGTATCACTAGCATCCGCACTTTCGTTGATGGTCTTAAGGAGA
TGGCTAAGCGTCAAGGCCAGGTGTCGATTGGTGATGATGAATCTGGTAAAATGGGGAGGGATATTGGTCT
GGTCATTATGGACATGGAATTTGAGAGGCAGCGGCGAGAGATTGTTGAGCTATGGCAAAGCTGTAACGTC
TCTTTGGTGCATAGAACATACTTCTACTTGCTCTTCAAAGGAGATGATGAGGCTGATTCAATCTACATTG
GGGTTGAGCTAAGAAGACTTTTGTTTATGAAAGCTCGCTTCTCTCAGGGAAACCAAACTTTGGAAGGAGG
AGAAACCTTAACACTGGCTTCAAGCCGGAAGGCGCTGCACGGGGAGAGAATGATGCTGAGCAAGCTGGTG
GGGAAAAGGTTTTCAGGAGAAGAGAGGAGAAGAATGTATCACAAGTTTGGGATTGGTGTCAACTCCAAAC
GCAGGCGTTTACAACTCGTGAACGAGCTTTGGAGCAATCCCAAGGACATGAGTCAGGTTGTGGAAAGCGC
AGATGTTGTAGGGAAGCTTGTGAGGTTCGCTGAGCAAGGGAGAGCCATGAAGGAGATGTTTGGTCTGGCC
TTCACCCCTCCTTCCTTCTTGACAGCTCAGAGATGGCGAGCCAACGTTGGTGTTTCTCTCAACAAAGTGA
AATCAAATGCAGGTCGGTTAGCCAGTGAAAATGGAATGTACAGGAGAAGACGAGGGCTGCGGCAGCGCCA
AGGAAAGACGATACTAACCTCCCTCCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTTCCTTTGGCTCTTCT
CCCGAGATCTTTGCTCTCTATTCCCTTCGCTCCTTTCTCCTTCCATATCCTCAGAGGATGGAGAGCGGTG
GTCAGTATGAAAATGGGCGTTACAACCCCGATTACTACAAAGAAGGAACACACTCTGTCTGGAATGCGAT
GCCTAATCATCATCAGACAAAGGAGGACCAACATAATGCTCTGGTCATGAATCAGAAGATCATGTCCATC
CTTGCCGAGAGAGACGCTGCCCTCAAGGAAAGAGATGACGCCCTGGCTGCCAAGCAGGAAGCTTTAGCCG
CTCGGGACGAAGCACTTGACCTACGCGACAAAGCTCTCTCTCTAAGAGATAATGCTATTCTGGAGAGGGA
CAGTGCCTTAAGTGCTCTTCAGTTCCGTGAACACAATCTAAACTACATTTTGTCACGTGCAAAGCTCGGT
GCCTCCCAAAGCTCTCACTTACCCAACCCTTCCCCCTTGTCAACTATTCCACATGAAGCTGCTCCAAGTA
AAAGAAAAAAAAAGCGCAAACAGGAAACAAGGTCAAAGGGAAAGAGAGTAGGCGAAGATCATGTTGCTTC
TCCTGGAAAGAAATGCAGAAAAGATTGGGACAGTAACGTTGTCGGCTTAAACCTTGTCACCTTCGATGAG
ACGACAATGCCAGTGCCCATGTGCACTTGTACTGGTACTGCTCGTCACTGTTACAAATGGGGGAACGGCG
GGTGGCAATCATCATGCTGCACTACCACTTTGTCTCTGTATCCTCTTCCGCAGATGCCAAACAAGCGCCA
TTCTCGAGTGGGCGGTAGGAAAATGAGCGGAAACGTCTTCTCCAGGCTACTTAGCCGTTTAGCTGGCCAA
GGCCACGACCTCTCCTCTCCCGTTGATCTCAAGGATTATTGGGCTAGGCACGGCACCAATCGCTACATCA
CGATCATGTTCAAAGGCTTAAACCAGTTCTTCATTCTCTCCGCTCTTCTGACGGATCATCATCTACTACG
GCGTGTCCCTAATCGTTCCCCCTGTTCTCATCTTGCTCAGTCTGGACATCACCAGATGTTCCAGCAACTC
AACATCAGCCAGTTAAGACAGGAGCCACAGTCCCAGCAGAACCTGATGGTAACATGTCAGCAAAATGGAC
AAGTAGACCGGGTGTTGAGGTGTTGGCTGGATCAAAGGATTGGAGTTGCATATCAACTAGGCAAACATGT
GTTGAATTATTATATGAAATCCAGTTATATTTCAGATCATGAAATATCTAAAACCCTAAAAACATTATCA
CACTCTCTATTCATCTCTGTGTACGGCGACGAGGGAGCTATGTCCGATGTAGTAGCGGAAGGAGATGCAA
AGCCGACGAAGCAGTTCAGTGTGTATGAAGCGACATCGGAGGAGCTTATAGAACGGTCGATGGCGCCTAT
AAAAAAGGAGTTCCTATGTCCTCCTCCGAGCCGTTCCCTAAAGCAAAGTGACGTGAAGGCGCCTCATCCG
AGCTTAGTCCAGGAGAAGAAATCGAAACGACAGCTCAAAAGAGAACGCCGCGAGAAATGTGCGATAAACC
TGTGTCCGCAGGTTTCGAGAACAGAAGACGTTGATTCCTGTCAGTATAAAGAGAAATGCCGCTTTAATCA
TGACATTCAAGCTTTCAAAGCTCAGAAACCAGATGATATAGAGGGGCAGTGCCCATTTGTGGCCTCTGGG
ATCAAGTGTCAATATGGTTTATCATGCAGATTCTTTGGCACCCACAAAGATTTATCTGGAATTTCTGATG
CCGAGATTAACTTTTTCAACAAAGAGACACAGAGGCTTTTATGGAAAAATAACATGACTTTTCCCAAGGC
AGATGCCAAACTTAAATCCCTTGGCCTTATGGGCCATGCCAAAAAAAGCAATGTCGCTCAAGAGAATGAC
GCAGAAAAACCTCTAGATGGTGCTCAGACAAATGAAGATGTTGATATTCCTGGACCATTAGAAACTGAAG
ATGTTCGTCCCACGAAAAAGGCCAAATCTGACGAGACTTCTAAACTAGGACACATTATTGATGGGGTGAT
GAATGTAGACGATGAAACGGAAAAGACCGGGAATTCTACTTCAAAAGCCAAGATAGAGGATGACGAGGAT
ATCATCAAAGTTATTGAAACCGATGGGAGCCTAAAATCGCATCCTCGCGAAAAGAAAAAGCTTATTGATT
TTAGGGATAAGTTGTACCTTGCACCCCTAACAACTGTGGGCAATCTTCCCTTCAGAAGACTTTGCAAAGT
TTTGGGAGCTGATGTGACTTGTGGTGAGATGGCCATGTGTACAAATCTTTTGCAGGGCCAAGCTTCCGAA
TGGGCTCTGCTTAGAAGGCATTCATCGGAAGATCTGTTTGGTGTTCAGATTTGCGATATTGGGGATTGGG
GAGCCACTGCAGTGACGATTCATGGGCGGTCAAGACAACAACGTTATAGCAAGTCTGCGGATTGGGACTA
TATATACCAGTGTACCAAAAACGCTTCCCCTAACCTACAAGTTATAGGAAATGGAGATGTGTACTCTTTT
TTAGACTGGAACAAACACAAGTCTGACTGTCCTGAGTTGTCTAGCTGCATGATTGCTCGCGGAGCACTGA
TCAAGCCTTGGATATTTACTGAAATTAAAGAGCAACGGCACTGGGACATTACCTCCGGGGAAAGACTCAA
CATCTTCAAGGACTTTGTACGTTTTGGTCTTCAACATTGGGGATCCGATACAAAAGGAGTTGAGACAACT
AGGCATTTCTTACTGGAGTGGCTAAGCTACACGTTTAGGTACATACCTGTAGGTCTGCTGGATGTGATCC
CGCAGCAAATCAACTGGCGTCCGCCTTCTTACTTTGGTCGTGATGATCTCGAGACTCTCATGATGTCTGA
ATCTGCGGGTGACTGGGTGAGGATATCGGAATTGCTGCTTGGAAAGGTTCCGGAAGGCTTCACGTTTGCC
CCCAAACACAAATCCAACGCTTATGATCGAGCTGAAAATGGCTAA
Upstream Sequence
ATGTGTGATACAAGATTACGTTCCAAGTGGCGGAGTGTGATTGGCAAGATGGCATCAACCTCACTCCTCA
AGGCATCTCCTGTGTTGGACAAATCGGAGTGGGTCAAGGGACAAAGCGTTCTCTTCCGTCAGCCATCTTC
CGCCGCAGTCGTCATACGTAACCGTGCCACCTCCCTTACCGTCCGTGCCGCTTCTTCCTACGCCGATGAG
CTTGTTAAGACAGCGAAAACAATTGCGTCTCCTGGACGAGGAATCTTGGCGATGGACGAGTCGAACGCGA
CTTGCGGGAAGCGTTTGGACTCGATAGGGCTAGAGAACACTGAGGCAAACCGACAAGCATACAGGACATT
GCTGGTCTCTGCACCAGGACTCGGACAGTACATCTCCGGTGCAATCCTGTTCGAGGAGACTCTGTATCAG
TCGACCACCGAAGGCAAGAAAATGGTCGACGTCCTCGTCGAGCAGAACATCGTCCCCGGTATCAAAGTCG
ACAAGGGTTTGGTCCCACTTGTTGGCTCTAACAATGAGTCATGGTGCCAAGGGCTCGATGGTCTATCCTC
CCGAACTGCTGCTTACTATCAACAGGGTGCTCGTTTCGCCAAATGGCGTACTGTCGTGAGCATTCCCAAT
GGTCCGTCTGCTCTAGCTGTAAAAGAAGCTGCTTGGGGCCTTGCCCGATACGCTGCCATTTCACAAGACA
GTGGATTGGTTCCAATAGTGGAGCCAGAGATATTGTTGGACGGAGAACACGACATCGACAGGACTTACGA
AGTAGCAGAGAAGGTCTGGGCTGAGGTTTTCTTTTACCTTGCTCAGAACAATGTCATGTTTGAAGGTATT
CTCCTGAAGCCGAGCATGGTGACTCCCGGAGCCGAGTCTAAAGACAGAGCTACTCCTGAACAAGTTGCCT
CCTACACCCTTAAACTCCTCCGCAACAGAATCCCTCCGGCCTTCTTGTCGGGAGGACAGTCTGAGTTGGA
GGCAACGTTGAACCTGAACGCGATGAACCAAGCACCAAACCCGTGGCATGTATCCTTCTCCTACGCACGT
GCCCTGCAGAACACATGTTTGAAAACATGGGGAGGCAGAGCTGAGAACGTGAACGCAGCTCAGACCACTC
TATTGGCTAGAGCCAAGGCCAATTCGCTGGCTCAGCTTGGAAAATACACAGGAGAAGGTGAGTCTGAAGA
GGCTAAGGAGGGCTCGCTGAATATAAGAGCCCAACGTAGGGACGTCGACGTGGCAGGCAAAAAGGTCATT
AACAAGATTTGTCACGTCACTGCCCCATCAAGCTCTCGGGAGAGAAAGAGAAGCCGGCGAATCTGTGATC
TGGGGGACCAAATCAATGCCGATCGGTCTAGTGGTCCCTCTAGACGGGGAGAGAGAGAGAGGAAATGCAA
ACTGAAGAAGAAGAAGAAGAAGAAGAACCAAATCAGTAGAGCAGAGATCCTCACCCGATTCCGAGCAAAA
TCCACAAGGAAGATGGAAAAGACACACATGTCTGAGAAGATACTGGTTCTCGTGAGACTGAGGCCCCCTA
ACCAGAAAGAGATTGCCTCTAACGAACCTACGGAGGATTGGGAATGTCTCAATGATACCACCATTTTGTA
CAGAAGAAACACCTTCCGCCAATCCTCCAACTTTCCTTCTGCTTATTCTTTTGATAGAGTATACGGAGGT
GAATGTTCCACCAGACAAGTCTACGAGAATGGAACCAAGGACATCGCTCTCTCTGTTGTCAAAGGAATCA
ATTGTAGTATTTTTGCGTACGGCCAGACGAGTAGCGGAAAGACTTACACCATGTCTGCTATCACTGAGTT
TGCTCATGAAGAAAGAGCATTTTCCGTTAAATTTTCAGCTATAGAGATCTACAATGAGGCCATCCGAGAT
TTGCTCAGCTCCGATGGTACATCCCTTAGGCTACGAGACGATCCTGAGAAAGGGACAGTGGTTGAAAAAG
CCACAGAGGAAACTCTGCGAGATTGGAACCATCTCAAGGACCTTCTATCTGTTTGTGAAGCACAACGTAA
GATTGGTGAAACCTCACTGAATGAGAGAAGTTCCAGATCTCATCAGATTATCAGACTGACGGTTGAAAGC
TCTGCTCGTGAGTTCTTAGACAAAGAAAACTCCACCACCCTCATGGCGAGTGTGAATTTCATAGATCTGG
CGGGAAGCGAGCGTGCATCACAGGCGATGTCAGCTGGTGCGAGGCTCAAGGAAGGCTGCCATATCAACCG
AAGTTTGCTTACTCTTGGAACTGTGATCCGTAAACTTAGTAAGGGAAGGCAAGGGCACATCAACTTTAGA
GACTCCAAGCTCACACGAATTCTACAGCCATGCTTGGGTGGTAATTCAAGAACCGCCATCATCTGCACTC
TGAGCCCAGCGAGGAGTCACGTCGAGCTAACGAGGAACACTCTCTTGTTTGCTTGCTGTGCAAAGGAAGT
TACCACAAAGGCCCGGATCAACGTCGTTATGTCAGACAAGGCCCTGCTGCAGCAACTACAGCGTGAGCTT
GCGAGGCTCGAGACCGAGCTGAGAACCCCTGCCCCGCCTGCCTCGAAATGTGATTGTGCGATGACGGTGA
GGAAGAAGAATCTTCAGATACAAAAGATGGAAAAGGAGATGGCAGAGTTGAGAGAAGAGAGAGATCTTGC
TCAATCCCGGCTTGAAGATATCATGAGAATGGTTGAACTCGATGAGGCCTCAAAGTGTGGAACTCCGCAG
CACATAGACAAGTGGGAGGATGGTTCAGTGTCACAGACATCAATAACAAGAGCTTATGTTGGGTCTCATT
CTGAGGATGATGATGATGAAGAGCTGCCTACGCGTTCTGAAGATCCATCAGAAGAATATTGCAGAGAAGT
TCAATGCATTGAGATTGAGACATCAGCTACAGTCAACCACAAAGACGAAAAAAGAGCAGAACCTAAGAAC
ATTTTAGGCCCTAGTGTAGGCCAAAACGTGAGGTTGAGAAGCTGGAACCGCAGAGAGACCGCGAGCACTC
CACCTGAAAACATAGGGACAGAAAGACCAGAAGAAGAGAGTCACAAGAAGATAGTGTTTTCTGGTTTAGA
GTTGGGTTCTAGTGTGTCAAGGAATGATTCGCTGTCTTCTTGCGGGAGCGATTCCACTGCGACTCAGAGC
ATCAGAACGCCCTTGGGAGAAGAGGGAGGTATCACTAGCATCCGCACTTTCGTTGATGGTCTTAAGGAGA
TGGCTAAGCGTCAAGGCCAGGTGTCGATTGGTGATGATGAATCTGGTAAAATGGGGAGGGATATTGGTCT
GGTCATTATGGACATGGAATTTGAGAGGCAGCGGCGAGAGATTGTTGAGCTATGGCAAAGCTGTAACGTC
TCTTTGGTGCATAGAACATACTTCTACTTGCTCTTCAAAGGAGATGATGAGGCTGATTCAATCTACATTG
GGGTTGAGCTAAGAAGACTTTTGTTTATGAAAGCTCGCTTCTCTCAGGGAAACCAAACTTTGGAAGGAGG
AGAAACCTTAACACTGGCTTCAAGCCGGAAGGCGCTGCACGGGGAGAGAATGATGCTGAGCAAGCTGGTG
GGGAAAAGGTTTTCAGGAGAAGAGAGGAGAAGAATGTATCACAAGTTTGGGATTGGTGTCAACTCCAAAC
GCAGGCGTTTACAACTCGTGAACGAGCTTTGGAGCAATCCCAAGGACATGAGTCAGGTTGTGGAAAGCGC
AGATGTTGTAGGGAAGCTTGTGAGGTTCGCTGAGCAAGGGAGAGCCATGAAGGAGATGTTTGGTCTGGCC
TTCACCCCTCCTTCCTTCTTGACAGCTCAGAGATGGCGAGCCAACGTTGGTGTTTCTCTCAACAAAGTGA
AATCAAATGCAGGTCGGTTAGCCAGTGAAAATGGAATGTACAGGAGAAGACGAGGGCTGCGGCAGCGCCA
AGGAAAGACGATACTAACCTCCCTCCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTTCCTTTGGCTCTTCT
CCCGAGATCTTTGCTCTCTATTCCCTTCGCTCCTTTCTCCTTCCATATCCTCAGAGGATGGAGAGCGGTG
GTCAGTATGAAAATGGGCGTTACAACCCCGATTACTACAAAGAAGGAACACACTCTGTCTGGAATGCGAT
GCCTAATCATCATCAGACAAAGGAGGACCAACATAATGCTCTGGTCATGAATCAGAAGATCATGTCCATC
CTTGCCGAGAGAGACGCTGCCCTCAAGGAAAGAGATGACGCCCTGGCTGCCAAGCAGGAAGCTTTAGCCG
CTCGGGACGAAGCACTTGACCTACGCGACAAAGCTCTCTCTCTAAGAGATAATGCTATTCTGGAGAGGGA
CAGTGCCTTAAGTGCTCTTCAGTTCCGTGAACACAATCTAAACTACATTTTGTCACGTGCAAAGCTCGGT
GCCTCCCAAAGCTCTCACTTACCCAACCCTTCCCCCTTGTCAACTATTCCACATGAAGCTGCTCCAAGTA
AAAGAAAAAAAAAGCGCAAACAGGAAACAAGGTCAAAGGGAAAGAGAGTAGGCGAAGATCATGTTGCTTC
TCCTGGAAAGAAATGCAGAAAAGATTGGGACAGTAACGTTGTCGGCTTAAACCTTGTCACCTTCGATGAG
ACGACAATGCCAGTGCCCATGTGCACTTGTACTGGTACTGCTCGTCACTGTTACAAATGGGGGAACGGCG
GGTGGCAATCATCATGCTGCACTACCACTTTGTCTCTGTATCCTCTTCCGCAGATGCCAAACAAGCGCCA
TTCTCGAGTGGGCGGTAGGAAAATGAGCGGAAACGTCTTCTCCAGGCTACTTAGCCGTTTAGCTGGCCAA
GGCCACGACCTCTCCTCTCCCGTTGATCTCAAGGATTATTGGGCTAGGCACGGCACCAATCGCTACATCA
CGATCATGTTCAAAGGCTTAAACCAGTTCTTCATTCTCTCCGCTCTTCTGACGGATCATCATCTACTACG
GCGTGTCCCTAATCGTTCCCCCTGTTCTCATCTTGCTCAGTCTGGACATCACCAGATGTTCCAGCAACTC
AACATCAGCCAGTTAAGACAGGAGCCACAGTCCCAGCAGAACCTGATGGTAACATGTCAGCAAAATGGAC
AAGTAGACCGGGTGTTGAGGTGTTGGCTGGATCAAAGGATTGGAGTTGCATATCAACTAGGCAAACATGT
GTTGAATTATTATATGAAATCCAGTTATATTTCAGATCATGAAATATCTAAAACCCTAAAAACATTATCA
CACTCTCTATTCATCTCTGTGTACGGCGACGAGGGAGCTATGTCCGATGTAGTAGCGGAAGGAGATGCAA
AGCCGACGAAGCAGTTCAGTGTGTATGAAGCGACATCGGAGGAGCTTATAGAACGGTCGATGGCGCCTAT
AAAAAAGGAGTTCCTATGTCCTCCTCCGAGCCGTTCCCTAAAGCAAAGTGACGTGAAGGCGCCTCATCCG
AGCTTAGTCCAGGAGAAGAAATCGAAACGACAGCTCAAAAGAGAACGCCGCGAGAAATGTGCGATAAACC
TGTGTCCGCAGGTTTCGAGAACAGAAGACGTTGATTCCTGTCAGTATAAAGAGAAATGCCGCTTTAATCA
TGACATTCAAGCTTTCAAAGCTCAGAAACCAGATGATATAGAGGGGCAGTGCCCATTTGTGGCCTCTGGG
ATCAAGTGTCAATATGGTTTATCATGCAGATTCTTTGGCACCCACAAAGATTTATCTGGAATTTCTGATG
CCGAGATTAACTTTTTCAACAAAGAGACACAGAGGCTTTTATGGAAAAATAACATGACTTTTCCCAAGGC
AGATGCCAAACTTAAATCCCTTGGCCTTATGGGCCATGCCAAAAAAAGCAATGTCGCTCAAGAGAATGAC
GCAGAAAAACCTCTAGATGGTGCTCAGACAAATGAAGATGTTGATATTCCTGGACCATTAGAAACTGAAG
ATGTTCGTCCCACGAAAAAGGCCAAATCTGACGAGACTTCTAAACTAGGACACATTATTGATGGGGTGAT
GAATGTAGACGATGAAACGGAAAAGACCGGGAATTCTACTTCAAAAGCCAAGATAGAGGATGACGAGGAT
ATCATCAAAGTTATTGAAACCGATGGGAGCCTAAAATCGCATCCTCGCGAAAAGAAAAAGCTTATTGATT
TTAGGGATAAGTTGTACCTTGCACCCCTAACAACTGTGGGCAATCTTCCCTTCAGAAGACTTTGCAAAGT
TTTGGGAGCTGATGTGACTTGTGGTGAGATGGCCATGTGTACAAATCTTTTGCAGGGCCAAGCTTCCGAA
TGGGCTCTGCTTAGAAGGCATTCATCGGAAGATCTGTTTGGTGTTCAGATTTGCGATATTGGGGATTGGG
GAGCCACTGCAGTGACGATTCATGGGCGGTCAAGACAACAACGTTATAGCAAGTCTGCGGATTGGGACTA
TATATACCAGTGTACCAAAAACGCTTCCCCTAACCTACAAGTTATAGGAAATGGAGATGTGTACTCTTTT
TTAGACTGGAACAAACACAAGTCTGACTGTCCTGAGTTGTCTAGCTGCATGATTGCTCGCGGAGCACTGA
TCAAGCCTTGGATATTTACTGAAATTAAAGAGCAACGGCACTGGGACATTACCTCCGGGGAAAGACTCAA
CATCTTCAAGGACTTTGTACGTTTTGGTCTTCAACATTGGGGATCCGATACAAAAGGAGTTGAGACAACT
AGGCATTTCTTACTGGAGTGGCTAAGCTACACGTTTAGGTACATACCTGTAGGTCTGCTGGATGTGATCC
CGCAGCAAATCAACTGGCGTCCGCCTTCTTACTTTGGTCGTGATGATCTCGAGACTCTCATGATGTCTGA
ATCTGCGGGTGACTGGGTGAGGATATCGGAATTGCTGCTTGGAAAGGTTCCGGAAGGCTTCACGTTTGCC
CCCAAACACAAATCCAACGCTTATGATCGAGCTGAAAATGGCTAATTACAAGTCACATCTTATGTAAGCT
CAATCTTCATCCATAAACATGTCTGAAACTGTGATGAGCAAATTTCTATTCAATAGTTTTTTTGAGTGTA
GACAGCATTCGTGAA
Downstream Sequence
GGTATTTTTCTTGAATTGTGAGTGATACAAGATTACGTTCCAAGTGGCGGAGTGTGATTG
GCAAGGTCACAAGGACCTCAAATAACTGATCTGATTAGAAATATAAATCCAAAAGAGAGA
GAGTGAGGTTGCTCGTCCTCCTCAGAGATAAGGTGGTGGTGGTGTGTGTGTGTGTCATCA
TTCATAAGCCACTGTGTGTCTCTCGAACAAACCAAATTAAGGCAGAGAGAAGAGAAAGAG
ATAACATTTAACAAAAGATGGCATCAACCTCACTCCTCAAGGCATCTCCTGTGTTGGACA
AATCGGAGTGGGTCAAGGGACAAAGCGTTCTCTTCCGTCAGCCATCTTCCGCCGCAGTCG
TCATACGTAACCGTGCCACCTCCCTTACCGTCCGTGCCGCTTCTTCCTACGCCGATGAGC
TTGTTAAGACAGCGGTTAGTACTCCCACCCACACATATCCTCTTCTTTGTCACGTGTCTT
ATCTGCATGATACATAAAGGTTTTTTACTTCTTTTATTCTTTTGATCTTTTGAATTGGAA
TGCCTAACAAAATTAATTATGTTCTGTTTTCAGTTCTTTTTTTAATAATTTGTAGTCTTA
ACTTTTCTTTCTTTTCACAACCATTTTTTCACTAGTTGGGACTTTGGGTTGATTTTAGTT
CATGTTTGTTAATGTAAGAGCTCATATTCATAGATAAAAGTTAGTATAAAATAACTCGAA
TTACGATAATAATGATGTTTTATGGTGTTGGGAGCAGAAAACAATTGCGTCTCCTGGACG
AGGAATCTTGGCGATGGACGAGTCGAACGCGACTTGCGGGAAGCGTTTGGACTCGATAGG
GCTAGAGAACACTGAGGCAAACCGACAAGCATACAGGACATTGCTGGTCTCTGCACCAGG
ACTCGGACAGTACATCTCCGGTGCAATCCTGTTCGAGGAGACTCTGTATCAGTCGACCAC
CGAAGGCAAGAAAATGGTCGACGTCCTCGTCGAGCAGAACATCGTCCCCGGTATCAAAGT
CGACAAGGTATGACTAATTAAAGTGCATGCTTATTTTTTATATATGTACCTAACGTGGTG
CTTGATTAATTAGAGGAATCTAAGCGAGTTGTATCTATTTATGGAAGTAGGGTTTGGTCC
CACTTGTTGGCTCTAACAATGAGTCATGGTGCCAAGGGCTCGATGGTCTATCCTCCCGAA
CTGCTGCTTACTATCAACAGGGTGCTCGTTTCGCCAAATGGTATAGTACCCTCTGTTCCT
ATCCAAAATGTTTCCTCTCGGCTGAGTTTTTTCTTTGACTTTTGTACTCTTGATGGTTTA
ATTAGGCGTACTGTCGTGAGCATTCCCAATGGTCCGTCTGCTCTAGCTGTAAAAGAAGCT
GCTTGGGGCCTTGCCCGATACGCTGCCATTTCACAAGTAAACACACATTGCTTCTCTTTC
AAGCAACAGGTCAACTCGGATTCTTTTAAAGCTATTATTTCACACGCAGATTGTGTTTTG
TGTTTTGTCTTTTCAGGACAGTGGATTGGTTCCAATAGTGGAGCCAGAGATATTGTTGGA
CGGAGAACACGACATCGACAGGACTTACGAAGTAGCAGAGAAGGTCTGGGCTGAGGTTTT
CTTTTACCTTGCTCAGAACAATGTCATGTTTGAAGGTATTACTAGCTAAGTATATGGAGC
TGCTTATCTTATATTATCAAGTTGTGTTAAAATATCAACATCTTGTACAAAGGTATTCTC
CTGAAGCCGAGCATGGTGACTCCCGGAGCCGAGTCTAAAGACAGAGCTACTCCTGAACAA
GTTGCCTCCTACACCCTTAAACTCCTCCGCAACAGAATCCCTCCGGCCGTCCCCGGAATC
ATGGCAAGCTCTATTCCTCTCTCTTCACTTCACATAGTATAAAAATATCTATAAAAATCA
AAATATTGCTTGATTTGATGGAAATTGCTTATGATGAGAACCCTAGATCTTGGTTGCTTG
AGATGGAAGTAAAGTTTTTT
mRNA Sequence
ATGTGTGATACAAGATTACGTTCCAAGTGGCGGAGTGTGATTGGCAAGATGGCATCAACCTCACTCCTCA
AGGCATCTCCTGTGTTGGACAAATCGGAGTGGGTCAAGGGACAAAGCGTTCTCTTCCGTCAGCCATCTTC
CGCCGCAGTCGTCATACGTAACCGTGCCACCTCCCTTACCGTCCGTGCCGCTTCTTCCTACGCCGATGAG
CTTGTTAAGACAGCGAAAACAATTGCGTCTCCTGGACGAGGAATCTTGGCGATGGACGAGTCGAACGCGA
CTTGCGGGAAGCGTTTGGACTCGATAGGGCTAGAGAACACTGAGGCAAACCGACAAGCATACAGGACATT
GCTGGTCTCTGCACCAGGACTCGGACAGTACATCTCCGGTGCAATCCTGTTCGAGGAGACTCTGTATCAG
TCGACCACCGAAGGCAAGAAAATGGTCGACGTCCTCGTCGAGCAGAACATCGTCCCCGGTATCAAAGTCG
ACAAGGGTTTGGTCCCACTTGTTGGCTCTAACAATGAGTCATGGTGCCAAGGGCTCGATGGTCTATCCTC
CCGAACTGCTGCTTACTATCAACAGGGTGCTCGTTTCGCCAAATGGCGTACTGTCGTGAGCATTCCCAAT
GGTCCGTCTGCTCTAGCTGTAAAAGAAGCTGCTTGGGGCCTTGCCCGATACGCTGCCATTTCACAAGACA
GTGGATTGGTTCCAATAGTGGAGCCAGAGATATTGTTGGACGGAGAACACGACATCGACAGGACTTACGA
AGTAGCAGAGAAGGTCTGGGCTGAGGTTTTCTTTTACCTTGCTCAGAACAATGTCATGTTTGAAGGTATT
CTCCTGAAGCCGAGCATGGTGACTCCCGGAGCCGAGTCTAAAGACAGAGCTACTCCTGAACAAGTTGCCT
CCTACACCCTTAAACTCCTCCGCAACAGAATCCCTCCGGCCTTCTTGTCGGGAGGACAGTCTGAGTTGGA
GGCAACGTTGAACCTGAACGCGATGAACCAAGCACCAAACCCGTGGCATGTATCCTTCTCCTACGCACGT
GCCCTGCAGAACACATGTTTGAAAACATGGGGAGGCAGAGCTGAGAACGTGAACGCAGCTCAGACCACTC
TATTGGCTAGAGCCAAGGCCAATTCGCTGGCTCAGCTTGGAAAATACACAGGAGAAGGTGAGTCTGAAGA
GGCTAAGGAGGGCTCGCTGAATATAAGAGCCCAACGTAGGGACGTCGACGTGGCAGGCAAAAAGGTCATT
AACAAGATTTGTCACGTCACTGCCCCATCAAGCTCTCGGGAGAGAAAGAGAAGCCGGCGAATCTGTGATC
TGGGGGACCAAATCAATGCCGATCGGTCTAGTGGTCCCTCTAGACGGGGAGAGAGAGAGAGGAAATGCAA
ACTGAAGAAGAAGAAGAAGAAGAAGAACCAAATCAGTAGAGCAGAGATCCTCACCCGATTCCGAGCAAAA
TCCACAAGGAAGATGGAAAAGACACACATGTCTGAGAAGATACTGGTTCTCGTGAGACTGAGGCCCCCTA
ACCAGAAAGAGATTGCCTCTAACGAACCTACGGAGGATTGGGAATGTCTCAATGATACCACCATTTTGTA
CAGAAGAAACACCTTCCGCCAATCCTCCAACTTTCCTTCTGCTTATTCTTTTGATAGAGTATACGGAGGT
GAATGTTCCACCAGACAAGTCTACGAGAATGGAACCAAGGACATCGCTCTCTCTGTTGTCAAAGGAATCA
ATTGTAGTATTTTTGCGTACGGCCAGACGAGTAGCGGAAAGACTTACACCATGTCTGCTATCACTGAGTT
TGCTCATGAAGAAAGAGCATTTTCCGTTAAATTTTCAGCTATAGAGATCTACAATGAGGCCATCCGAGAT
TTGCTCAGCTCCGATGGTACATCCCTTAGGCTACGAGACGATCCTGAGAAAGGGACAGTGGTTGAAAAAG
CCACAGAGGAAACTCTGCGAGATTGGAACCATCTCAAGGACCTTCTATCTGTTTGTGAAGCACAACGTAA
GATTGGTGAAACCTCACTGAATGAGAGAAGTTCCAGATCTCATCAGATTATCAGACTGACGGTTGAAAGC
TCTGCTCGTGAGTTCTTAGACAAAGAAAACTCCACCACCCTCATGGCGAGTGTGAATTTCATAGATCTGG
CGGGAAGCGAGCGTGCATCACAGGCGATGTCAGCTGGTGCGAGGCTCAAGGAAGGCTGCCATATCAACCG
AAGTTTGCTTACTCTTGGAACTGTGATCCGTAAACTTAGTAAGGGAAGGCAAGGGCACATCAACTTTAGA
GACTCCAAGCTCACACGAATTCTACAGCCATGCTTGGGTGGTAATTCAAGAACCGCCATCATCTGCACTC
TGAGCCCAGCGAGGAGTCACGTCGAGCTAACGAGGAACACTCTCTTGTTTGCTTGCTGTGCAAAGGAAGT
TACCACAAAGGCCCGGATCAACGTCGTTATGTCAGACAAGGCCCTGCTGCAGCAACTACAGCGTGAGCTT
GCGAGGCTCGAGACCGAGCTGAGAACCCCTGCCCCGCCTGCCTCGAAATGTGATTGTGCGATGACGGTGA
GGAAGAAGAATCTTCAGATACAAAAGATGGAAAAGGAGATGGCAGAGTTGAGAGAAGAGAGAGATCTTGC
TCAATCCCGGCTTGAAGATATCATGAGAATGGTTGAACTCGATGAGGCCTCAAAGTGTGGAACTCCGCAG
CACATAGACAAGTGGGAGGATGGTTCAGTGTCACAGACATCAATAACAAGAGCTTATGTTGGGTCTCATT
CTGAGGATGATGATGATGAAGAGCTGCCTACGCGTTCTGAAGATCCATCAGAAGAATATTGCAGAGAAGT
TCAATGCATTGAGATTGAGACATCAGCTACAGTCAACCACAAAGACGAAAAAAGAGCAGAACCTAAGAAC
ATTTTAGGCCCTAGTGTAGGCCAAAACGTGAGGTTGAGAAGCTGGAACCGCAGAGAGACCGCGAGCACTC
CACCTGAAAACATAGGGACAGAAAGACCAGAAGAAGAGAGTCACAAGAAGATAGTGTTTTCTGGTTTAGA
GTTGGGTTCTAGTGTGTCAAGGAATGATTCGCTGTCTTCTTGCGGGAGCGATTCCACTGCGACTCAGAGC
ATCAGAACGCCCTTGGGAGAAGAGGGAGGTATCACTAGCATCCGCACTTTCGTTGATGGTCTTAAGGAGA
TGGCTAAGCGTCAAGGCCAGGTGTCGATTGGTGATGATGAATCTGGTAAAATGGGGAGGGATATTGGTCT
GGTCATTATGGACATGGAATTTGAGAGGCAGCGGCGAGAGATTGTTGAGCTATGGCAAAGCTGTAACGTC
TCTTTGGTGCATAGAACATACTTCTACTTGCTCTTCAAAGGAGATGATGAGGCTGATTCAATCTACATTG
GGGTTGAGCTAAGAAGACTTTTGTTTATGAAAGCTCGCTTCTCTCAGGGAAACCAAACTTTGGAAGGAGG
AGAAACCTTAACACTGGCTTCAAGCCGGAAGGCGCTGCACGGGGAGAGAATGATGCTGAGCAAGCTGGTG
GGGAAAAGGTTTTCAGGAGAAGAGAGGAGAAGAATGTATCACAAGTTTGGGATTGGTGTCAACTCCAAAC
GCAGGCGTTTACAACTCGTGAACGAGCTTTGGAGCAATCCCAAGGACATGAGTCAGGTTGTGGAAAGCGC
AGATGTTGTAGGGAAGCTTGTGAGGTTCGCTGAGCAAGGGAGAGCCATGAAGGAGATGTTTGGTCTGGCC
TTCACCCCTCCTTCCTTCTTGACAGCTCAGAGATGGCGAGCCAACGTTGGTGTTTCTCTCAACAAAGTGA
AATCAAATGCAGGTCGGTTAGCCAGTGAAAATGGAATGTACAGGAGAAGACGAGGGCTGCGGCAGCGCCA
AGGAAAGACGATACTAACCTCCCTCCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTTCCTTTGGCTCTTCT
CCCGAGATCTTTGCTCTCTATTCCCTTCGCTCCTTTCTCCTTCCATATCCTCAGAGGATGGAGAGCGGTG
GTCAGTATGAAAATGGGCGTTACAACCCCGATTACTACAAAGAAGGAACACACTCTGTCTGGAATGCGAT
GCCTAATCATCATCAGACAAAGGAGGACCAACATAATGCTCTGGTCATGAATCAGAAGATCATGTCCATC
CTTGCCGAGAGAGACGCTGCCCTCAAGGAAAGAGATGACGCCCTGGCTGCCAAGCAGGAAGCTTTAGCCG
CTCGGGACGAAGCACTTGACCTACGCGACAAAGCTCTCTCTCTAAGAGATAATGCTATTCTGGAGAGGGA
CAGTGCCTTAAGTGCTCTTCAGTTCCGTGAACACAATCTAAACTACATTTTGTCACGTGCAAAGCTCGGT
GCCTCCCAAAGCTCTCACTTACCCAACCCTTCCCCCTTGTCAACTATTCCACATGAAGCTGCTCCAAGTA
AAAGAAAAAAAAAGCGCAAACAGGAAACAAGGTCAAAGGGAAAGAGAGTAGGCGAAGATCATGTTGCTTC
TCCTGGAAAGAAATGCAGAAAAGATTGGGACAGTAACGTTGTCGGCTTAAACCTTGTCACCTTCGATGAG
ACGACAATGCCAGTGCCCATGTGCACTTGTACTGGTACTGCTCGTCACTGTTACAAATGGGGGAACGGCG
GGTGGCAATCATCATGCTGCACTACCACTTTGTCTCTGTATCCTCTTCCGCAGATGCCAAACAAGCGCCA
TTCTCGAGTGGGCGGTAGGAAAATGAGCGGAAACGTCTTCTCCAGGCTACTTAGCCGTTTAGCTGGCCAA
GGCCACGACCTCTCCTCTCCCGTTGATCTCAAGGATTATTGGGCTAGGCACGGCACCAATCGCTACATCA
CGATCATGTTCAAAGGCTTAAACCAGTTCTTCATTCTCTCCGCTCTTCTGACGGATCATCATCTACTACG
GCGTGTCCCTAATCGTTCCCCCTGTTCTCATCTTGCTCAGTCTGGACATCACCAGATGTTCCAGCAACTC
AACATCAGCCAGTTAAGACAGGAGCCACAGTCCCAGCAGAACCTGATGGTAACATGTCAGCAAAATGGAC
AAGTAGACCGGGTGTTGAGGTGTTGGCTGGATCAAAGGATTGGAGTTGCATATCAACTAGGCAAACATGT
GTTGAATTATTATATGAAATCCAGTTATATTTCAGATCATGAAATATCTAAAACCCTAAAAACATTATCA
CACTCTCTATTCATCTCTGTGTACGGCGACGAGGGAGCTATGTCCGATGTAGTAGCGGAAGGAGATGCAA
AGCCGACGAAGCAGTTCAGTGTGTATGAAGCGACATCGGAGGAGCTTATAGAACGGTCGATGGCGCCTAT
AAAAAAGGAGTTCCTATGTCCTCCTCCGAGCCGTTCCCTAAAGCAAAGTGACGTGAAGGCGCCTCATCCG
AGCTTAGTCCAGGAGAAGAAATCGAAACGACAGCTCAAAAGAGAACGCCGCGAGAAATGTGCGATAAACC
TGTGTCCGCAGGTTTCGAGAACAGAAGACGTTGATTCCTGTCAGTATAAAGAGAAATGCCGCTTTAATCA
TGACATTCAAGCTTTCAAAGCTCAGAAACCAGATGATATAGAGGGGCAGTGCCCATTTGTGGCCTCTGGG
ATCAAGTGTCAATATGGTTTATCATGCAGATTCTTTGGCACCCACAAAGATTTATCTGGAATTTCTGATG
CCGAGATTAACTTTTTCAACAAAGAGACACAGAGGCTTTTATGGAAAAATAACATGACTTTTCCCAAGGC
AGATGCCAAACTTAAATCCCTTGGCCTTATGGGCCATGCCAAAAAAAGCAATGTCGCTCAAGAGAATGAC
GCAGAAAAACCTCTAGATGGTGCTCAGACAAATGAAGATGTTGATATTCCTGGACCATTAGAAACTGAAG
ATGTTCGTCCCACGAAAAAGGCCAAATCTGACGAGACTTCTAAACTAGGACACATTATTGATGGGGTGAT
GAATGTAGACGATGAAACGGAAAAGACCGGGAATTCTACTTCAAAAGCCAAGATAGAGGATGACGAGGAT
ATCATCAAAGTTATTGAAACCGATGGGAGCCTAAAATCGCATCCTCGCGAAAAGAAAAAGCTTATTGATT
TTAGGGATAAGTTGTACCTTGCACCCCTAACAACTGTGGGCAATCTTCCCTTCAGAAGACTTTGCAAAGT
TTTGGGAGCTGATGTGACTTGTGGTGAGATGGCCATGTGTACAAATCTTTTGCAGGGCCAAGCTTCCGAA
TGGGCTCTGCTTAGAAGGCATTCATCGGAAGATCTGTTTGGTGTTCAGATTTGCGATATTGGGGATTGGG
GAGCCACTGCAGTGACGATTCATGGGCGGTCAAGACAACAACGTTATAGCAAGTCTGCGGATTGGGACTA
TATATACCAGTGTACCAAAAACGCTTCCCCTAACCTACAAGTTATAGGAAATGGAGATGTGTACTCTTTT
TTAGACTGGAACAAACACAAGTCTGACTGTCCTGAGTTGTCTAGCTGCATGATTGCTCGCGGAGCACTGA
TCAAGCCTTGGATATTTACTGAAATTAAAGAGCAACGGCACTGGGACATTACCTCCGGGGAAAGACTCAA
CATCTTCAAGGACTTTGTACGTTTTGGTCTTCAACATTGGGGATCCGATACAAAAGGAGTTGAGACAACT
AGGCATTTCTTACTGGAGTGGCTAAGCTACACGTTTAGGTACATACCTGTAGGTCTGCTGGATGTGATCC
CGCAGCAAATCAACTGGCGTCCGCCTTCTTACTTTGGTCGTGATGATCTCGAGACTCTCATGATGTCTGA
ATCTGCGGGTGACTGGGTGAGGATATCGGAATTGCTGCTTGGAAAGGTTCCGGAAGGCTTCACGTTTGCC
CCCAAACACAAATCCAACGCTTATGATCGAGCTGAAAATGGCTAATTACAAGTCACATCTTATGTAAGCT
CAATCTTCATCCATAAACATGTCTGAAACTGTGATGAGCAAATTTCTATTCAATAGTTTTTTTGAGTGTA
GACAGCATTCGTGAA
Pro Sequence
MCDTRLRSKWRSVIGKMASTSLLKASPVLDKSEWVKGQSVLFRQPSSAAVVIRNRATSLTVRAASSYADE
LVKTAKTIASPGRGILAMDESNATCGKRLDSIGLENTEANRQAYRTLLVSAPGLGQYISGAILFEETLYQ
STTEGKKMVDVLVEQNIVPGIKVDKGLVPLVGSNNESWCQGLDGLSSRTAAYYQQGARFAKWRTVVSIPN
GPSALAVKEAAWGLARYAAISQDSGLVPIVEPEILLDGEHDIDRTYEVAEKVWAEVFFYLAQNNVMFEGI
LLKPSMVTPGAESKDRATPEQVASYTLKLLRNRIPPAFLSGGQSELEATLNLNAMNQAPNPWHVSFSYAR
ALQNTCLKTWGGRAENVNAAQTTLLARAKANSLAQLGKYTGEGESEEAKEGSLNIRAQRRDVDVAGKKVI
NKICHVTAPSSSRERKRSRRICDLGDQINADRSSGPSRRGERERKCKLKKKKKKKNQISRAEILTRFRAK
STRKMEKTHMSEKILVLVRLRPPNQKEIASNEPTEDWECLNDTTILYRRNTFRQSSNFPSAYSFDRVYGG
ECSTRQVYENGTKDIALSVVKGINCSIFAYGQTSSGKTYTMSAITEFAHEERAFSVKFSAIEIYNEAIRD
LLSSDGTSLRLRDDPEKGTVVEKATEETLRDWNHLKDLLSVCEAQRKIGETSLNERSSRSHQIIRLTVES
SAREFLDKENSTTLMASVNFIDLAGSERASQAMSAGARLKEGCHINRSLLTLGTVIRKLSKGRQGHINFR
DSKLTRILQPCLGGNSRTAIICTLSPARSHVELTRNTLLFACCAKEVTTKARINVVMSDKALLQQLQREL
ARLETELRTPAPPASKCDCAMTVRKKNLQIQKMEKEMAELREERDLAQSRLEDIMRMVELDEASKCGTPQ
HIDKWEDGSVSQTSITRAYVGSHSEDDDDEELPTRSEDPSEEYCREVQCIEIETSATVNHKDEKRAEPKN
ILGPSVGQNVRLRSWNRRETASTPPENIGTERPEEESHKKIVFSGLELGSSVSRNDSLSSCGSDSTATQS
IRTPLGEEGGITSIRTFVDGLKEMAKRQGQVSIGDDESGKMGRDIGLVIMDMEFERQRREIVELWQSCNV
SLVHRTYFYLLFKGDDEADSIYIGVELRRLLFMKARFSQGNQTLEGGETLTLASSRKALHGERMMLSKLV
GKRFSGEERRRMYHKFGIGVNSKRRRLQLVNELWSNPKDMSQVVESADVVGKLVRFAEQGRAMKEMFGLA
FTPPSFLTAQRWRANVGVSLNKVKSNAGRLASENGMYRRRRGLRQRQGKTILTSLPPPPPPPPPPSFGSS
PEIFALYSLRSFLLPYPQRMESGGQYENGRYNPDYYKEGTHSVWNAMPNHHQTKEDQHNALVMNQKIMSI
LAERDAALKERDDALAAKQEALAARDEALDLRDKALSLRDNAILERDSALSALQFREHNLNYILSRAKLG
ASQSSHLPNPSPLSTIPHEAAPSKRKKKRKQETRSKGKRVGEDHVASPGKKCRKDWDSNVVGLNLVTFDE
TTMPVPMCTCTGTARHCYKWGNGGWQSSCCTTTLSLYPLPQMPNKRHSRVGGRKMSGNVFSRLLSRLAGQ
GHDLSSPVDLKDYWARHGTNRYITIMFKGLNQFFILSALLTDHHLLRRVPNRSPCSHLAQSGHHQMFQQL
NISQLRQEPQSQQNLMVTCQQNGQVDRVLRCWLDQRIGVAYQLGKHVLNYYMKSSYISDHEISKTLKTLS
HSLFISVYGDEGAMSDVVAEGDAKPTKQFSVYEATSEELIERSMAPIKKEFLCPPPSRSLKQSDVKAPHP
SLVQEKKSKRQLKRERREKCAINLCPQVSRTEDVDSCQYKEKCRFNHDIQAFKAQKPDDIEGQCPFVASG
IKCQYGLSCRFFGTHKDLSGISDAEINFFNKETQRLLWKNNMTFPKADAKLKSLGLMGHAKKSNVAQEND
AEKPLDGAQTNEDVDIPGPLETEDVRPTKKAKSDETSKLGHIIDGVMNVDDETEKTGNSTSKAKIEDDED
IIKVIETDGSLKSHPREKKKLIDFRDKLYLAPLTTVGNLPFRRLCKVLGADVTCGEMAMCTNLLQGQASE
WALLRRHSSEDLFGVQICDIGDWGATAVTIHGRSRQQRYSKSADWDYIYQCTKNASPNLQVIGNGDVYSF
LDWNKHKSDCPELSSCMIARGALIKPWIFTEIKEQRHWDITSGERLNIFKDFVRFGLQHWGSDTKGVETT
RHFLLEWLSYTFRYIPVGLLDVIPQQINWRPPSYFGRDDLETLMMSESAGDWVRISELLLGKVPEGFTFA
PKHKSNAYDRAENG