SEARCH

Sequence

GENOMES

Enrichment analyses (GO, KEGG, KOG, NR, SwissProt) were conducted,
including CDS regions and upstream 2k sequences.

ZS2 - BnaC02T228300ZS2

CDS Sequence (5200 bp)
ATGCCACCTCCATGCTATTCGCTCCTACTCTCGAATCCTCCGATTCTACCTTCGCTTATTCCTCCGGGCT
ACACGTACTCCTTCACACGTGGACCACAACGCTTCTCCTCCTCCTCCTTGCCCCACTCTCTTCATGGAAT
CGGAAGAAACATCGAGGTCGCCGAAGGAGTACAATTCGATGGGACGATTGCGAGTAGTAGTAGAGAGGAT
GTGAATCATGAGAATGAGGAGGATTTGATGGTCCAAGTGTGTGTGACTCGTACGTTGCCTCCAGCTTTGA
CTCTTGAGCTTGGACTCGAGAGACTCATCGAAGCTGTTGACCAACTCAAAGCTAATCCTCCAAAGTCCTC
TACTGGCGTCTTGCGGTTTCAAGTGGCTGTGCCGCCGAGGGCAAAGGCTTTGTTCTGGTTCTGCTCTCAA
CCTGTGTCTTCCGGTGTATTCCCTGTGTTTTTCCTCTCCAAAGATGATACTCTGGAGGACCCTTGTTATA
AATCTCTCTATGTAAAGGAACCTCATGGGATTTTTGGTATTGGAGACGCACTCTCTTTCCTTCATCACTC
CAAGGCTGGTCAGACCACCATCAAAACATTTCTCTCAGATGAATCAGGTATGGTGAAGGCTTATGGTTTT
CCTGATATTGATTTCAACGGAAACTCTAGTGTACATAGCAAGGATGGCTCTTCTTACTTCTTCGTTCCTC
AGATAGAGTTAGATGAGCACGACGAGATCTCCATATTAGCAGTTACGCTGGCATGGAACGATTCTCTATC
TTACAGATTTGAGCAAGCAATTAGTTCATATGAGAAATCAGTTTTTCAGGTTTCTTGTCATGTCTGCCCC
AATTTAGAGGATCATTGGTTCAAACATCTAAAAAGTTCTCTTGCAAAGTTGGCTGAGCATATGGAGTTTG
CAACATTTTCTAGGAGAGATCATGGTGACGCCAATGAACTGAAAAGTATACAATCATCATGTCAGTTCCA
TTGCAAGCTTTCACCTGAAGTTGTTTTATCAAACAACATGCTGCATCAGGAGGCTGAAGTGAGCAACTTG
TTGAAAGATCAGGCTAATATCAATGCTGTATGGGCATCAGCTATAATTGAAGAATGCACTCGTCTTGGTT
TGACGTACTTTTGTGTAGCTCCTGGATCAAGGTCCTCCCATCTTGCAATTGCTGCTGCTAACCACCCCCT
TACAACGTGTCTTGCATGCTTTGACGAACGATCTCTTGCCTTTCACGCCATTGGGTATGCTAAAGGATCC
CTTAAACCGGCTGTCATTATAACATCATCAGGAACTGCCGTTTCAAATCTTCTTCCAGCGGTGGTTGAAG
CCAGTGAGGATTTCTTGCCTCTGCTACTACTTACTGCAGATCGTCCCCCTGAACTTCAGGGAGTTGGCGC
AAATCAAGCTATAAATCAAATAAACCACTTTGGTTCGTTCGTCAGATTCTTCTTCAATCTCCCTCCTCCA
ACTGATCTTATACCAGTCCGGATGGTCCTTACTACCGTAGACTCTGCTCTACACTGGGCAACAGGTTCTG
CTTGTGGACCAGTACATCTGAATTGTCCTTTTAGAGACCCACTTGACAGTAGTCCAACAAATTGGTCATT
CAACTGCTTAAATGGATTAGACACGTGGATGTCCAATGCTGAACCATTCACAAAATATTTTCAAGTACAA
AGCCTCAAGGGCAATGGTAAAACAAGTGGCCAAATTACTGAGGTTTTACAAGTAATCAAAGAGGCTAAGA
AGGGCCTTCTTCTTATCGGTGCAATCCATACGGAGGATGAAATTTGGGCTTCTCTTCTCTTGGCTAAAGA
ACTGATGTGGCCGGTTGTTGCAGATGTCTTGTCTGGTGTACGGCTGCGCAAGCTTTCTAAACCTTTTCTT
GAGAAGTGGACCCCTGTTTTTATTGATCATCTTGATCATGCCCTGCTTTCGGATTCTGTTAGGAATTTGA
TAGAGTTTGACGTTGTTATCCAGATTGGAAGTCGGATAACAAGTAAAAGAGTTTCTCAGGTGCTTGAGAA
ATGCTTTCCGTTTGCATACATTTTGGTTGATAAGCATCCATGCCGACATGACCCATCACACTTGGTCACT
CACAGGGTCCAAAGCAATATTGTTCAGTTTGCTGATTGTGTGCTTAAATCTATATTTCCATGGAGGAGAA
GCAAATTAGATGGTCATCTACAGGCATTGAATGGCGCTATTGCCCGAGAAATTTCATTTCAATTAGCAGC
TGAGTGCTCCCTGACCGAACCTTATGTTGCACATATGCTTTCCAAAGCACTGACTTCTAAATCAGCTCTT
TTCATCGGAAATAGTATGCCAATAAGGGATGTGGATATGTATGGATGTAGTTCGGGAAACTATTATTCTC
ACGTGGTAGATATGATGTTAAGTGTAGAATCACCATGTCAATGGATACAAGTAACTGGAAATAGAGGAGC
TAGTGGCATTGATGGCTTGCTCAGCACGGCCACTGGCTTTGCTGTAGGATGCAAGAAGAGAGTTGTCTGT
GTGGTGGGAGATATCTCTTTCCTTCATGATACAAATGGATTGGCGATTTTGAAGCAGAGGATTGCGAGGA
AACCAATGACAGTTCTCGTGATAAACAACCGTGGAGGTGGAATCTTCCGACTTCTTCCTATAGCAAAGAG
AACAGAGCCTAGCGTGTTGAATCAATATTTCTATACATCACATGACATTTCCATTAAGAACTTGTGCTTG
GCACATGGTGTGAAGTATGTACATGTTGGGAGAAAAAGTGAACTTGAGGAAACCCTATTGGAACCCAGCC
TGGAAGAGATGGACTGTATTGTGGAGGTTGAAAGCTCTATTGATGCTAACGCGCTCGTTCATAGTACTTT
GGAGAGTTTTGCACGCCAAGCTGCAAATAAATCCTTGGGTATTATCTTGGCCAGTTCACTTCTTCATCCA
ATGATCGACAACGTACTTCTTTTCCAAGTCTCTGGAATACAATATTCGCGGTACAGAGTCAGACTGTGTG
ACAGACCTACAATATATTCTGGTGAATCCTCTCATTTCCATCGAGAAGGGTTCATACTCTCCCTGACTTT
GGAGGATGGAAGCGTTGGCTGCGGAGAGGTTGCACCTTTGGACAGTAGTAGGGAGAACTTAATGGATGTG
GAGGGGCAGCTTCAGTTGATTCTTCATCTTATGAAAGGTGCTAAACTCAGTCACATGCTTCCTTTGTTAA
ATGGCTCGTTTTCTTCCTGGATTCGGAGTGAACTTGGAATCACTGCATCATCAATTTTCCCAAGTGTCAG
ATGTGGTCTAGAAATGGCTCTTCTGAATGCAATGGCAGTAAGACATGATTCTAGTTTGTTGGGGATACTT
CATTGTCAGAAAGAAGAAATTGGTTCTGTTCAGCCACACTCTGTTCCAATATGTGCCCTTGTTGATTGTG
AAGGTACTCCATCAGAGGTCGCATACGTTGCTAGAAAACTTGTTGAAGAAGGGTTCAGTGCTATTAAACT
TAAAGTTGCTCGTCGAGTGAACTCCGTTCAAGATGCTTTAGTTCTGCAAGAAGTAAGGAGAGTCGTTGGC
GATCAAATCGAACTCCGTGCAGATGCTAACTGTCGCTGGACTTTTGAAGAGGCCATAACTTTTGGTTTAT
TGGTGAAAAAGTGCAATCTACAATATATTGAGGAACCTGTCCAGAATAAAGATGATCTTATAAGGTTTTG
TGAAGAAAGTGGATTACCAGTGGCACTTGATGAGACTCTTGATGATTTTAAGGAATGTCCTCTGCGCATG
CTTTCCAAATATACCCATCCTGGAGTAGTTGCTGTTGTTATCAAACCAAGTGTTGTGGGAGGGTTTGAGA
ATGCAGCACTGATTGCTCGCTGGGCACAGCAGCATGGAAAGATGGCTGTTATAAGTGCCGCATACGAAAG
TGGCCTAGGTTTGTCAGCATATATTTTGTTTGCATCGTATTTGGAGACGCTGAACGTCAAAACATTTAGA
GAGAGAAAGCAAGGGATGGCCTCTCTTGTGGCCCATGGTCTTGGAACCTACAAATGGCTTAACGAAGACG
TAATGATGAATAATAGTCTAGGGATATCTCGTAGTCCGTACAGTGGATTTATCGAAGGATCTGTTGCTGA
TGCTAGCAAAAATCTAAGGGATGTTAAGATAAACAACGATGTTATTGTTAGAACCAGTAAAGGAGCTCTT
GTCCGGACGTATGAACTGAGGGTAGATGTAGATGGTTTCTCTCATTTTATAAGAATCCACGAGGTTGGGC
AGAATGTAAAAGGAAGTGTAATGTTGTTTCTTCATGGGTTTCTTGGAACTGGTGAAGAATGGATCCCCAT
CATGAAGGGTATCTCAGGATCTGCAAGATGCATTTCAGTTGATATTCCTGGTCATGGAAGCTCAAGGGTA
CAAAGTCATGCTAGTGAGACCCAGAAGACCCCTCCTTACTCGATGGAGATGATAGCGGAAGCACTGTATA
AGTTGATGGAGCAAATTACTCCTGGGAAAGTTACAATAGTTGGATATTCCATGGGAGGAAGAATAGCACT
GTACACGGCTTTGAGGTTTAGCAACAAGATTGAAGGAGCTGTTATTGTGTCGGGGAGCCCCGGGATCAAG
GATCCAGTGTCAAGGACAGTTCAAAGGGCAACAGATGATTCTAAAGCACGAATGATGGTTGACCATGGAC
TAGAAATCTTTCTAGAGAACTGGTACAATAGAGGCTTGTGGAAAAGTTTGAGAAGTCATCCCCATTTTAG
AAAAATAGTTGCAAGCCGCTTGATACATGATGATGTCCTTAGTGTAGCAAAGCTCCTCTCAGATCTGAGC
ACCGGGAGACAGCCGTCATTGTGGGAAGAGTTGGCGTTTTGTGATACAAATGTCTCGCTTGTTTATGGAG
AGAAAGATGTAAAATTCAAGAAAATTGCTACTAGGATGTACGTTGAGATGAGTAAAAGCAACAAGGGCGA
AAACTATATTATTGAGACGGTTGAAATCCCAGAGACTGGTCATGCTGTTCATCTTGAGAGCCCTCTGCTC
TTGATCCTCGCTCTTAGAAAGTTCTTAACAAGAGTGCGCAAAAACTCTGCAGAGACAGCTTTCTCAGAAG
CTCTTGTTAGCACTTAA
Upstream Sequence
ATCAAGTGTATTATCCTCCTCCTCTCTGTTTCGAACTTCAAAACCAAATAACATCCTCCTCCAACTTCAC
ACAGCTTACAGCTAAGAAACAATGCCACCTCCATGCTATTCGCTCCTACTCTCGAATCCTCCGATTCTAC
CTTCGCTTATTCCTCCGGGCTACACGTACTCCTTCACACGTGGACCACAACGCTTCTCCTCCTCCTCCTT
GCCCCACTCTCTTCATGGAATCGGAAGAAACATCGAGGTCGCCGAAGGAGTACAATTCGATGGGACGATT
GCGAGTAGTAGTAGAGAGGATGTGAATCATGAGAATGAGGAGGATTTGATGGTCCAAGTGTGTGTGACTC
GTACGTTGCCTCCAGCTTTGACTCTTGAGCTTGGACTCGAGAGACTCATCGAAGCTGTTGACCAACTCAA
AGCTAATCCTCCAAAGTCCTCTACTGGCGTCTTGCGGTTTCAAGTGGCTGTGCCGCCGAGGGCAAAGGCT
TTGTTCTGGTTCTGCTCTCAACCTGTGTCTTCCGGTGTATTCCCTGTGTTTTTCCTCTCCAAAGATGATA
CTCTGGAGGACCCTTGTTATAAATCTCTCTATGTAAAGGAACCTCATGGGATTTTTGGTATTGGAGACGC
ACTCTCTTTCCTTCATCACTCCAAGGCTGGTCAGACCACCATCAAAACATTTCTCTCAGATGAATCAGGT
ATGGTGAAGGCTTATGGTTTTCCTGATATTGATTTCAACGGAAACTCTAGTGTACATAGCAAGGATGGCT
CTTCTTACTTCTTCGTTCCTCAGATAGAGTTAGATGAGCACGACGAGATCTCCATATTAGCAGTTACGCT
GGCATGGAACGATTCTCTATCTTACAGATTTGAGCAAGCAATTAGTTCATATGAGAAATCAGTTTTTCAG
GTTTCTTGTCATGTCTGCCCCAATTTAGAGGATCATTGGTTCAAACATCTAAAAAGTTCTCTTGCAAAGT
TGGCTGAGCATATGGAGTTTGCAACATTTTCTAGGAGAGATCATGGTGACGCCAATGAACTGAAAAGTAT
ACAATCATCATGTCAGTTCCATTGCAAGCTTTCACCTGAAGTTGTTTTATCAAACAACATGCTGCATCAG
GAGGCTGAAGTGAGCAACTTGTTGAAAGATCAGGCTAATATCAATGCTGTATGGGCATCAGCTATAATTG
AAGAATGCACTCGTCTTGGTTTGACGTACTTTTGTGTAGCTCCTGGATCAAGGTCCTCCCATCTTGCAAT
TGCTGCTGCTAACCACCCCCTTACAACGTGTCTTGCATGCTTTGACGAACGATCTCTTGCCTTTCACGCC
ATTGGGTATGCTAAAGGATCCCTTAAACCGGCTGTCATTATAACATCATCAGGAACTGCCGTTTCAAATC
TTCTTCCAGCGGTGGTTGAAGCCAGTGAGGATTTCTTGCCTCTGCTACTACTTACTGCAGATCGTCCCCC
TGAACTTCAGGGAGTTGGCGCAAATCAAGCTATAAATCAAATAAACCACTTTGGTTCGTTCGTCAGATTC
TTCTTCAATCTCCCTCCTCCAACTGATCTTATACCAGTCCGGATGGTCCTTACTACCGTAGACTCTGCTC
TACACTGGGCAACAGGTTCTGCTTGTGGACCAGTACATCTGAATTGTCCTTTTAGAGACCCACTTGACAG
TAGTCCAACAAATTGGTCATTCAACTGCTTAAATGGATTAGACACGTGGATGTCCAATGCTGAACCATTC
ACAAAATATTTTCAAGTACAAAGCCTCAAGGGCAATGGTAAAACAAGTGGCCAAATTACTGAGGTTTTAC
AAGTAATCAAAGAGGCTAAGAAGGGCCTTCTTCTTATCGGTGCAATCCATACGGAGGATGAAATTTGGGC
TTCTCTTCTCTTGGCTAAAGAACTGATGTGGCCGGTTGTTGCAGATGTCTTGTCTGGTGTACGGCTGCGC
AAGCTTTCTAAACCTTTTCTTGAGAAGTGGACCCCTGTTTTTATTGATCATCTTGATCATGCCCTGCTTT
CGGATTCTGTTAGGAATTTGATAGAGTTTGACGTTGTTATCCAGATTGGAAGTCGGATAACAAGTAAAAG
AGTTTCTCAGGTGCTTGAGAAATGCTTTCCGTTTGCATACATTTTGGTTGATAAGCATCCATGCCGACAT
GACCCATCACACTTGGTCACTCACAGGGTCCAAAGCAATATTGTTCAGTTTGCTGATTGTGTGCTTAAAT
CTATATTTCCATGGAGGAGAAGCAAATTAGATGGTCATCTACAGGCATTGAATGGCGCTATTGCCCGAGA
AATTTCATTTCAATTAGCAGCTGAGTGCTCCCTGACCGAACCTTATGTTGCACATATGCTTTCCAAAGCA
CTGACTTCTAAATCAGCTCTTTTCATCGGAAATAGTATGCCAATAAGGGATGTGGATATGTATGGATGTA
GTTCGGGAAACTATTATTCTCACGTGGTAGATATGATGTTAAGTGTAGAATCACCATGTCAATGGATACA
AGTAACTGGAAATAGAGGAGCTAGTGGCATTGATGGCTTGCTCAGCACGGCCACTGGCTTTGCTGTAGGA
TGCAAGAAGAGAGTTGTCTGTGTGGTGGGAGATATCTCTTTCCTTCATGATACAAATGGATTGGCGATTT
TGAAGCAGAGGATTGCGAGGAAACCAATGACAGTTCTCGTGATAAACAACCGTGGAGGTGGAATCTTCCG
ACTTCTTCCTATAGCAAAGAGAACAGAGCCTAGCGTGTTGAATCAATATTTCTATACATCACATGACATT
TCCATTAAGAACTTGTGCTTGGCACATGGTGTGAAGTATGTACATGTTGGGAGAAAAAGTGAACTTGAGG
AAACCCTATTGGAACCCAGCCTGGAAGAGATGGACTGTATTGTGGAGGTTGAAAGCTCTATTGATGCTAA
CGCGCTCGTTCATAGTACTTTGGAGAGTTTTGCACGCCAAGCTGCAAATAAATCCTTGGGTATTATCTTG
GCCAGTTCACTTCTTCATCCAATGATCGACAACGTACTTCTTTTCCAAGTCTCTGGAATACAATATTCGC
GGTACAGAGTCAGACTGTGTGACAGACCTACAATATATTCTGGTGAATCCTCTCATTTCCATCGAGAAGG
GTTCATACTCTCCCTGACTTTGGAGGATGGAAGCGTTGGCTGCGGAGAGGTTGCACCTTTGGACAGTAGT
AGGGAGAACTTAATGGATGTGGAGGGGCAGCTTCAGTTGATTCTTCATCTTATGAAAGGTGCTAAACTCA
GTCACATGCTTCCTTTGTTAAATGGCTCGTTTTCTTCCTGGATTCGGAGTGAACTTGGAATCACTGCATC
ATCAATTTTCCCAAGTGTCAGATGTGGTCTAGAAATGGCTCTTCTGAATGCAATGGCAGTAAGACATGAT
TCTAGTTTGTTGGGGATACTTCATTGTCAGAAAGAAGAAATTGGTTCTGTTCAGCCACACTCTGTTCCAA
TATGTGCCCTTGTTGATTGTGAAGGTACTCCATCAGAGGTCGCATACGTTGCTAGAAAACTTGTTGAAGA
AGGGTTCAGTGCTATTAAACTTAAAGTTGCTCGTCGAGTGAACTCCGTTCAAGATGCTTTAGTTCTGCAA
GAAGTAAGGAGAGTCGTTGGCGATCAAATCGAACTCCGTGCAGATGCTAACTGTCGCTGGACTTTTGAAG
AGGCCATAACTTTTGGTTTATTGGTGAAAAAGTGCAATCTACAATATATTGAGGAACCTGTCCAGAATAA
AGATGATCTTATAAGGTTTTGTGAAGAAAGTGGATTACCAGTGGCACTTGATGAGACTCTTGATGATTTT
AAGGAATGTCCTCTGCGCATGCTTTCCAAATATACCCATCCTGGAGTAGTTGCTGTTGTTATCAAACCAA
GTGTTGTGGGAGGGTTTGAGAATGCAGCACTGATTGCTCGCTGGGCACAGCAGCATGGAAAGATGGCTGT
TATAAGTGCCGCATACGAAAGTGGCCTAGGTTTGTCAGCATATATTTTGTTTGCATCGTATTTGGAGACG
CTGAACGTCAAAACATTTAGAGAGAGAAAGCAAGGGATGGCCTCTCTTGTGGCCCATGGTCTTGGAACCT
ACAAATGGCTTAACGAAGACGTAATGATGAATAATAGTCTAGGGATATCTCGTAGTCCGTACAGTGGATT
TATCGAAGGATCTGTTGCTGATGCTAGCAAAAATCTAAGGGATGTTAAGATAAACAACGATGTTATTGTT
AGAACCAGTAAAGGAGCTCTTGTCCGGACGTATGAACTGAGGGTAGATGTAGATGGTTTCTCTCATTTTA
TAAGAATCCACGAGGTTGGGCAGAATGTAAAAGGAAGTGTAATGTTGTTTCTTCATGGGTTTCTTGGAAC
TGGTGAAGAATGGATCCCCATCATGAAGGGTATCTCAGGATCTGCAAGATGCATTTCAGTTGATATTCCT
GGTCATGGAAGCTCAAGGGTACAAAGTCATGCTAGTGAGACCCAGAAGACCCCTCCTTACTCGATGGAGA
TGATAGCGGAAGCACTGTATAAGTTGATGGAGCAAATTACTCCTGGGAAAGTTACAATAGTTGGATATTC
CATGGGAGGAAGAATAGCACTGTACACGGCTTTGAGGTTTAGCAACAAGATTGAAGGAGCTGTTATTGTG
TCGGGGAGCCCCGGGATCAAGGATCCAGTGTCAAGGACAGTTCAAAGGGCAACAGATGATTCTAAAGCAC
GAATGATGGTTGACCATGGACTAGAAATCTTTCTAGAGAACTGGTACAATAGAGGCTTGTGGAAAAGTTT
GAGAAGTCATCCCCATTTTAGAAAAATAGTTGCAAGCCGCTTGATACATGATGATGTCCTTAGTGTAGCA
AAGCTCCTCTCAGATCTGAGCACCGGGAGACAGCCGTCATTGTGGGAAGAGTTGGCGTTTTGTGATACAA
ATGTCTCGCTTGTTTATGGAGAGAAAGATGTAAAATTCAAGAAAATTGCTACTAGGATGTACGTTGAGAT
GAGTAAAAGCAACAAGGGCGAAAACTATATTATTGAGACGGTTGAAATCCCAGAGACTGGTCATGCTGTT
CATCTTGAGAGCCCTCTGCTCTTGATCCTCGCTCTTAGAAAGTTCTTAACAAGAGTGCGCAAAAACTCTG
CAGAGACAGCTTTCTCAGAAGCTCTTGTTAGCACTTAAAGAAACATAAGCAAACTCTGCTGAAATCTGAG
AAATGCATCAGATTTTGGATAATAAAGACATTATTTTGATCATGATTGGACCTGTCTCAGGCATGAACTA
AATTGATTTAAATAATCATATATATTTTGTTTTGAATTTGTGTATTCCTCCGCTGTCTAATAAAGTAGAA
CTTATGATGGTT
Downstream Sequence
GTTCGTTTCGTTCGTTTGGTTATGATCTTTGTGTTACTCAAATTGAATTCGAATTTTTAT
TTTTATTTTTATTTGATTCCAGGTCGCCGAAGGAGTACAATTCGATGGGACGATTGCGAG
TAGTAGTAGAGAGGATGTGAATCATGAGAATGAGGAGGATTTGATGGTCCAAGTGTGTGT
GACTCGTACGTTGCCTCCAGCTTTGACTCTTGAGCTTGGACTCGAGAGACTCATCGAAGC
TGTTGACCAACTCAAAGCTAATCCTCCAAAGTCCTCTACTGGCGTCTTGCGGTTTCAAGT
ACGGTTTGCTTTTTTTTTGTTTTTGTGAATGTTTTTCTGTTTTTTTTTTTTGGTTCGGGC
CGAAACCATAATCCTCCAGACCAGACCTGAAATTAACAGCATGTAATATTTGTAGGTGGC
TGTGCCGCCGAGGGCAAAGGCTTTGTTCTGGTTCTGCTCTCAACCTGTGTCTTCCGGTGT
ATTCCCTGTGTTTTTCCTCTCCAAAGATGATACTCTGGAGGACCCTTGTTATAAATCTCT
CTATGTAAAGGAACCTCATGGGATTTTTGGTATTGGAGACGCACTCTCTTTCCTTCATCA
CTCCAAGGCTGGTCAGACCACCATCAAAACGTATTCTTCCTTTTCGGTCCTTACTTTGTT
CACAACACTAGAACAAGATCAATATAGGATTCTTTCTCTTTTTCTGTAGATTTCTCTCAG
ATGAATCAGGTATGGTGAAGGCTTATGGTTTTCCTGATATTGATTTCAACGGAAACTCTA
GTGTACATAGCAAGGATGGCTCTTCTTACTTCTTCGTTCCTCAGGTTGAATTTGTCATCA
AGCTCTTGACTTGCATCTTTTTTTCTTTCTTTTTTTTTTTGTATTGATGAAGTGGTTGAA
GCAAGCTTTCTTATATTGTACATGGTCTGCTTTTGACAGATAGAGTTAGATGAGCACGAC
GAGATCTCCATATTAGCAGTTACGCTGGCATGGAACGATTCTCTATCTTACAGATTTGAG
CAAGCAATTAGTTCATATGAGAAATCAGTTTTTCAGGTATGCTGCCATTTAATTTCTTCT
TAGTATCCAAACCACCATGCTGCGTATCCTTTACGACAGGACACTAAACAAGTATGCAAA
ACTGTTAATCTAATGGGAAGTATATACTTGAAGTGCAGATTTTTTTTTAACTTGTGTCAG
TCAAATGGAAAGTTAATATTTTTTAGCTATTGCTGTGTCTTGTTTTCATGTTAATCTTGC
TTCATTTTCATGCAGGTTTCTTGTCATGTCTGCCCCAATTTAGAGGATCATTGGTTCAAA
CATCTAAAAAGTTCTCTTGCAAAGTTGGCTGTAAAAGAGATTCATCCAATAACTTTGGTT
TTTCTCCCTAACTCTTAAGTAGCTAGACAAGATGCAAGGATATATATATATATATATATA
TATATATATATATGTGATATTTTGTTCATTATCAAACTGTTTCCTGCACAAATGTTACAT
TCCTGATTTTCAGAGAGCCACTTAAACACTATTTGACGAAGTTTCTTATAATATTGTTTT
ACGCGTTTGGCCCACATGGATAGTACTTTTCTGAACTGAGCATGTAAACTTAGTTTCGAC
CTCATGCCAGTCATTCTTATGTTTATTTTGTAGGAGCATATGGAGTTTGCAACATTTTCT
AGGAGAGATCATGGTGACGCCAATGAACTGGTATCATCTTTACCCTGGTATTTTGGAATT
TAATTATCGTCTCTGAAGCTTGTCCGTTGGAATTTCTTCTCTTTGGTCCATATGGATATA
TTTCTCTACTGTTTGATTTGCCAGCTTTTGTTGCAGAAAAGTATACAATCATCATGTCAG
TTCCATTGCAAGCTTTCACCTGAAGTTGTTTTATCAAACAACATGGTACGTAGATATATA
ATTATGGCCTTTTTCTTTAGATTGTTAGGATGCTTTTTGTATGACTAGTGATTTTGAAAA
GACTGTTCATCATGCCTTCA
mRNA Sequence
ATCAAGTGTATTATCCTCCTCCTCTCTGTTTCGAACTTCAAAACCAAATAACATCCTCCTCCAACTTCAC
ACAGCTTACAGCTAAGAAACAATGCCACCTCCATGCTATTCGCTCCTACTCTCGAATCCTCCGATTCTAC
CTTCGCTTATTCCTCCGGGCTACACGTACTCCTTCACACGTGGACCACAACGCTTCTCCTCCTCCTCCTT
GCCCCACTCTCTTCATGGAATCGGAAGAAACATCGAGGTCGCCGAAGGAGTACAATTCGATGGGACGATT
GCGAGTAGTAGTAGAGAGGATGTGAATCATGAGAATGAGGAGGATTTGATGGTCCAAGTGTGTGTGACTC
GTACGTTGCCTCCAGCTTTGACTCTTGAGCTTGGACTCGAGAGACTCATCGAAGCTGTTGACCAACTCAA
AGCTAATCCTCCAAAGTCCTCTACTGGCGTCTTGCGGTTTCAAGTGGCTGTGCCGCCGAGGGCAAAGGCT
TTGTTCTGGTTCTGCTCTCAACCTGTGTCTTCCGGTGTATTCCCTGTGTTTTTCCTCTCCAAAGATGATA
CTCTGGAGGACCCTTGTTATAAATCTCTCTATGTAAAGGAACCTCATGGGATTTTTGGTATTGGAGACGC
ACTCTCTTTCCTTCATCACTCCAAGGCTGGTCAGACCACCATCAAAACATTTCTCTCAGATGAATCAGGT
ATGGTGAAGGCTTATGGTTTTCCTGATATTGATTTCAACGGAAACTCTAGTGTACATAGCAAGGATGGCT
CTTCTTACTTCTTCGTTCCTCAGATAGAGTTAGATGAGCACGACGAGATCTCCATATTAGCAGTTACGCT
GGCATGGAACGATTCTCTATCTTACAGATTTGAGCAAGCAATTAGTTCATATGAGAAATCAGTTTTTCAG
GTTTCTTGTCATGTCTGCCCCAATTTAGAGGATCATTGGTTCAAACATCTAAAAAGTTCTCTTGCAAAGT
TGGCTGAGCATATGGAGTTTGCAACATTTTCTAGGAGAGATCATGGTGACGCCAATGAACTGAAAAGTAT
ACAATCATCATGTCAGTTCCATTGCAAGCTTTCACCTGAAGTTGTTTTATCAAACAACATGCTGCATCAG
GAGGCTGAAGTGAGCAACTTGTTGAAAGATCAGGCTAATATCAATGCTGTATGGGCATCAGCTATAATTG
AAGAATGCACTCGTCTTGGTTTGACGTACTTTTGTGTAGCTCCTGGATCAAGGTCCTCCCATCTTGCAAT
TGCTGCTGCTAACCACCCCCTTACAACGTGTCTTGCATGCTTTGACGAACGATCTCTTGCCTTTCACGCC
ATTGGGTATGCTAAAGGATCCCTTAAACCGGCTGTCATTATAACATCATCAGGAACTGCCGTTTCAAATC
TTCTTCCAGCGGTGGTTGAAGCCAGTGAGGATTTCTTGCCTCTGCTACTACTTACTGCAGATCGTCCCCC
TGAACTTCAGGGAGTTGGCGCAAATCAAGCTATAAATCAAATAAACCACTTTGGTTCGTTCGTCAGATTC
TTCTTCAATCTCCCTCCTCCAACTGATCTTATACCAGTCCGGATGGTCCTTACTACCGTAGACTCTGCTC
TACACTGGGCAACAGGTTCTGCTTGTGGACCAGTACATCTGAATTGTCCTTTTAGAGACCCACTTGACAG
TAGTCCAACAAATTGGTCATTCAACTGCTTAAATGGATTAGACACGTGGATGTCCAATGCTGAACCATTC
ACAAAATATTTTCAAGTACAAAGCCTCAAGGGCAATGGTAAAACAAGTGGCCAAATTACTGAGGTTTTAC
AAGTAATCAAAGAGGCTAAGAAGGGCCTTCTTCTTATCGGTGCAATCCATACGGAGGATGAAATTTGGGC
TTCTCTTCTCTTGGCTAAAGAACTGATGTGGCCGGTTGTTGCAGATGTCTTGTCTGGTGTACGGCTGCGC
AAGCTTTCTAAACCTTTTCTTGAGAAGTGGACCCCTGTTTTTATTGATCATCTTGATCATGCCCTGCTTT
CGGATTCTGTTAGGAATTTGATAGAGTTTGACGTTGTTATCCAGATTGGAAGTCGGATAACAAGTAAAAG
AGTTTCTCAGGTGCTTGAGAAATGCTTTCCGTTTGCATACATTTTGGTTGATAAGCATCCATGCCGACAT
GACCCATCACACTTGGTCACTCACAGGGTCCAAAGCAATATTGTTCAGTTTGCTGATTGTGTGCTTAAAT
CTATATTTCCATGGAGGAGAAGCAAATTAGATGGTCATCTACAGGCATTGAATGGCGCTATTGCCCGAGA
AATTTCATTTCAATTAGCAGCTGAGTGCTCCCTGACCGAACCTTATGTTGCACATATGCTTTCCAAAGCA
CTGACTTCTAAATCAGCTCTTTTCATCGGAAATAGTATGCCAATAAGGGATGTGGATATGTATGGATGTA
GTTCGGGAAACTATTATTCTCACGTGGTAGATATGATGTTAAGTGTAGAATCACCATGTCAATGGATACA
AGTAACTGGAAATAGAGGAGCTAGTGGCATTGATGGCTTGCTCAGCACGGCCACTGGCTTTGCTGTAGGA
TGCAAGAAGAGAGTTGTCTGTGTGGTGGGAGATATCTCTTTCCTTCATGATACAAATGGATTGGCGATTT
TGAAGCAGAGGATTGCGAGGAAACCAATGACAGTTCTCGTGATAAACAACCGTGGAGGTGGAATCTTCCG
ACTTCTTCCTATAGCAAAGAGAACAGAGCCTAGCGTGTTGAATCAATATTTCTATACATCACATGACATT
TCCATTAAGAACTTGTGCTTGGCACATGGTGTGAAGTATGTACATGTTGGGAGAAAAAGTGAACTTGAGG
AAACCCTATTGGAACCCAGCCTGGAAGAGATGGACTGTATTGTGGAGGTTGAAAGCTCTATTGATGCTAA
CGCGCTCGTTCATAGTACTTTGGAGAGTTTTGCACGCCAAGCTGCAAATAAATCCTTGGGTATTATCTTG
GCCAGTTCACTTCTTCATCCAATGATCGACAACGTACTTCTTTTCCAAGTCTCTGGAATACAATATTCGC
GGTACAGAGTCAGACTGTGTGACAGACCTACAATATATTCTGGTGAATCCTCTCATTTCCATCGAGAAGG
GTTCATACTCTCCCTGACTTTGGAGGATGGAAGCGTTGGCTGCGGAGAGGTTGCACCTTTGGACAGTAGT
AGGGAGAACTTAATGGATGTGGAGGGGCAGCTTCAGTTGATTCTTCATCTTATGAAAGGTGCTAAACTCA
GTCACATGCTTCCTTTGTTAAATGGCTCGTTTTCTTCCTGGATTCGGAGTGAACTTGGAATCACTGCATC
ATCAATTTTCCCAAGTGTCAGATGTGGTCTAGAAATGGCTCTTCTGAATGCAATGGCAGTAAGACATGAT
TCTAGTTTGTTGGGGATACTTCATTGTCAGAAAGAAGAAATTGGTTCTGTTCAGCCACACTCTGTTCCAA
TATGTGCCCTTGTTGATTGTGAAGGTACTCCATCAGAGGTCGCATACGTTGCTAGAAAACTTGTTGAAGA
AGGGTTCAGTGCTATTAAACTTAAAGTTGCTCGTCGAGTGAACTCCGTTCAAGATGCTTTAGTTCTGCAA
GAAGTAAGGAGAGTCGTTGGCGATCAAATCGAACTCCGTGCAGATGCTAACTGTCGCTGGACTTTTGAAG
AGGCCATAACTTTTGGTTTATTGGTGAAAAAGTGCAATCTACAATATATTGAGGAACCTGTCCAGAATAA
AGATGATCTTATAAGGTTTTGTGAAGAAAGTGGATTACCAGTGGCACTTGATGAGACTCTTGATGATTTT
AAGGAATGTCCTCTGCGCATGCTTTCCAAATATACCCATCCTGGAGTAGTTGCTGTTGTTATCAAACCAA
GTGTTGTGGGAGGGTTTGAGAATGCAGCACTGATTGCTCGCTGGGCACAGCAGCATGGAAAGATGGCTGT
TATAAGTGCCGCATACGAAAGTGGCCTAGGTTTGTCAGCATATATTTTGTTTGCATCGTATTTGGAGACG
CTGAACGTCAAAACATTTAGAGAGAGAAAGCAAGGGATGGCCTCTCTTGTGGCCCATGGTCTTGGAACCT
ACAAATGGCTTAACGAAGACGTAATGATGAATAATAGTCTAGGGATATCTCGTAGTCCGTACAGTGGATT
TATCGAAGGATCTGTTGCTGATGCTAGCAAAAATCTAAGGGATGTTAAGATAAACAACGATGTTATTGTT
AGAACCAGTAAAGGAGCTCTTGTCCGGACGTATGAACTGAGGGTAGATGTAGATGGTTTCTCTCATTTTA
TAAGAATCCACGAGGTTGGGCAGAATGTAAAAGGAAGTGTAATGTTGTTTCTTCATGGGTTTCTTGGAAC
TGGTGAAGAATGGATCCCCATCATGAAGGGTATCTCAGGATCTGCAAGATGCATTTCAGTTGATATTCCT
GGTCATGGAAGCTCAAGGGTACAAAGTCATGCTAGTGAGACCCAGAAGACCCCTCCTTACTCGATGGAGA
TGATAGCGGAAGCACTGTATAAGTTGATGGAGCAAATTACTCCTGGGAAAGTTACAATAGTTGGATATTC
CATGGGAGGAAGAATAGCACTGTACACGGCTTTGAGGTTTAGCAACAAGATTGAAGGAGCTGTTATTGTG
TCGGGGAGCCCCGGGATCAAGGATCCAGTGTCAAGGACAGTTCAAAGGGCAACAGATGATTCTAAAGCAC
GAATGATGGTTGACCATGGACTAGAAATCTTTCTAGAGAACTGGTACAATAGAGGCTTGTGGAAAAGTTT
GAGAAGTCATCCCCATTTTAGAAAAATAGTTGCAAGCCGCTTGATACATGATGATGTCCTTAGTGTAGCA
AAGCTCCTCTCAGATCTGAGCACCGGGAGACAGCCGTCATTGTGGGAAGAGTTGGCGTTTTGTGATACAA
ATGTCTCGCTTGTTTATGGAGAGAAAGATGTAAAATTCAAGAAAATTGCTACTAGGATGTACGTTGAGAT
GAGTAAAAGCAACAAGGGCGAAAACTATATTATTGAGACGGTTGAAATCCCAGAGACTGGTCATGCTGTT
CATCTTGAGAGCCCTCTGCTCTTGATCCTCGCTCTTAGAAAGTTCTTAACAAGAGTGCGCAAAAACTCTG
CAGAGACAGCTTTCTCAGAAGCTCTTGTTAGCACTTAAAGAAACATAAGCAAACTCTGCTGAAATCTGAG
AAATGCATCAGATTTTGGATAATAAAGACATTATTTTGATCATGATTGGACCTGTCTCAGGCATGAACTA
AATTGATTTAAATAATCATATATATTTTGTTTTGAATTTGTGTATTCCTCCGCTGTCTAATAAAGTAGAA
CTTATGATGGTT
Pro Sequence
MPPPCYSLLLSNPPILPSLIPPGYTYSFTRGPQRFSSSSLPHSLHGIGRNIEVAEGVQFDGTIASSSRED
VNHENEEDLMVQVCVTRTLPPALTLELGLERLIEAVDQLKANPPKSSTGVLRFQVAVPPRAKALFWFCSQ
PVSSGVFPVFFLSKDDTLEDPCYKSLYVKEPHGIFGIGDALSFLHHSKAGQTTIKTFLSDESGMVKAYGF
PDIDFNGNSSVHSKDGSSYFFVPQIELDEHDEISILAVTLAWNDSLSYRFEQAISSYEKSVFQVSCHVCP
NLEDHWFKHLKSSLAKLAEHMEFATFSRRDHGDANELKSIQSSCQFHCKLSPEVVLSNNMLHQEAEVSNL
LKDQANINAVWASAIIEECTRLGLTYFCVAPGSRSSHLAIAAANHPLTTCLACFDERSLAFHAIGYAKGS
LKPAVIITSSGTAVSNLLPAVVEASEDFLPLLLLTADRPPELQGVGANQAINQINHFGSFVRFFFNLPPP
TDLIPVRMVLTTVDSALHWATGSACGPVHLNCPFRDPLDSSPTNWSFNCLNGLDTWMSNAEPFTKYFQVQ
SLKGNGKTSGQITEVLQVIKEAKKGLLLIGAIHTEDEIWASLLLAKELMWPVVADVLSGVRLRKLSKPFL
EKWTPVFIDHLDHALLSDSVRNLIEFDVVIQIGSRITSKRVSQVLEKCFPFAYILVDKHPCRHDPSHLVT
HRVQSNIVQFADCVLKSIFPWRRSKLDGHLQALNGAIAREISFQLAAECSLTEPYVAHMLSKALTSKSAL
FIGNSMPIRDVDMYGCSSGNYYSHVVDMMLSVESPCQWIQVTGNRGASGIDGLLSTATGFAVGCKKRVVC
VVGDISFLHDTNGLAILKQRIARKPMTVLVINNRGGGIFRLLPIAKRTEPSVLNQYFYTSHDISIKNLCL
AHGVKYVHVGRKSELEETLLEPSLEEMDCIVEVESSIDANALVHSTLESFARQAANKSLGIILASSLLHP
MIDNVLLFQVSGIQYSRYRVRLCDRPTIYSGESSHFHREGFILSLTLEDGSVGCGEVAPLDSSRENLMDV
EGQLQLILHLMKGAKLSHMLPLLNGSFSSWIRSELGITASSIFPSVRCGLEMALLNAMAVRHDSSLLGIL
HCQKEEIGSVQPHSVPICALVDCEGTPSEVAYVARKLVEEGFSAIKLKVARRVNSVQDALVLQEVRRVVG
DQIELRADANCRWTFEEAITFGLLVKKCNLQYIEEPVQNKDDLIRFCEESGLPVALDETLDDFKECPLRM
LSKYTHPGVVAVVIKPSVVGGFENAALIARWAQQHGKMAVISAAYESGLGLSAYILFASYLETLNVKTFR
ERKQGMASLVAHGLGTYKWLNEDVMMNNSLGISRSPYSGFIEGSVADASKNLRDVKINNDVIVRTSKGAL
VRTYELRVDVDGFSHFIRIHEVGQNVKGSVMLFLHGFLGTGEEWIPIMKGISGSARCISVDIPGHGSSRV
QSHASETQKTPPYSMEMIAEALYKLMEQITPGKVTIVGYSMGGRIALYTALRFSNKIEGAVIVSGSPGIK
DPVSRTVQRATDDSKARMMVDHGLEIFLENWYNRGLWKSLRSHPHFRKIVASRLIHDDVLSVAKLLSDLS
TGRQPSLWEELAFCDTNVSLVYGEKDVKFKKIATRMYVEMSKSNKGENYIIETVEIPETGHAVHLESPLL
LILALRKFLTRVRKNSAETAFSEALVST