Detail for ZS11_A02G11040.t1

Genome:
ZS11: ZS11_A02G11040.t1
ZS11_v0: BnaA02T0100500ZS
WE: WE_C02G17040.t1
WE_v0: BnaA02T0102500WE
Darmor: A02p06630.1_BnaDAR
BH: BnaA02T110000BH
Quinta: BnaA02T0100700QU
ZS2: BnaA02T106200ZS2
SW: BnaA02T115400SW
Express61: A02p009960.1_BnaEXP
Xiaoyun: BnaA02G0095800XY
P202: BnaA03G008960P202.1
BL: BnaA02T110000BH
TA: BnaA02T086500TA
Da-Ae: rna-gnl|WGS:JAGKQM|BnaA01g02650D
RB: BnaA02T114400RB
No2127: BnaA02T0102200NO
P130: BnaA02G009580P130.2
Length: 1969 aa
KOG ID: KOG0261
KOG E-value: Not available
KOG Score: 2320.0
KOG Annotation: Transcription
Biological Process: GO:0046274 (lignin catabolic process); GO:0006351 (transcription, DNA-templated)
Cellular Component: GO:0048046 (apoplast)
Molecular Function: GO:0052716 (hydroquinone:oxygen oxidoreductase activity); GO:0003677 (DNA binding); GO:0003899 (DNA-directed 5'-3' RNA polymerase activity); GO:0005507 (copper ion binding); GO:0016491 (oxidoreductase activity)
KEGG Ortholog: K03018 RPC1, POLR3A; DNA-directed RNA polymerase III subunit RPC1 [EC:2.7.7.6]
NR Protein: XP_009126624.1
NR E-value: Not available
NR Score: 2748.0
NR Annotation: XP_009126624.1 PREDICTED: DNA-directed RNA polymerase III subunit 1 [Brassica rapa]
SwissProt Protein: F4JXF9|NRPC1_ARATH
SwissProt E-value: Not available
SwissProt Score: 2459.0
SwissProt Annotation: DNA-directed RNA polymerase III subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPC1 PE=2 SV=1
CDS Sequence (5910 bp)
ATGGCGTCTTGGCTTCTCTTTTCTGTGTTCTCTTGTGTTCTTCTTCTTCCTGAACCTGCATTTGGGATTA
CTAGGCATTATACGCTGGACATAAAAATGCACAACGTGACACGTCTTTGCCACACAAAGAGTCTTGTTTC
TGTAAACGGGAAGTTCCCAGGGCCTAAGATTATAGCTAGAGAAGGTGACCAGCTTCTGATCAAAGTTGTT
AACCATGTCCCAAACAATATCTCTCTCCACTGGCATGGGATCAGGCAATTAAGGTCTGGTTGGGCTGATG
GGCCAGCTTACATAACCCAATGTCCTATCCAGACAGGGCAAAGCTATGTATACAACTACACCATTGTTGG
TCAAAGAGGCACTTTATGGTACCATGCTCACATCTCATGGCTCAGAGCAACGGTCTATGGTCCTCTCATC
ATTCTTCCCAAACACGGTGTTCCTTACCCGTTCCCCAAACCTCACAAAGAAGTCCCCATGGTCTTTGGTG
AGTGGTTCAACGCAGACACTGAGGCAATCATCCGCCAAGCAACGCTGACAGGAGGTGGTCCAAATGTCTC
TGATGCTTACACAATCAACGGTCTTCCTGGTCCACTATACAACTGCTCAGCTAAAGATACATTCAGACTG
AGAGTGAAGGCAGGGAAGACATACCTTCTCAGGATAATCAACGCTGCACTTAATGACGAGCTCTTCTTCA
GCATCGCAAACCACACGGTCACAGTCGTTGAAGCTGATGCCGTCTACGTCAAACCATTCGAGACTAACAC
CATCTTGATCGCTCCTGGCCAGACCACAAACGTCCTGCTCAAGACCAAACCGAGTTACCCTAGCGCCTCT
TTCTTCATGACCGCTAGACCATACGCCACAGGTCAAGGGACTTTCGATAACACTACAGTCGCAGGCATCT
TAGAATACGAACAGCCTAAACATGCAAAGACCAATATAAAGAATCTTCCTCTCTTCACGCCGGTGCTCCC
CGCTCTAAACGACACAAACTTCGCTACCAAGTTCAGCAACAAGCTGCGTAGCTTGAACAGCAAGAAGTTC
CCGGCGAACGTGCCTCAGGAGGTTGATAGGAAGTTCTTTTTCACGGTGGGGTTGGGGACTAACCCTTGTA
ACCATAAGAACAACCAGACGTGCCAGGGTCCTACCAACACCACAATGTTTGCTGCATCGATCAGCAACAT
CTCATTCACACTACCAACGAAGGCTCTCCTTCAATCTCACTACTCTGGGAGATCTAACGGAGTTTATTCT
CCGAACTTCCCGTGGAGTCCCATTGTTCCTTTTAACTACACAGGCACTCCGCCTAACAACACCATGGTTA
GCACCGGGACGAACTTGATGGTTCTGAGGTATAACACGAGTGTGGAGCTTGTGATGCAGGACACTAGCAT
TCTTGGTGCGGAGAGCCATCCTCTTCACCTTCATGGGTTCAACTTCTTTGTCGTTGGCCAAGGGTTTGGG
AACTTTGATCCGAACAAGGATCCTAAGAAGTTTAATCTTGTTGATCCGATAGAGAGGAACACGGTTGGTG
TGCCTTCTGGTGGATGGGCTGCTATTAGATTCCTTGCAGACAACCCAGGAGTGTGGTTCATGCACTGTCA
CTTGGAAGTGCATACCAGTTGGGGTCTGAGGATGGCTTGGCTTGTTCTTGATGGAGATAAGCCTGATCAG
AAACTGATTCCTCCTCCTGCAGACTTGCCCAACCGTTCTGTATCTCCCGCCTGTTACTCGATCTGCTGCT
TAAACCCTAAAACCCAACCTCCATCGCCATCCTTCGATTGGAGAGAGATGGAGACGAAGACGGAGATCGA
ATTCACCAAGGAGCCCTACATCGAAGACGTTGGTCCTCTCAAAATACAAAGCATAAACTTCTCAATGCTC
TCTGATATTGAAGTCATGAAAGCTGCTGAGGTTCAGGTCTGGAAGAATATGTACTACGAGTCCAATTTTA
AGCCTATTGAAGGCGGCTTGTTGGATCCTCGAATGGGTCCTCCTAACAAAAGGTCTACATGCGCAACCTG
TCATGGCAACTTCCAAAACTGTCCTGGACACTATGGCTATCTGAAGCTTGACCTTCCGGTTTATAACGTT
GGATTCTTCAATTTTATCCTTGACATTTTGAAGTGCATCTGTAAGAGCTGTTCCAGCATGCTTATTGAAG
AGAAGATGTATGAAGATCACTTGAAGAAGATGCGGAATCCAAGAACGGAGCCATTGAGGAAGACTGAATT
GGCCAAAGCGGTTGTCAAGAAGTGCAGTCTGATGGCTGGCCAGAGAGTTATTACTTGCAAAAAATGTGGA
TACCTCAATGgcatggtAAAGAAGGTCGCAGCGCAGTTGGGTATAGGCATCAGTCATGACCGATCTAAAA
TCCATGGTGGGGAGATTGATGAATTTAAATCTGCAATATCCCACACAAAAGAGTCTGCTGGTGGAATAAA
TCCTCTTACCTATGTTCTTGATCCTAACGTGGTGCTTAGACTTTTTAAAGGAATGAGTGACAAGGACTGC
GAACTTCTGTatattgctcatagaCCTGAGAATCTCATCATAACGTGCATGCTCGTGCCACCGTTATCAA
TCCGACCGTCTGTTATGATTGGTGGTACACAAAGCAATGAAAACGACATAACAGAGAGATTAAAGAAAAT
CATTCAAGACAATGCTTCTCTTCATAAGATTTTAAGCCAACCTACCACATCGCCCAAAAACATGCAAGTA
TGGGATACAGTTCAAAGCGAGGTTGCACAATACATTAATAGTGAAGTCCGAGGTGTCCAGATCATGCCAA
ACACCAAGCCACTGGCTGGACTCCTTCAGCGTCTCAAGGGAAAAGGGGGACGTTTCCGTGCAAACTTGTC
AGGGAAGCGTGTCGAGTTCACTGGTAGAACTGTTATTTCACCTGATCCCAATTTGAAAATTACAGAGGTA
GGGATTCCTATCCTTATGGCCCGGATCTTAACTTTTCCTGAATGTGTGTCCCGTCATAATATTGATAAGT
TGAGGCAACGCGTCCGCAATGGCCCTAATAAATACCCTGGTGCCAGAAATGTCAGATATCCAGATGGTTC
TTCAAGGACTTTGGTGGGTGATTATCGGAAGCGTATTGCTGATGAATTGACTTATGGATGCATAGTTGAC
CGTCATTTGGAAGACGGGGATGCTATTCTTTTCAACAGACAACCGAGTCTGCATCGGATGTCTATCATGT
GTCACAGGGCAAGAATAATGCCTTGGAGAACATTGAGGTTCAACGAATCGGTTTGTAACCCATATAATGC
TGATTTTGATGGTGATGAGATGAACATGCACGTACCACAAACAGAGGAGGCTCGGACAGAGGCTATTACA
TTGATGGGGgtacaaaacaatttatgcaccccaaaaaaTGGAGAAATGTTAGTAGCATCAACACAGGATT
TTTTAACATCTGCCTATTTGATAACGAGAAAGGACACGTTCTATGACCGTGCAGCCTTTTCACTTATATG
TTCTTACATGGGAGACGCCATGGATTCCATAGATTTGCCCACGCCCACAATCTTTAAGCCAATAGAGCTT
TGGACTGGTAAACAGGTTTTTAATGTTTTGCTGCGTCCAAACGCAAGTGTCAGAGTCTACGTAACTCTCA
ATGTGAAAGAGAAGAACTTCAGGAAGGGAGAACATTATGATGAAACAATGTGCATAAATGATGGATGGGT
TTATTTTCGGAACAGTGAGCTAATATCAGGACAACTGGGGAAGGCTACGTTAGGAAGTGGAAGCAAGGAT
GGATTATATTCAATTCTTCTTCGAGATTACAACTCCCATGCTGCTGCAGTCTGCATGAATCGTCTAGCAA
AGTTGAGTGCTCGATGGATTGGAATCCATGGCTTCTCCATTGGGATCGATGACGTTCAACCTGGAAAAAA
GTTGAAAGAGGACAGAGAGGTTATAGTGAAACGCCGATATAAGGATTGTGATGAATTACTTAAGAACTAT
GAAAAAGGAGATCTAGATGCTGCAAAAACACTGGAAGCTAACTTAACAGGGTTTCTTAATAAAATTCGAG
AAGAGACTGGGAAGCTCTGTATGGACGGATTACATTGGAGAAACAGTCCCTTGATCATGTCGCAATGCGG
TTCCAAGGGATCCCCTATCAATATCAGTCAGATGGTTACATGTGTTGGTCAGCAGACAGTTAATGGTAGC
CGTGCTCCTGATGGATTTATAGATCGAAGTCTTCCTCATTTCCCTAGAATGTCCAGAACCCCTGAAGCTA
AAGGTTTTGTTGCTAATTCGTTCTACGACGGCCTTAGTGCCACAGAGTTTTTCTTTCACACTATGGGTGG
ACGAGAAGGTCTAGTTGATACAGCGGTGAAAACTGCCAGTACAGGTTACATGTCTCGAAGACTGATGAAA
GCCTTGGAGGATCTCTTAGTCCATTATGATAACACAGTGCGAAACGCCAGCGGAAGCATACTTCAGTTTA
CTTATGGGGATGATGGGATGGACCCAGCACTGATGGAAGGAAAGAATGGAACGCCTTTAAATTTTGATAG
ATTATTTCTTAAAATTCAGGCCACTTGTCCTCCTAGATCACATCACAATTATCTTTCTTCAGACGAACTG
TTGCAAAAGTTCGAAGAGGAGTTAGTCAGGCAAGATACAAGTCGGGTGTGCACTGACGCCTTCGTGAAAT
CTCTAAGAGAATTTGTTTGTTTGCTCGGAGTAAAGTCTGCAAGCCCGAGCCAGATTTTCTCTAAAGGATC
TGGTGTGACTGATAAGCAACTCGAGGTATTTGTGAAAATTTGTGTATCTCGATACCGGGGGAAAACAATT
GAACCTGGGACTGCAATTGGACCAATAGGAGCTCAGAGTATCGGAGAACCAGGGACACAAATGACTCTGA
AAACTTTTCACTTTGCTGGAGTTGCTAGCATGAATATCACCCAAGGAGTTCCTCGAATCAACGAAATCAT
AAATGCTACCAAAACTATAAGCACACCCGTCATCTCTGCAGAACTTGAGAACCCCCTGGTAGAGGCTAGT
GCCCGAATGGCCAAAGGACGCATCGAGAAAACTACTTTAGGACAGGTTGCTGAGAGTATCGAGGTGCTAA
TGACTTCAACATCAGCGTCAGTGAGGATAACCCTTGACAAGAAAATAATAGAGGAGGCGTTTTTGTCTAT
AACCCCCTGGTCGGTTAAAAATTCCATACTAAAGACCAGAATCAAACTGCAGGATGAGAATATCAGGGTC
TTAGATACGGGATTGGATATTATTCCAAAGGGAGATCAAAATGGGACTCATTTCACTCTCCACAATCTGA
AGAATGTGCTGCCAAATGTTATAGTGAATGGGATCAAAACAGTTGAGCGAGTCGTTATAGCAGAGGATAC
AGATAAAAAGAAAGAGATTGGTGGGAAGAAAAGATTGAAACTGTTCGTGGAGGGAACAAACCTCCTGGAT
GTAATGGGCACTCCGGGAATCGATGGGAGAACTACTACAAGCAACAATATTGTCGAAGTGAGCAAAACAC
TGGGAATTGAGGCTGCAAGGACGACAATTATTGATGAAATAGGGTCAGTTATGGGAAACCATGGAATGAG
TATAGACATTCGTCACATGATGCTTTTGGCTGATGTCATGACTTACCGGGGGGAGGTACTTGGGATCCAA
AGAACCGGGATACAGAAGATGGACAAAAGTGTGCTGATGCAGGCATCTTTTGAGAGGACTGGAGATCATT
TATTTAGTGCAGCAATTAGCGGAAAAGTTGATAACATAGAGGGAGTCACAGAGTGTGTGATTATGGGCAT
ACCAATGAAACTCGGAACCGGAATCCTCAAAGTCCTCCAAAAGACTAAGACTGAGGATCTGCCAAAGCTG
AACTATGGTGCTGATCCAATCATCTCTTGA
Upstream Sequence
ATGGCGTCTTGGCTTCTCTTTTCTGTGTTCTCTTGTGTTCTTCTTCTTCCTGAACCTGCATTTGGGATTA
CTAGGCATTATACGCTGGACATAAAAATGCACAACGTGACACGTCTTTGCCACACAAAGAGTCTTGTTTC
TGTAAACGGGAAGTTCCCAGGGCCTAAGATTATAGCTAGAGAAGGTGACCAGCTTCTGATCAAAGTTGTT
AACCATGTCCCAAACAATATCTCTCTCCACTGGCATGGGATCAGGCAATTAAGGTCTGGTTGGGCTGATG
GGCCAGCTTACATAACCCAATGTCCTATCCAGACAGGGCAAAGCTATGTATACAACTACACCATTGTTGG
TCAAAGAGGCACTTTATGGTACCATGCTCACATCTCATGGCTCAGAGCAACGGTCTATGGTCCTCTCATC
ATTCTTCCCAAACACGGTGTTCCTTACCCGTTCCCCAAACCTCACAAAGAAGTCCCCATGGTCTTTGGTG
AGTGGTTCAACGCAGACACTGAGGCAATCATCCGCCAAGCAACGCTGACAGGAGGTGGTCCAAATGTCTC
TGATGCTTACACAATCAACGGTCTTCCTGGTCCACTATACAACTGCTCAGCTAAAGATACATTCAGACTG
AGAGTGAAGGCAGGGAAGACATACCTTCTCAGGATAATCAACGCTGCACTTAATGACGAGCTCTTCTTCA
GCATCGCAAACCACACGGTCACAGTCGTTGAAGCTGATGCCGTCTACGTCAAACCATTCGAGACTAACAC
CATCTTGATCGCTCCTGGCCAGACCACAAACGTCCTGCTCAAGACCAAACCGAGTTACCCTAGCGCCTCT
TTCTTCATGACCGCTAGACCATACGCCACAGGTCAAGGGACTTTCGATAACACTACAGTCGCAGGCATCT
TAGAATACGAACAGCCTAAACATGCAAAGACCAATATAAAGAATCTTCCTCTCTTCACGCCGGTGCTCCC
CGCTCTAAACGACACAAACTTCGCTACCAAGTTCAGCAACAAGCTGCGTAGCTTGAACAGCAAGAAGTTC
CCGGCGAACGTGCCTCAGGAGGTTGATAGGAAGTTCTTTTTCACGGTGGGGTTGGGGACTAACCCTTGTA
ACCATAAGAACAACCAGACGTGCCAGGGTCCTACCAACACCACAATGTTTGCTGCATCGATCAGCAACAT
CTCATTCACACTACCAACGAAGGCTCTCCTTCAATCTCACTACTCTGGGAGATCTAACGGAGTTTATTCT
CCGAACTTCCCGTGGAGTCCCATTGTTCCTTTTAACTACACAGGCACTCCGCCTAACAACACCATGGTTA
GCACCGGGACGAACTTGATGGTTCTGAGGTATAACACGAGTGTGGAGCTTGTGATGCAGGACACTAGCAT
TCTTGGTGCGGAGAGCCATCCTCTTCACCTTCATGGGTTCAACTTCTTTGTCGTTGGCCAAGGGTTTGGG
AACTTTGATCCGAACAAGGATCCTAAGAAGTTTAATCTTGTTGATCCGATAGAGAGGAACACGGTTGGTG
TGCCTTCTGGTGGATGGGCTGCTATTAGATTCCTTGCAGACAACCCAGGAGTGTGGTTCATGCACTGTCA
CTTGGAAGTGCATACCAGTTGGGGTCTGAGGATGGCTTGGCTTGTTCTTGATGGAGATAAGCCTGATCAG
AAACTGATTCCTCCTCCTGCAGACTTGCCCAACCGTTCTGTATCTCCCGCCTGTTACTCGATCTGCTGCT
TAAACCCTAAAACCCAACCTCCATCGCCATCCTTCGATTGGAGAGAGATGGAGACGAAGACGGAGATCGA
ATTCACCAAGGAGCCCTACATCGAAGACGTTGGTCCTCTCAAAATACAAAGCATAAACTTCTCAATGCTC
TCTGATATTGAAGTCATGAAAGCTGCTGAGGTTCAGGTCTGGAAGAATATGTACTACGAGTCCAATTTTA
AGCCTATTGAAGGCGGCTTGTTGGATCCTCGAATGGGTCCTCCTAACAAAAGGTCTACATGCGCAACCTG
TCATGGCAACTTCCAAAACTGTCCTGGACACTATGGCTATCTGAAGCTTGACCTTCCGGTTTATAACGTT
GGATTCTTCAATTTTATCCTTGACATTTTGAAGTGCATCTGTAAGAGCTGTTCCAGCATGCTTATTGAAG
AGAAGATGTATGAAGATCACTTGAAGAAGATGCGGAATCCAAGAACGGAGCCATTGAGGAAGACTGAATT
GGCCAAAGCGGTTGTCAAGAAGTGCAGTCTGATGGCTGGCCAGAGAGTTATTACTTGCAAAAAATGTGGA
TACCTCAATGgcatggtAAAGAAGGTCGCAGCGCAGTTGGGTATAGGCATCAGTCATGACCGATCTAAAA
TCCATGGTGGGGAGATTGATGAATTTAAATCTGCAATATCCCACACAAAAGAGTCTGCTGGTGGAATAAA
TCCTCTTACCTATGTTCTTGATCCTAACGTGGTGCTTAGACTTTTTAAAGGAATGAGTGACAAGGACTGC
GAACTTCTGTatattgctcatagaCCTGAGAATCTCATCATAACGTGCATGCTCGTGCCACCGTTATCAA
TCCGACCGTCTGTTATGATTGGTGGTACACAAAGCAATGAAAACGACATAACAGAGAGATTAAAGAAAAT
CATTCAAGACAATGCTTCTCTTCATAAGATTTTAAGCCAACCTACCACATCGCCCAAAAACATGCAAGTA
TGGGATACAGTTCAAAGCGAGGTTGCACAATACATTAATAGTGAAGTCCGAGGTGTCCAGATCATGCCAA
ACACCAAGCCACTGGCTGGACTCCTTCAGCGTCTCAAGGGAAAAGGGGGACGTTTCCGTGCAAACTTGTC
AGGGAAGCGTGTCGAGTTCACTGGTAGAACTGTTATTTCACCTGATCCCAATTTGAAAATTACAGAGGTA
GGGATTCCTATCCTTATGGCCCGGATCTTAACTTTTCCTGAATGTGTGTCCCGTCATAATATTGATAAGT
TGAGGCAACGCGTCCGCAATGGCCCTAATAAATACCCTGGTGCCAGAAATGTCAGATATCCAGATGGTTC
TTCAAGGACTTTGGTGGGTGATTATCGGAAGCGTATTGCTGATGAATTGACTTATGGATGCATAGTTGAC
CGTCATTTGGAAGACGGGGATGCTATTCTTTTCAACAGACAACCGAGTCTGCATCGGATGTCTATCATGT
GTCACAGGGCAAGAATAATGCCTTGGAGAACATTGAGGTTCAACGAATCGGTTTGTAACCCATATAATGC
TGATTTTGATGGTGATGAGATGAACATGCACGTACCACAAACAGAGGAGGCTCGGACAGAGGCTATTACA
TTGATGGGGgtacaaaacaatttatgcaccccaaaaaaTGGAGAAATGTTAGTAGCATCAACACAGGATT
TTTTAACATCTGCCTATTTGATAACGAGAAAGGACACGTTCTATGACCGTGCAGCCTTTTCACTTATATG
TTCTTACATGGGAGACGCCATGGATTCCATAGATTTGCCCACGCCCACAATCTTTAAGCCAATAGAGCTT
TGGACTGGTAAACAGGTTTTTAATGTTTTGCTGCGTCCAAACGCAAGTGTCAGAGTCTACGTAACTCTCA
ATGTGAAAGAGAAGAACTTCAGGAAGGGAGAACATTATGATGAAACAATGTGCATAAATGATGGATGGGT
TTATTTTCGGAACAGTGAGCTAATATCAGGACAACTGGGGAAGGCTACGTTAGGAAGTGGAAGCAAGGAT
GGATTATATTCAATTCTTCTTCGAGATTACAACTCCCATGCTGCTGCAGTCTGCATGAATCGTCTAGCAA
AGTTGAGTGCTCGATGGATTGGAATCCATGGCTTCTCCATTGGGATCGATGACGTTCAACCTGGAAAAAA
GTTGAAAGAGGACAGAGAGGTTATAGTGAAACGCCGATATAAGGATTGTGATGAATTACTTAAGAACTAT
GAAAAAGGAGATCTAGATGCTGCAAAAACACTGGAAGCTAACTTAACAGGGTTTCTTAATAAAATTCGAG
AAGAGACTGGGAAGCTCTGTATGGACGGATTACATTGGAGAAACAGTCCCTTGATCATGTCGCAATGCGG
TTCCAAGGGATCCCCTATCAATATCAGTCAGATGGTTACATGTGTTGGTCAGCAGACAGTTAATGGTAGC
CGTGCTCCTGATGGATTTATAGATCGAAGTCTTCCTCATTTCCCTAGAATGTCCAGAACCCCTGAAGCTA
AAGGTTTTGTTGCTAATTCGTTCTACGACGGCCTTAGTGCCACAGAGTTTTTCTTTCACACTATGGGTGG
ACGAGAAGGTCTAGTTGATACAGCGGTGAAAACTGCCAGTACAGGTTACATGTCTCGAAGACTGATGAAA
GCCTTGGAGGATCTCTTAGTCCATTATGATAACACAGTGCGAAACGCCAGCGGAAGCATACTTCAGTTTA
CTTATGGGGATGATGGGATGGACCCAGCACTGATGGAAGGAAAGAATGGAACGCCTTTAAATTTTGATAG
ATTATTTCTTAAAATTCAGGCCACTTGTCCTCCTAGATCACATCACAATTATCTTTCTTCAGACGAACTG
TTGCAAAAGTTCGAAGAGGAGTTAGTCAGGCAAGATACAAGTCGGGTGTGCACTGACGCCTTCGTGAAAT
CTCTAAGAGAATTTGTTTGTTTGCTCGGAGTAAAGTCTGCAAGCCCGAGCCAGATTTTCTCTAAAGGATC
TGGTGTGACTGATAAGCAACTCGAGGTATTTGTGAAAATTTGTGTATCTCGATACCGGGGGAAAACAATT
GAACCTGGGACTGCAATTGGACCAATAGGAGCTCAGAGTATCGGAGAACCAGGGACACAAATGACTCTGA
AAACTTTTCACTTTGCTGGAGTTGCTAGCATGAATATCACCCAAGGAGTTCCTCGAATCAACGAAATCAT
AAATGCTACCAAAACTATAAGCACACCCGTCATCTCTGCAGAACTTGAGAACCCCCTGGTAGAGGCTAGT
GCCCGAATGGCCAAAGGACGCATCGAGAAAACTACTTTAGGACAGGTTGCTGAGAGTATCGAGGTGCTAA
TGACTTCAACATCAGCGTCAGTGAGGATAACCCTTGACAAGAAAATAATAGAGGAGGCGTTTTTGTCTAT
AACCCCCTGGTCGGTTAAAAATTCCATACTAAAGACCAGAATCAAACTGCAGGATGAGAATATCAGGGTC
TTAGATACGGGATTGGATATTATTCCAAAGGGAGATCAAAATGGGACTCATTTCACTCTCCACAATCTGA
AGAATGTGCTGCCAAATGTTATAGTGAATGGGATCAAAACAGTTGAGCGAGTCGTTATAGCAGAGGATAC
AGATAAAAAGAAAGAGATTGGTGGGAAGAAAAGATTGAAACTGTTCGTGGAGGGAACAAACCTCCTGGAT
GTAATGGGCACTCCGGGAATCGATGGGAGAACTACTACAAGCAACAATATTGTCGAAGTGAGCAAAACAC
TGGGAATTGAGGCTGCAAGGACGACAATTATTGATGAAATAGGGTCAGTTATGGGAAACCATGGAATGAG
TATAGACATTCGTCACATGATGCTTTTGGCTGATGTCATGACTTACCGGGGGGAGGTACTTGGGATCCAA
AGAACCGGGATACAGAAGATGGACAAAAGTGTGCTGATGCAGGCATCTTTTGAGAGGACTGGAGATCATT
TATTTAGTGCAGCAATTAGCGGAAAAGTTGATAACATAGAGGGAGTCACAGAGTGTGTGATTATGGGCAT
ACCAATGAAACTCGGAACCGGAATCCTCAAAGTCCTCCAAAAGACTAAGACTGAGGATCTGCCAAAGCTG
AACTATGGTGCTGATCCAATCATCTCTTGA
Downstream Sequence
CGTATGTAAAGATTTGTTTATTCTGTTAAAATGTTGGGATTACACATGAAAGGTTGCAGA
TGATGATAAGATGTTTCATATTTTTTTGCAGATAAAAATGCACAACGTGACACGTCTTTG
CCACACAAAGAGTCTTGTTTCTGTAAACGGGAAGTTCCCAGGGCCTAAGATTATAGCTAG
AGAAGGTGACCAGCTTCTGATCAAAGTTGTTAACCATGTCCCAAACAATATCTCTCTCCA
CTGGTAATAATAATCTTTGAAACACTTTGAAGCACCATTAAGAAGCAATGTTTGATTACT
AACACCTGAAAATCTCAGGCATGGGATCAGGCAATTAAGGTCTGGTTGGGCTGATGGGCC
AGCTTACATAACCCAATGTCCTATCCAGACAGGGCAAAGCTATGTATACAACTACACCAT
TGTTGGTCAAAGAGGCACTTTATGGTACCATGCTCACATCTCATGGCTCAGAGCAACGGT
CTATGGTCCTCTCATCATTCTTCCCAAACACGGTGTTCCTTACCCGTTCCCCAAACCTCA
CAAAGAAGTCCCCATGGTCTTTGGTGAGTGGTTCAACGCAGACACTGAGGCAATCATCCG
CCAAGCAACGCTGACAGGAGGTGGTCCAAATGTCTCTGATGCTTACACAATCAACGGTCT
TCCTGGTCCACTATACAACTGCTCAGCTAAAGGTAAGTGTTGATTCCTACTACAACTAAT
AATGGTTTTGATAAAATATAATAACCAACAATGATGTTAAAAAAAAACAGATACATTCAG
ACTGAGAGTGAAGGCAGGGAAGACATACCTTCTCAGGATAATCAACGCTGCACTTAATGA
CGAGCTCTTCTTCAGCATCGCAAACCACACGGTCACAGTCGTTGAAGCTGATGCCGTCTA
CGTCAAACCATTCGAGACTAACACCATCTTGATCGCTCCTGGCCAGACCACAAACGTCCT
GCTCAAGACCAAACCGAGTTACCCTAGCGCCTCTTTCTTCATGACCGCTAGACCATACGC
CACAGGTCAAGGGACTTTCGATAACACTACAGTCGCAGGCATCTTAGAATACGAACAGCC
TAAACATGCAAAGACCAATATAAAGAATCTTCCTCTCTTCACGCCGGTGCTCCCCGCTCT
AAACGACACAAACTTCGCTACCAAGTTCAGCAACAAGCTGCGTAGCTTGAACAGCAAGAA
GTTCCCGGCGAACGTGCCTCAGGAGGTTGATAGGAAGTTCTTTTTCACGGTGGGGTTGGG
GACTAACCCTTGTAACCATAAGAACAACCAGACGTGCCAGGGTCCTACCAACACCACAAT
GTTTGCTGCATCGATCAGCAACATCTCATTCACACTACCAACGAAGGCTCTCCTTCAATC
TCACTACTCTGGGAGATCTAACGGAGTTTATTCTCCGAACTTCCCGTGGAGTCCCATTGT
TCCTTTTAACTACACAGGCACTCCGCCTAACAACACCATGGTTAGCACCGGGACGAACTT
GATGGTTCTGAGGTATAACACGAGTGTGGAGCTTGTGATGCAGGACACTAGCATTCTTGG
TGCGGAGAGCCATCCTCTTCACCTTCATGGGTTCAACTTCTTTGTCGTTGGCCAAGGGTT
TGGGAACTTTGATCCGAACAAGGATCCTAAGAAGTTTAATCTTGTTGATCCGATAGAGAG
GAACACGGTTGGTGTGCCTTCTGGTGGATGGGCTGCTATTAGATTCCTTGCAGACAACCC
AGGTAAGAGAAAACACTATAAACCAAAGAAATATAGCTTTTATGTTAGTCTAATCTTATA
TGAAAATTAAAAAAATGTAATGACAGGAGTGTGGTTCATGCACTGTCACTTGGAAGTGCA
TACCAGTTGGGGTCTGAGGATGGCTTGGCTTGTTCTTGATGGAGATAAGCCTGATCAGAA
ACTGATTCCTCCTCCTGCAGACTTGCCCAAGTGCTGATAAATCTTGCTATCTTTTTTGTT
TCTTGTCTCTTCTTTCTTCC
Pro Sequence
MASWLLFSVFSCVLLLPEPAFGITRHYTLDIKMHNVTRLCHTKSLVSVNG
KFPGPKIIAREGDQLLIKVVNHVPNNISLHWHGIRQLRSGWADGPAYITQ
CPIQTGQSYVYNYTIVGQRGTLWYHAHISWLRATVYGPLIILPKHGVPYP
FPKPHKEVPMVFGEWFNADTEAIIRQATLTGGGPNVSDAYTINGLPGPLY
NCSAKDTFRLRVKAGKTYLLRIINAALNDELFFSIANHTVTVVEADAVYV
KPFETNTILIAPGQTTNVLLKTKPSYPSASFFMTARPYATGQGTFDNTTV
AGILEYEQPKHAKTNIKNLPLFTPVLPALNDTNFATKFSNKLRSLNSKKF
PANVPQEVDRKFFFTVGLGTNPCNHKNNQTCQGPTNTTMFAASISNISFT
LPTKALLQSHYSGRSNGVYSPNFPWSPIVPFNYTGTPPNNTMVSTGTNLM
VLRYNTSVELVMQDTSILGAESHPLHLHGFNFFVVGQGFGNFDPNKDPKK
FNLVDPIERNTVGVPSGGWAAIRFLADNPGVWFMHCHLEVHTSWGLRMAW
LVLDGDKPDQKLIPPPADLPNRSVSPACYSICCLNPKTQPPSPSFDWREM
ETKTEIEFTKEPYIEDVGPLKIQSINFSMLSDIEVMKAAEVQVWKNMYYE
SNFKPIEGGLLDPRMGPPNKRSTCATCHGNFQNCPGHYGYLKLDLPVYNV
GFFNFILDILKCICKSCSSMLIEEKMYEDHLKKMRNPRTEPLRKTELAKA
VVKKCSLMAGQRVITCKKCGYLNGMVKKVAAQLGIGISHDRSKIHGGEID
EFKSAISHTKESAGGINPLTYVLDPNVVLRLFKGMSDKDCELLYIAHRPE
NLIITCMLVPPLSIRPSVMIGGTQSNENDITERLKKIIQDNASLHKILSQ
PTTSPKNMQVWDTVQSEVAQYINSEVRGVQIMPNTKPLAGLLQRLKGKGG
RFRANLSGKRVEFTGRTVISPDPNLKITEVGIPILMARILTFPECVSRHN
IDKLRQRVRNGPNKYPGARNVRYPDGSSRTLVGDYRKRIADELTYGCIVD
RHLEDGDAILFNRQPSLHRMSIMCHRARIMPWRTLRFNESVCNPYNADFD
GDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGEMLVASTQDFLTSAYL
ITRKDTFYDRAAFSLICSYMGDAMDSIDLPTPTIFKPIELWTGKQVFNVL
LRPNASVRVYVTLNVKEKNFRKGEHYDETMCINDGWVYFRNSELISGQLG
KATLGSGSKDGLYSILLRDYNSHAAAVCMNRLAKLSARWIGIHGFSIGID
DVQPGKKLKEDREVIVKRRYKDCDELLKNYEKGDLDAAKTLEANLTGFLN
KIREETGKLCMDGLHWRNSPLIMSQCGSKGSPINISQMVTCVGQQTVNGS
RAPDGFIDRSLPHFPRMSRTPEAKGFVANSFYDGLSATEFFFHTMGGREG
LVDTAVKTASTGYMSRRLMKALEDLLVHYDNTVRNASGSILQFTYGDDGM
DPALMEGKNGTPLNFDRLFLKIQATCPPRSHHNYLSSDELLQKFEEELVR
QDTSRVCTDAFVKSLREFVCLLGVKSASPSQIFSKGSGVTDKQLEVFVKI
CVSRYRGKTIEPGTAIGPIGAQSIGEPGTQMTLKTFHFAGVASMNITQGV
PRINEIINATKTISTPVISAELENPLVEASARMAKGRIEKTTLGQVAESI
EVLMTSTSASVRITLDKKIIEEAFLSITPWSVKNSILKTRIKLQDENIRV
LDTGLDIIPKGDQNGTHFTLHNLKNVLPNVIVNGIKTVERVVIAEDTDKK
KEIGGKKRLKLFVEGTNLLDVMGTPGIDGRTTTSNNIVEVSKTLGIEAAR
TTIIDEIGSVMGNHGMSIDIRHMMLLADVMTYRGEVLGIQRTGIQKMDKS
VLMQASFERTGDHLFSAAISGKVDNIEGVTECVIMGIPMKLGTGILKVLQ
KTKTEDLPKLNYGADPIIS
mRNA Sequence
ATGGCGTCTTGGCTTCTCTTTTCTGTGTTCTCTTGTGTTCTTCTTCTTCCTGAACCTGCATTTGGGATTA
CTAGGCATTATACGCTGGACATAAAAATGCACAACGTGACACGTCTTTGCCACACAAAGAGTCTTGTTTC
TGTAAACGGGAAGTTCCCAGGGCCTAAGATTATAGCTAGAGAAGGTGACCAGCTTCTGATCAAAGTTGTT
AACCATGTCCCAAACAATATCTCTCTCCACTGGCATGGGATCAGGCAATTAAGGTCTGGTTGGGCTGATG
GGCCAGCTTACATAACCCAATGTCCTATCCAGACAGGGCAAAGCTATGTATACAACTACACCATTGTTGG
TCAAAGAGGCACTTTATGGTACCATGCTCACATCTCATGGCTCAGAGCAACGGTCTATGGTCCTCTCATC
ATTCTTCCCAAACACGGTGTTCCTTACCCGTTCCCCAAACCTCACAAAGAAGTCCCCATGGTCTTTGGTG
AGTGGTTCAACGCAGACACTGAGGCAATCATCCGCCAAGCAACGCTGACAGGAGGTGGTCCAAATGTCTC
TGATGCTTACACAATCAACGGTCTTCCTGGTCCACTATACAACTGCTCAGCTAAAGATACATTCAGACTG
AGAGTGAAGGCAGGGAAGACATACCTTCTCAGGATAATCAACGCTGCACTTAATGACGAGCTCTTCTTCA
GCATCGCAAACCACACGGTCACAGTCGTTGAAGCTGATGCCGTCTACGTCAAACCATTCGAGACTAACAC
CATCTTGATCGCTCCTGGCCAGACCACAAACGTCCTGCTCAAGACCAAACCGAGTTACCCTAGCGCCTCT
TTCTTCATGACCGCTAGACCATACGCCACAGGTCAAGGGACTTTCGATAACACTACAGTCGCAGGCATCT
TAGAATACGAACAGCCTAAACATGCAAAGACCAATATAAAGAATCTTCCTCTCTTCACGCCGGTGCTCCC
CGCTCTAAACGACACAAACTTCGCTACCAAGTTCAGCAACAAGCTGCGTAGCTTGAACAGCAAGAAGTTC
CCGGCGAACGTGCCTCAGGAGGTTGATAGGAAGTTCTTTTTCACGGTGGGGTTGGGGACTAACCCTTGTA
ACCATAAGAACAACCAGACGTGCCAGGGTCCTACCAACACCACAATGTTTGCTGCATCGATCAGCAACAT
CTCATTCACACTACCAACGAAGGCTCTCCTTCAATCTCACTACTCTGGGAGATCTAACGGAGTTTATTCT
CCGAACTTCCCGTGGAGTCCCATTGTTCCTTTTAACTACACAGGCACTCCGCCTAACAACACCATGGTTA
GCACCGGGACGAACTTGATGGTTCTGAGGTATAACACGAGTGTGGAGCTTGTGATGCAGGACACTAGCAT
TCTTGGTGCGGAGAGCCATCCTCTTCACCTTCATGGGTTCAACTTCTTTGTCGTTGGCCAAGGGTTTGGG
AACTTTGATCCGAACAAGGATCCTAAGAAGTTTAATCTTGTTGATCCGATAGAGAGGAACACGGTTGGTG
TGCCTTCTGGTGGATGGGCTGCTATTAGATTCCTTGCAGACAACCCAGGAGTGTGGTTCATGCACTGTCA
CTTGGAAGTGCATACCAGTTGGGGTCTGAGGATGGCTTGGCTTGTTCTTGATGGAGATAAGCCTGATCAG
AAACTGATTCCTCCTCCTGCAGACTTGCCCAACCGTTCTGTATCTCCCGCCTGTTACTCGATCTGCTGCT
TAAACCCTAAAACCCAACCTCCATCGCCATCCTTCGATTGGAGAGAGATGGAGACGAAGACGGAGATCGA
ATTCACCAAGGAGCCCTACATCGAAGACGTTGGTCCTCTCAAAATACAAAGCATAAACTTCTCAATGCTC
TCTGATATTGAAGTCATGAAAGCTGCTGAGGTTCAGGTCTGGAAGAATATGTACTACGAGTCCAATTTTA
AGCCTATTGAAGGCGGCTTGTTGGATCCTCGAATGGGTCCTCCTAACAAAAGGTCTACATGCGCAACCTG
TCATGGCAACTTCCAAAACTGTCCTGGACACTATGGCTATCTGAAGCTTGACCTTCCGGTTTATAACGTT
GGATTCTTCAATTTTATCCTTGACATTTTGAAGTGCATCTGTAAGAGCTGTTCCAGCATGCTTATTGAAG
AGAAGATGTATGAAGATCACTTGAAGAAGATGCGGAATCCAAGAACGGAGCCATTGAGGAAGACTGAATT
GGCCAAAGCGGTTGTCAAGAAGTGCAGTCTGATGGCTGGCCAGAGAGTTATTACTTGCAAAAAATGTGGA
TACCTCAATGgcatggtAAAGAAGGTCGCAGCGCAGTTGGGTATAGGCATCAGTCATGACCGATCTAAAA
TCCATGGTGGGGAGATTGATGAATTTAAATCTGCAATATCCCACACAAAAGAGTCTGCTGGTGGAATAAA
TCCTCTTACCTATGTTCTTGATCCTAACGTGGTGCTTAGACTTTTTAAAGGAATGAGTGACAAGGACTGC
GAACTTCTGTatattgctcatagaCCTGAGAATCTCATCATAACGTGCATGCTCGTGCCACCGTTATCAA
TCCGACCGTCTGTTATGATTGGTGGTACACAAAGCAATGAAAACGACATAACAGAGAGATTAAAGAAAAT
CATTCAAGACAATGCTTCTCTTCATAAGATTTTAAGCCAACCTACCACATCGCCCAAAAACATGCAAGTA
TGGGATACAGTTCAAAGCGAGGTTGCACAATACATTAATAGTGAAGTCCGAGGTGTCCAGATCATGCCAA
ACACCAAGCCACTGGCTGGACTCCTTCAGCGTCTCAAGGGAAAAGGGGGACGTTTCCGTGCAAACTTGTC
AGGGAAGCGTGTCGAGTTCACTGGTAGAACTGTTATTTCACCTGATCCCAATTTGAAAATTACAGAGGTA
GGGATTCCTATCCTTATGGCCCGGATCTTAACTTTTCCTGAATGTGTGTCCCGTCATAATATTGATAAGT
TGAGGCAACGCGTCCGCAATGGCCCTAATAAATACCCTGGTGCCAGAAATGTCAGATATCCAGATGGTTC
TTCAAGGACTTTGGTGGGTGATTATCGGAAGCGTATTGCTGATGAATTGACTTATGGATGCATAGTTGAC
CGTCATTTGGAAGACGGGGATGCTATTCTTTTCAACAGACAACCGAGTCTGCATCGGATGTCTATCATGT
GTCACAGGGCAAGAATAATGCCTTGGAGAACATTGAGGTTCAACGAATCGGTTTGTAACCCATATAATGC
TGATTTTGATGGTGATGAGATGAACATGCACGTACCACAAACAGAGGAGGCTCGGACAGAGGCTATTACA
TTGATGGGGgtacaaaacaatttatgcaccccaaaaaaTGGAGAAATGTTAGTAGCATCAACACAGGATT
TTTTAACATCTGCCTATTTGATAACGAGAAAGGACACGTTCTATGACCGTGCAGCCTTTTCACTTATATG
TTCTTACATGGGAGACGCCATGGATTCCATAGATTTGCCCACGCCCACAATCTTTAAGCCAATAGAGCTT
TGGACTGGTAAACAGGTTTTTAATGTTTTGCTGCGTCCAAACGCAAGTGTCAGAGTCTACGTAACTCTCA
ATGTGAAAGAGAAGAACTTCAGGAAGGGAGAACATTATGATGAAACAATGTGCATAAATGATGGATGGGT
TTATTTTCGGAACAGTGAGCTAATATCAGGACAACTGGGGAAGGCTACGTTAGGAAGTGGAAGCAAGGAT
GGATTATATTCAATTCTTCTTCGAGATTACAACTCCCATGCTGCTGCAGTCTGCATGAATCGTCTAGCAA
AGTTGAGTGCTCGATGGATTGGAATCCATGGCTTCTCCATTGGGATCGATGACGTTCAACCTGGAAAAAA
GTTGAAAGAGGACAGAGAGGTTATAGTGAAACGCCGATATAAGGATTGTGATGAATTACTTAAGAACTAT
GAAAAAGGAGATCTAGATGCTGCAAAAACACTGGAAGCTAACTTAACAGGGTTTCTTAATAAAATTCGAG
AAGAGACTGGGAAGCTCTGTATGGACGGATTACATTGGAGAAACAGTCCCTTGATCATGTCGCAATGCGG
TTCCAAGGGATCCCCTATCAATATCAGTCAGATGGTTACATGTGTTGGTCAGCAGACAGTTAATGGTAGC
CGTGCTCCTGATGGATTTATAGATCGAAGTCTTCCTCATTTCCCTAGAATGTCCAGAACCCCTGAAGCTA
AAGGTTTTGTTGCTAATTCGTTCTACGACGGCCTTAGTGCCACAGAGTTTTTCTTTCACACTATGGGTGG
ACGAGAAGGTCTAGTTGATACAGCGGTGAAAACTGCCAGTACAGGTTACATGTCTCGAAGACTGATGAAA
GCCTTGGAGGATCTCTTAGTCCATTATGATAACACAGTGCGAAACGCCAGCGGAAGCATACTTCAGTTTA
CTTATGGGGATGATGGGATGGACCCAGCACTGATGGAAGGAAAGAATGGAACGCCTTTAAATTTTGATAG
ATTATTTCTTAAAATTCAGGCCACTTGTCCTCCTAGATCACATCACAATTATCTTTCTTCAGACGAACTG
TTGCAAAAGTTCGAAGAGGAGTTAGTCAGGCAAGATACAAGTCGGGTGTGCACTGACGCCTTCGTGAAAT
CTCTAAGAGAATTTGTTTGTTTGCTCGGAGTAAAGTCTGCAAGCCCGAGCCAGATTTTCTCTAAAGGATC
TGGTGTGACTGATAAGCAACTCGAGGTATTTGTGAAAATTTGTGTATCTCGATACCGGGGGAAAACAATT
GAACCTGGGACTGCAATTGGACCAATAGGAGCTCAGAGTATCGGAGAACCAGGGACACAAATGACTCTGA
AAACTTTTCACTTTGCTGGAGTTGCTAGCATGAATATCACCCAAGGAGTTCCTCGAATCAACGAAATCAT
AAATGCTACCAAAACTATAAGCACACCCGTCATCTCTGCAGAACTTGAGAACCCCCTGGTAGAGGCTAGT
GCCCGAATGGCCAAAGGACGCATCGAGAAAACTACTTTAGGACAGGTTGCTGAGAGTATCGAGGTGCTAA
TGACTTCAACATCAGCGTCAGTGAGGATAACCCTTGACAAGAAAATAATAGAGGAGGCGTTTTTGTCTAT
AACCCCCTGGTCGGTTAAAAATTCCATACTAAAGACCAGAATCAAACTGCAGGATGAGAATATCAGGGTC
TTAGATACGGGATTGGATATTATTCCAAAGGGAGATCAAAATGGGACTCATTTCACTCTCCACAATCTGA
AGAATGTGCTGCCAAATGTTATAGTGAATGGGATCAAAACAGTTGAGCGAGTCGTTATAGCAGAGGATAC
AGATAAAAAGAAAGAGATTGGTGGGAAGAAAAGATTGAAACTGTTCGTGGAGGGAACAAACCTCCTGGAT
GTAATGGGCACTCCGGGAATCGATGGGAGAACTACTACAAGCAACAATATTGTCGAAGTGAGCAAAACAC
TGGGAATTGAGGCTGCAAGGACGACAATTATTGATGAAATAGGGTCAGTTATGGGAAACCATGGAATGAG
TATAGACATTCGTCACATGATGCTTTTGGCTGATGTCATGACTTACCGGGGGGAGGTACTTGGGATCCAA
AGAACCGGGATACAGAAGATGGACAAAAGTGTGCTGATGCAGGCATCTTTTGAGAGGACTGGAGATCATT
TATTTAGTGCAGCAATTAGCGGAAAAGTTGATAACATAGAGGGAGTCACAGAGTGTGTGATTATGGGCAT
ACCAATGAAACTCGGAACCGGAATCCTCAAAGTCCTCCAAAAGACTAAGACTGAGGATCTGCCAAAGCTG
AACTATGGTGCTGATCCAATCATCTCTTGA