PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_028118553.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; Ericales; Theaceae; Camellia; Camellia sinensis
Family C2H2
Protein Properties Length: 422aa    MW: 47068.1 Da    PI: 8.4001
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_028118553.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H218.74.9e-065173123
                    EEETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
                    y+C++C++ F r  nL+ H r+H
  XP_028118553.1 51 YVCEICNQGFQRDQNLQMHRRRH 73
                    9*********************9 PP

2zf-C2H217.41.3e-05130152123
                     EEETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                     ++C++C+k +  +s++k H++t+
  XP_028118553.1 130 WVCEKCSKAYAVQSDYKAHLKTC 152
                     58*******************98 PP

3zf-C2H2110.0013160178523
                     TTTEEESSHHHHHHHHHHT CS
         zf-C2H2   5 dCgksFsrksnLkrHirtH 23 
                     dCg++Fsr +++  H+ t+
  XP_028118553.1 160 DCGRVFSRVESFIEHQDTC 178
                     9***************987 PP

Sequence ? help Back to Top
Protein Sequence    Length: 422 aa     Download sequence    
MLDNNTGSVG PSSSSENGVA KKRKRRPAGT PDPDAEVVSL SPKTLLESDQ YVCEICNQGF  60
QRDQNLQMHR RRHKVPWKLV KREESVGEVK KRVFVCPEPS CLHHDPCHAL GDLVGIKKHF  120
RRKHSNNKQW VCEKCSKAYA VQSDYKAHLK TCGTRGHSCD CGRVFSRVES FIEHQDTCTL  180
RPALPELQAL QPSWSPRTAS SMTPTNDTNF SIAPILPRLA VPKPVGPVFL CPERHSSSNT  240
DDQQHNLELQ LLSSSSNAYA SALQICDENH ATNLKLSIGS CSQGEQNCTN LDAGRSYRPE  300
RNATETRLEA SRLKEEADEK LKLAMFEKAY AEEARIQAKK QIEMAELEFA NAKRIRQQAQ  360
AELERAQLLK EKATKKISST ILLITCHSCK QQFQATTTSI ARADETSLAA SYMSSATTTE  420
GD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12126KKRKRR
22227KRKRRP
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G01940.11e-146C2H2 family protein