PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID AUR62020997-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Atripliceae; Chenopodium
Family C2H2
Protein Properties Length: 383aa    MW: 44411.4 Da    PI: 7.3722
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
AUR62020997-RAgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H221.94.6e-076788223
                    EETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2  2 kCpdCgksFsrksnLkrHirtH 23
                    +C+ Cg sF+ + +Lk+H+ +H
  AUR62020997-RA 67 TCEECGASFKKPAHLKQHMLSH 88
                    6*******************99 PP

2zf-C2H221.56.1e-07114138123
                     EEET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2   1 ykCp..dCgksFsrksnLkrHirtH 23 
                     ++Cp  dC+ s++rk++L+rHi  H
  AUR62020997-RA 114 FVCPinDCNASYRRKDHLNRHILQH 138
                     89********************988 PP

3zf-C2H219.42.9e-06183207123
                     EEET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2   1 ykCp..dCgksFsrksnLkrHirtH 23 
                     ++C+   Cgk F+  s+L++H  tH
  AUR62020997-RA 183 HVCEenGCGKEFKYASQLRKHAETH 207
                     789999***************9999 PP

Sequence ? help Back to Top
Protein Sequence    Length: 383 aa     Download sequence    
MGEELEEREI GPIFRDIRRY YCEFCSICRS KKSLINSHIL SHHQEELEKK REVEGEDKNE  60
GQNCNNTCEE CGASFKKPAH LKQHMLSHSL EDGGPPEKLR QAQPLDTHYY KRSFVCPIND  120
CNASYRRKDH LNRHILQHQG KLFKCPIENC CKEFSVQGNV SRHLKEFHED KPEPDTGKGE  180
FKHVCEENGC GKEFKYASQL RKHAETHGKF LLCLCYFEMF VVLVVKLTED ITCSVLESSE  240
TFCADPSCMK PFANVDCLKA HIRSCHQYVN CEKSNMQQHV KAVHLKLRPF ICSFTGCGMR  300
FAFKHVRDNH EKSTRHVYVH GDLEEFDEQF RSRPRGGRKR KLPNIEMLMR KRISAPNQCD  360
GVLDDSSEFL SWLLSDDDNS ES*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1333341RPRGGRKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G72050.21e-102C2H2 family protein