PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pd.00g417520.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family C3H
Protein Properties Length: 1193 aa    MW: 129051 Da    pI: 6.7308
Description C3H family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Pd.00g417520.m01  genome  GDR     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  34.9   2.6e-11  190    212  5          27
                       --SGGGGTS--TTTTT-SS-SSS CS
           zf-CCCH   5 lCrffartGtCkyGdrCkFaHgp 27 
                        C+f+++tGtCk+G++CkF+H++
  Pd.00g417520.m01 190 DCSFYLKTGTCKFGSSCKFNHPR 212
                       5********************96 PP

2    zf-CCCH  24.4   4.9e-08  283    306  3          26
                       S---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH   3 telCrffartGtCkyGdrCkFaHg 26 
                       ++ C+f++r+  C +G +C+F+H+
  Pd.00g417520.m01 283 EKDCSFYMRNASCMFGTNCRFNHP 306
                       679********************9 PP

3    zf-CCCH  24.6   4.2e-08  680    704  2          26
                       -S---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH   2 ktelCrffartGtCkyGdrCkFaHg 26 
                       +++ C+f++r+  C +G +C+F+H+
  Pd.00g417520.m01 680 GEKDCSFYMRNASCMFGTNCRFNHP 704
                       7889********************9 PP
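The three hits above appear to share the same cysteine/histidine spacing, which can be checked directly against the aligned query segments. The sketch below assumes the commonly described C-x8-C-x5-C-x3-H spacing for this type of zinc finger; that pattern, the `CCCH` regex, and the `motif_spans` helper are illustrative assumptions and are not part of this record. The start positions and segments are copied from the alignments.

```python
import re

# Assumed spacing C-x8-C-x5-C-x3-H (not stated in this record).
CCCH = re.compile(r"C.{8}C.{5}C.{3}H")

# (domain start on the protein, aligned query segment) from the three hits.
HITS = [
    (190, "DCSFYLKTGTCKFGSSCKFNHPR"),
    (283, "EKDCSFYMRNASCMFGTNCRFNHP"),
    (680, "GEKDCSFYMRNASCMFGTNCRFNHP"),
]

def motif_spans(hits=HITS):
    """Return 1-based inclusive protein coordinates of the first motif
    match in each segment, or None where the pattern does not match."""
    spans = []
    for start, segment in hits:
        m = CCCH.search(segment)
        if m is None:
            spans.append(None)
        else:
            # m.end() is exclusive, so subtract 1 for an inclusive end.
            spans.append((start + m.start(), start + m.end() - 1))
    return spans

for span in motif_spans():
    print(span)
```

A `None` entry would mean the assumed spacing does not fit that segment, not that the database hit is wrong.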

Sequence
Protein Sequence    Length: 1193 aa
MLPAARFLLP VSVSVLYVLY KDLRSGRATE LSEDIAVPLN PDNNDVVSRQ PSHLDPAILE  60
EIRKFDPAVL DEFRKFDPAL LDEFRKFDPA ALDEFRKFDP AALDVIRKLD PAIIDQIHKL  120
CVKEKVEGKE EAERNCSGRE NENGNETQSE ESGGGGGENK NENGGEVEKK VVNEERSRRH  180
HYPVRPEAGD CSFYLKTGTC KFGSSCKFNH PRRRKTNKVS KVKMKEREGL AAEKPGLTES  240
KDYSGSGGCK YEKPCSFNPR REEPSVAPIL ECNFLGLPIR PEEKDCSFYM RNASCMFGTN  300
CRFNHPDPTA ARESDPPSGY GNGGSASLQG ALSSTAAPWS APRSLNDAPL YVPMVIPPSQ  360
GIPSQDTEWN GYQAPAYLQE RSMPAHQPYL MNNSVTGTNV YKQYPHQQAE EFPERPGQPY  420
PPHAPFETRS EHLHTLHSLW AFANLGQLVN LTILLYRFYK YLRTELIRVP QNPPELSDLD  480
LAILEEIRKL GPALLDKILK DPTLLDEFRK IDPAVLHEFR KLDPAIIGKI LQFCLKDNKE  540
GEEEEDRSTC SGRETAKQNE NGNEETDGGG GENKNENGGE VEKKVVDEER SRSQHYPVRP  600
GAGEGKTARK NRKNNQVSKD KMKEREGLAA EKPGQTECKD YSRSGGRKYE KACSLNPGRG  660
EPSVAPILEC NFLGLPIRPG EKDCSFYMRN ASCMFGTNCR FNHPDPTAAR ESEPPSGYGN  720
GGSASLQSAS SSTAAPWSAP RSLNDAPLYV PMVIPHVQIG SWGNADGVWA QSEAGNKDEA  780
SGWTKPAFIN ENQNDSWKKP SGVDDNKRAS WGKADGGSTW TKQDGDATWN KQGEGSTWNK  840
QDGSSDWNKP AGDSSWSKQA GGSSWGKQAD VTAGHESGGV GNQDIGWKRA SSFGGSQSID  900
GVNGDQPEDF NNNRSGGNWR GGSGRGNSDR GGFRGGRGFV GRGGDREEDR GGFGGRGGDR  960
GSFGGRGRSD KGGFGGRGYG GRGRGRYQSG GWSNRNESIE NNSSGWSKGA DGAGEGWKKD  1020
NGGGSWNQTV EPKYTAGTQD KGTGSHNEVG RSWGNNWKSS DASNGDQSSR WKQSTAAEEV  1080
KGNTDQDGGW NKGPSSNAQA GGWGNQGSGW NKGTGSGFGG GTGEHPSAAV GGQSSDWKQS  1140
SAAGGAQSSS WNQSGEAKQG TDEGAKPTIS WGKAAAASSW GKGSDGGSSK GGW
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    211    215  PRRRK
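The Start and End values in this record read as 1-based, inclusive positions on the protein sequence. As a minimal sketch under that assumption, the snippet below slices the 181-240 block of the sequence listing to recover the first zf-CCCH hit (190 to 212) and the NLS (211 to 215). The `extract` helper and the constant names are hypothetical and only serve the illustration.

```python
# Residues 181-240 of Pd.00g417520.m01, copied from the sequence listing
# with the spacing and the position counter removed.
BLOCK_START = 181
BLOCK = "HYPVRPEAGDCSFYLKTGTCKFGSSCKFNHPRRRKTNKVSKVKMKEREGLAAEKPGLTES"

def extract(start, end, block=BLOCK, block_start=BLOCK_START):
    """Return residues start..end (1-based, inclusive) from the block."""
    lo = start - block_start          # 0-based index of the first residue
    hi = end - block_start + 1        # exclusive end for Python slicing
    if lo < 0 or hi > len(block) or lo >= hi:
        raise ValueError("coordinates fall outside the stored block")
    return block[lo:hi]

domain_1 = extract(190, 212)   # first zf-CCCH hit
nls_1 = extract(211, 215)      # NLS entry

print(domain_1)
print(nls_1)
```

Both slices can be compared by eye with the query line of the first alignment and with the NLS sequence column.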
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G48440.1  2e-59    C3H family protein