PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022721316.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C3H
Protein Properties Length: 816 aa    MW: 94888.4 Da    pI: 7.9922
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022721316.1 genome NCBI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 zf-CCCH 19.3 2e-06 224 243 6 25
                     -SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                     C+f+++tG C++G rC++ H
  XP_022721316.1 224 CPFHLKTGACRFGQRCSRVH 243
                     ******************99 PP

2 zf-CCCH 24.8 3.9e-08 353 380 1 26
                     --S---SGGGGTS..--TTTTT-SS-SS CS
         zf-CCCH   1 yktelCrffartG..tCkyGdrCkFaHg 26 
                     +k ++C  f+++   tC++G  C+F+H+
  XP_022721316.1 353 WKVAICGEFMKSRfkTCSHGTACNFIHC 380
                     899************************8 PP
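The two alignments above can be cross-checked against the cysteine/histidine spacing usually quoted for CCCH zinc fingers. The sketch below is illustrative only: the spacing pattern C-x(6-14)-C-x(4-5)-C-x(3)-H is an assumed generic motif, not the zf-CCCH HMM that produced the scores in the table, and the names `DOMAIN_SEGMENTS`, `CCCH_PATTERN` and `ccch_spacing` are hypothetical.

```python
import re

# Query segments copied from the two zf-CCCH alignments in this record.
DOMAIN_SEGMENTS = {
    1: "CPFHLKTGACRFGQRCSRVH",          # residues 224-243
    2: "WKVAICGEFMKSRFKTCSHGTACNFIHC",  # residues 353-380
}

# Assumed generic CCCH spacing (C-x6-14-C-x4-5-C-x3-H). This is an
# illustrative regular expression, not the profile HMM used by the database.
CCCH_PATTERN = re.compile(r"C(.{6,14})C(.{4,5})C(.{3})H")


def ccch_spacing(segment):
    """Return the three spacer lengths of the first CCCH match, or None."""
    match = CCCH_PATTERN.search(segment)
    if match is None:
        return None
    return tuple(len(group) for group in match.groups())


spacings = {no: ccch_spacing(seg) for no, seg in DOMAIN_SEGMENTS.items()}
for no, spacing in spacings.items():
    print(no, spacing)
```

Under this assumed pattern, domain 1 reads as C-x8-C-x5-C-x3-H and domain 2 as C-x10-C-x5-C-x3-H.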

Sequence
Protein Sequence    Length: 816 aa
MGEAEAEAEA ILKQEEGGGK RENHPMEKSR KEKRKEMKKI KRKQLRKEAA EKEREAEEAR  60
LNDPEEQKRI AREEEEELKR REIALNEFEE RERSWIEAME MKRKAQEDEE QEEKKMNDLN  120
EDSNGEQRKQ EEMGDDWEYV EEGPAEIIWQ GNEIIVRKKK VRVPKGEGNQ KSKGEDADRP  180
TSNPLPPQSE AFADYLNASS AQQVLESVAK EVPNFGTEQD KAHCPFHLKT GACRFGQRCS  240
RVHFYPDKSC TLLIRNMYNG PGLAWEQDEG LEYTDEEVER CYEEFYEDVH TEFLKFGEIV  300
NFKVCKNGSF HLRGNVYVHY KLLESAVLAY DSINGRYFAG KQVQCEFVNI TRWKVAICGE  360
FMKSRFKTCS HGTACNFIHC FRNPGGDYEW ADLDKPPPRY WVKKMAALFG YADEAVFEKQ  420
TEQEHSGHSR TSRMVKLDDG RHYSRRSKSR EKVRFIDGAQ ESKHSQRGMN NDRKQRKILD  480
GRWDGQNTSL KWDQNSERSH DTSSDGGCLD SQMGKNNDRK QTKILDERLD RQSKSLKRGQ  540
NSERIHDTSS DGGYSESDIN GTRDTDQVRR HCHAKKNSKR QNESSEYLAD HKNIENRVYE  600
DTENLHGHTK KRRRHCSKVG YPDDDGGSDI QTHKDNGDQL NRDRDNEKHC SHGGKSSRHH  660
IKESPLDDLG ASKNRTNVGD SLLKQSSRVS DREGRHHDRQ KSLGNLDQVS GLLDDHGKAY  720
GTSDSNSSDH GSERDKRRHH GHRRKRSRHV VECSEFPDAS GHQAKKLKGK RDLKRYRHST  780
HRYETDSSED YEMKHNVKRR SHGHKRHDEK KPNRDS
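The sequence above is displayed in 10-residue groups with a running position at the end of each line, so it needs cleaning before use in other tools. The following is a minimal sketch, assuming the block is pasted verbatim into a string; `RAW_BLOCK`, `clean_sequence` and `region` are hypothetical names introduced for illustration.

```python
import re

# Sequence block as displayed in this record (groups of 10, position at line end).
RAW_BLOCK = """
MGEAEAEAEA ILKQEEGGGK RENHPMEKSR KEKRKEMKKI KRKQLRKEAA EKEREAEEAR  60
LNDPEEQKRI AREEEEELKR REIALNEFEE RERSWIEAME MKRKAQEDEE QEEKKMNDLN  120
EDSNGEQRKQ EEMGDDWEYV EEGPAEIIWQ GNEIIVRKKK VRVPKGEGNQ KSKGEDADRP  180
TSNPLPPQSE AFADYLNASS AQQVLESVAK EVPNFGTEQD KAHCPFHLKT GACRFGQRCS  240
RVHFYPDKSC TLLIRNMYNG PGLAWEQDEG LEYTDEEVER CYEEFYEDVH TEFLKFGEIV  300
NFKVCKNGSF HLRGNVYVHY KLLESAVLAY DSINGRYFAG KQVQCEFVNI TRWKVAICGE  360
FMKSRFKTCS HGTACNFIHC FRNPGGDYEW ADLDKPPPRY WVKKMAALFG YADEAVFEKQ  420
TEQEHSGHSR TSRMVKLDDG RHYSRRSKSR EKVRFIDGAQ ESKHSQRGMN NDRKQRKILD  480
GRWDGQNTSL KWDQNSERSH DTSSDGGCLD SQMGKNNDRK QTKILDERLD RQSKSLKRGQ  540
NSERIHDTSS DGGYSESDIN GTRDTDQVRR HCHAKKNSKR QNESSEYLAD HKNIENRVYE  600
DTENLHGHTK KRRRHCSKVG YPDDDGGSDI QTHKDNGDQL NRDRDNEKHC SHGGKSSRHH  660
IKESPLDDLG ASKNRTNVGD SLLKQSSRVS DREGRHHDRQ KSLGNLDQVS GLLDDHGKAY  720
GTSDSNSSDH GSERDKRRHH GHRRKRSRHV VECSEFPDAS GHQAKKLKGK RDLKRYRHST  780
HRYETDSSED YEMKHNVKRR SHGHKRHDEK KPNRDS
"""


def clean_sequence(block):
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
print(len(seq))               # compare with the stated length of 816 aa
print(region(seq, 224, 243))  # first zf-CCCH domain
print(region(seq, 353, 380))  # second zf-CCCH domain
```

The `region` helper converts the 1-based, inclusive coordinates used throughout this record into Python's 0-based slices, which is where off-by-one mistakes most often creep in.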
Nuclear Localization Signal (NLS)
No. Start End Sequence
1 736 746 KRRHHGHRRKR
2 743 768 RRKRSRHVVECSEFPDASGHQAKKLK
3 743 770 RRKRSRHVVECSEFPDASGHQAKKLKGK
4 744 769 RRKRSRHVVECSEFPDASGHQAKKLK
5 798 802 KRRSH
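The Start/End coordinates in this table can be checked against the protein sequence by slicing. This is a minimal sketch, assuming 1-based inclusive coordinates and using only residues 721-816 copied from the sequence block of this record; `TAIL`, `NLS_ROWS`, `residues` and `check_rows` are hypothetical names.

```python
# Residues 721-816 of the protein, copied from the sequence block of this record.
TAIL_START = 721
TAIL = (
    "GTSDSNSSDHGSERDKRRHHGHRRKRSRHVVECSEFPDASGHQAKKLKGKRDLKRYRHST"
    "HRYETDSSEDYEMKHNVKRRSHGHKRHDEKKPNRDS"
)

# (No., Start, End, Sequence) rows as listed in the table.
NLS_ROWS = [
    (1, 736, 746, "KRRHHGHRRKR"),
    (2, 743, 768, "RRKRSRHVVECSEFPDASGHQAKKLK"),
    (3, 743, 770, "RRKRSRHVVECSEFPDASGHQAKKLKGK"),
    (4, 744, 769, "RRKRSRHVVECSEFPDASGHQAKKLK"),
    (5, 798, 802, "KRRSH"),
]


def residues(start, end):
    """Return the 1-based, inclusive residue range [start, end] from TAIL."""
    return TAIL[start - TAIL_START:end - TAIL_START + 1]


def check_rows(rows):
    """Map each row number to True if the listed sequence equals the slice."""
    return {no: residues(start, end) == seq for no, start, end, seq in rows}


results = check_rows(NLS_ROWS)
for no, ok in results.items():
    print(no, "matches" if ok else "does not match")
```

With the rows as printed here, rows 1, 2, 3 and 5 agree with the sequence, while the sequence listed for row 4 corresponds to residues 743-768 rather than 744-769.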
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT1G10320.1 1e-162 C3H family protein