PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022741079.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C3H
Protein Properties Length: 1436 aa    MW: 158668 Da    pI: 5.6251
Description C3H family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
XP_022741079.1    genome    NCBI      View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-CCCH  17.2   9e-06    1413   1436  2          26
                      -S---SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                      +++ C+ f+++G+Ck G++C++ H+
  XP_022741079.1 1413 GQRVCK-FYENGYCKKGASCRYWHP 1436
                      6799**.6666***********997 PP
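
The Start and End columns appear to be 1-based, inclusive positions in the protein sequence. As a minimal sketch under that assumption, the following Python converts such coordinates to a slice and checks that residues 1413 to 1436 match the aligned segment once the gap character is removed. The helper name `slice_1based` is hypothetical, and only the last line of the sequence listing (positions 1381 to 1436) is used.

```python
def slice_1based(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start of the protein."""
    lo = start - fragment_start          # 0-based offset of the first residue
    hi = end - fragment_start + 1        # exclusive upper bound of the slice
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]


# Last line of the sequence listing, positions 1381-1436, with spaces removed.
tail = "DSSFVRGRSS VNRQSYSYGG PNRVGSFRPP LKGQRVCKFY ENGYCKKGAS CRYWHP".replace(" ", "")

# Domain coordinates taken from the Signature Domain table.
domain = slice_1based(tail, 1381, 1413, 1436)

# Query row of the alignment, with the single gap character dropped.
aligned = "GQRVCK-FYENGYCKKGASCRYWHP".replace("-", "")

print(domain)
print(domain == aligned)
```

The same helper can be applied to the NLS coordinates, given the matching part of the sequence.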

Sequence
Protein Sequence    Length: 1436 aa
MENCQQVEST TALYKPCLQD NVGGGGCVDG GGNNSDSLNR DSLQGVEFMS VDQCQKIPEM  60
DDSQLVGNAD VEVRGDAGAK AETGALNGVK VVELSGGKRR RGRPPRNQVR TTASSVPPPQ  120
RKKKDEEDVC FICFDGGSLV LCDRRGCPKA YHPACIKRDE AFFKSKAKWN CGWHICSTCQ  180
KSSYYMCYTC TYSLCKNCTK DADYLNVRGN KGFCGMCMRT VVLIENSTLE NKEMVQVDFD  240
DQTSWEYLFK VYWIFLKEKL SLSLDELTKA KNPWKETAIM DPKGESSGEL NNISNARGAN  300
MERSCEDLGA SYSKRRKGMK KEKFLNKVES LEAEKPGVMK GMLLPEGTNW ATKELLEFVA  360
HMKHGDLSVL SHFDVQALLL EYVKRSNLRD PRQKSYIVCD SMLIKLFGKA RVAHFEMLKL  420
LESHFLTQDH SRPINTIRAG GSEAVAQLAI DGSNDSHPLI VNNKRRKTRK KVDEKGQKAN  480
LDEFAAIDVH NMNLIYLKRN LMENLIDDTD KFNDKVVGSF VRIKISGSDQ KQDMYRLVQV  540
IGTSKVVEPY KIGERTTDIM LEILNLDKKE IVSIDGISNQ EFAEDECKRL RERIKCGLMK  600
WFTVGEIQEK AVALQAVRVN DWLESEILRV KNLRDRASEK GHMKEFRECI EKLQLLNSPD  660
ERQRRLHEIP DVHSDRNMNL YCKSEEVAGE LDEKKKDNHM KSRNSGSGIK EKEPASPLKG  720
GDILSDIGSR ETSIPLHSTG MELSVNNSET DKIWHYQDPL GKIQGPFSMA TLHRWSLSRH  780
FPPDLRIWRV SEKQKDSILL TDVLDGQYNQ AQQLFHNSCV RTEDMMIVSN DGCQNGDGDV  840
RDSMDLNVTQ MESKQVEGNT MQNDTNGHCY VNKEFVKSKE LALQSSPSTA PVEIVSNAIQ  900
AGSPLPHWDS VNGENDFPCQ PQVSSSLPLS TLSGKPFETQ SHQFSKGHGA ERWDCGLMNI  960
NENLNKTPES QIIAGNVEHD DCEGRSGKSC GQSWRSTPLN DASNGWDSNS GLISLARALE  1020
ASEHNQDIDF PDIPTSTSKL NLEDSKSQAT ENKQSLSSNV PHQDSGPSWS TASSLVGNGP  1080
QLPEIAGEWG GCSSTQVKPS AEEWDSNLVP ESSLKPTVLG SDHVATPTSG SGQLTHFSPT  1140
DPANNASGWD SIVPEPNDSS WGDESVSDLL AEVEAMESLN GLSSPTSILH SDGELAQGSD  1200
PDCFSPVGGL SPAPDPGKSD SLSSTNDLQM PSESTVTNKP FGVSQSQVLD AKKSSGGHSS  1260
TSAEMDEDKR PSDVSVNQNE AGSDMQPPAS TVTTWGMATV DTAWRAGPET TGTNWGTVQG  1320
NANFNWGCLG QETTSVNWGT GQGTFQENGS INSGTSAANP PYWGGQQRYA GQRERDFQGR  1380
DSSFVRGRSS VNRQSYSYGG PNRVGSFRPP LKGQRVCKFY ENGYCKKGAS CRYWHP
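
The listing above groups residues in blocks of ten and appends a running position counter to each full line. As a rough sketch, assuming that layout, the Python below strips the spaces and counters to recover a plain residue string. The function name `parse_numbered_sequence` is hypothetical, and only the first and last lines of the listing are used as sample input.

```python
def parse_numbered_sequence(lines):
    """Join residue blocks from a numbered listing, dropping position counters."""
    blocks = []
    for line in lines:
        for token in line.split():
            if token.isalpha():
                blocks.append(token.upper())   # a block of residues
            elif not token.isdigit():          # digits are position counters
                raise ValueError(f"unexpected token: {token!r}")
    return "".join(blocks)


# First and last lines of the listing, copied as shown.
first_line = "MENCQQVEST TALYKPCLQD NVGGGGCVDG GGNNSDSLNR DSLQGVEFMS VDQCQKIPEM  60"
last_line = "DSSFVRGRSS VNRQSYSYGG PNRVGSFRPP LKGQRVCKFY ENGYCKKGAS CRYWHP"

head = parse_numbered_sequence([first_line])
tail = parse_numbered_sequence([last_line])

print(len(head))   # residues on the first line
print(len(tail))   # residues on the last line
```

Passing all lines of the listing in order should give a string whose length can be compared with the stated 1436 aa.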
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    98     124  KRRRGRPPRNQVRTTASSVPPPQRKKK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G51120.1  0.0      C3H family protein