PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_024960401.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family HB-PHD
Protein Properties Length: 724aa    MW: 81364.6 Da    PI: 7.8109
Description HB-PHD family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_024960401.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox31.92.2e-10491535953
                     HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
        Homeobox   9 keqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                     +e +e L  +F+kn  ps++ +e+L+k+lgL+ ++V  WF+N R+
  XP_024960401.1 491 SEAVEKLRLVFAKNELPSRAVKEDLSKQLGLDLEKVNKWFKNARY 535
                     678999*************************************97 PP

2PHD29.81.4e-10220273250
                     BTTTSSTCT..TSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHT CS
             PHD   2 CkvCgksde..egelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCke 50 
                     C+ C+  d+  +++++lCd +C+ +fH+ C++++l +e++p g   w+C+ C +
  XP_024960401.1 220 CAKCKLRDAfpDNDIILCDgTCNCAFHQMCIDPPLLTENIPPGdqGWFCKYCIC 273
                     88899844445********56********************9999*******87 PP

Sequence ? help Back to Top
Protein Sequence    Length: 724 aa     Download sequence    
MTGAETKEAP ENIGKLVHLQ SSTPKSYAKR SRKLKPHGNT LSSPMSKKKL VDLLTNSKKK  60
EFKRRNASNK ASKSLSKPSL VSCQNQKKQL AATGSNENEK CEKGDPTSEK FKQRRKRKRT  120
NNKVEVDEAS RLQRRTRYLL IKMKIEQNLL DAYSTEGWKG QSREKIKPEK ELQRAKKQIL  180
KCKLGIRDAL RQLDLLSSDG CIDESVIAPD GSVHHEHIHC AKCKLRDAFP DNDIILCDGT  240
CNCAFHQMCI DPPLLTENIP PGDQGWFCKY CICKTEIMEA MNAHLGTSYP HDSNWQEIFK  300
VEATLPDGGN TLLNQEEWPS DDSGDDDYDP DRVEKRDSSC SRVCSEGESC DDDASSSYSL  360
QSLDVKALDD ESQKLDMGLE SISADLIGAV SGSGSDCEFV SGRRQRQAVD YRKLYDEMFG  420
KDALANEQAS EDEDWGPTNR KRREKESDAA STLMTLCETE EKSVKDVPDT SKVDTNLSCK  480
ETKRSFFRIP SEAVEKLRLV FAKNELPSRA VKEDLSKQLG LDLEKVNKWF KNARYLSLKT  540
KRAGEENPTQ NDGISISKES GSEPAKNEAI DEILSEDMPT TLAHTPGNGH MKKFRRRKNP  600
QSPTSTAKQQ QVEREFNLTT STNKVDGNED LGDDDLSLKM LRENVKKEKI RAVDIGGSSE  660
GDDQQAVAAA ESQMEKLCFL KTKMEKLKQV LLLRTPNRRA KTTATSIDHT NTIFVPVAHL  720
KEKH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12963KRSRKLKPHGNTLSSPMSKKKLVDLLTNSKKKEFK
2114119RRKRKR
3114144RRKRKRTNNKVEVDEASRLQRRTRYLLIKMK
4115145RRKRKRTNNKVEVDEASRLQRRTRYLLIKMK
5116146RRKRKRTNNKVEVDEASRLQRRTRYLLIKMK
6440445RKRREK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G29940.21e-156HB-PHD family protein