PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0001422.1_g110.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family HB-other
Protein Properties Length: 1074aa    MW: 117863 Da    PI: 4.5776
Description HB-other family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0001422.1_g110.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox406.8e-13909952952
                                HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHH CS
                   Homeobox   9 keqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrR 52 
                                +  +++L + F++n+yp+++++e LA++lgL  +qV+ WF N R
  Pav_sc0001422.1_g110.1.mk 909 EAVTQRLCKSFKENHYPDRSMKESLARELGLMAKQVSKWFENAR 952
                                667899************************************99 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1074 aa     Download sequence    
MADAAQLDIP PECVSSQTAK CQEESTSGQI HEIGSESQCS EKTKENIGCK VVQNELLEIC  60
KASNNADEQS QSFSENLTEN SHVENLGQPA EDVNKSSQGG AQNVTKNSLT EQLEMPHEDP  120
DVNNQTDKTS CSGQMSLEQT NDSGFGTSSS EPAEEKHPSG TFCVQNGLLQ TIMPLPICGG  180
SEQLQLISEN VNMASLNDQA GLPPEDVSKT CQTEKISCSL QITLQQINEF GSGSVPSETA  240
KQRDQLDSVP AQNDEVKTSK AVSSSIVFEQ PGPSIEAMTE DSPIGHSEPP LEDASKSLSD  300
KEMGPLPEDV TQNSSLQQSE TPLKNALKIS SCLGPKDKKN PKSRKRKYMS RSFVRSDRVL  360
RSRTGEKEKP KDLKLSNNVP TLESSNSIAN VSNGEEKKRK KRKSRRDNRA IADEFSRIRT  420
HLRYLLNRIG YEKSLIDAYS GEGWKGSSLE KLKPEKELQR ATSEILRRKL KIRDLFQRLE  480
SLCAEGMFPE SLFDSEGQID SEDIFCAKCG SKDVSLDNDI ILCDGACDRG FHQFCLEPPL  540
LSEDSNDQST PLFFAIYDSS VAYSFVWLAV PPDDEGWLCP GCDCKVDCID LLNDSQGTDL  600
SVTDSWEKVF PEAAAAASAG ENQDNHGLPS DDSDDNDYDP DGPETDNEIQ GEESSSDESE  660
YASASDGLET PNSNDEQYLG LPSEDSEDDD YNPYDPDVNE DVKQESSSSD FTSDSEDLGA  720
ALDDNIMSYE DVEGPKSMSL DDSKPHRGSG EQSSRHGQKK HSLQDELISL LESGPGQGES  780
APPSGKRHIE RLDYKRLHDE AYGNVPTDSS DDEDWNDIAT QRKRKKGTGQ VANRSPNGKT  840
SNIKNGVITK DIKPDVDENE NTPRRTPHRK SNVEDTSNLS NKSPKGSTKS GSTSGRAGSS  900
RSTYSRLGEA VTQRLCKSFK ENHYPDRSMK ESLARELGLM AKQVSKWFEN ARHCLKVGVD  960
KSASENCAPS PQTNSKPLEQ GDAIVGGSDH NGAQNKELHR TDDPMIGCCS RDVKDSELAT  1020
PGSSRSKLST PNNRKRKRRS DDPDPKTETP TPPAEPETNR KPSRVMTRRR KSVS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1344371RKRKYMSRSFVRSDRVLRSRTGEKEKPK
2397401KKRKK
3397403KKRKKRK
4398406KRKKRKSRR
5822826KKRKK
6822826RKRKK
710341039RKRKRR
810671072TRRRKS
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G19510.14e-91HB-PHD family protein