PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_40124_BGI-A2_v1.0
Common Name F383_11244
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 760 aa    MW: 82993 Da    pI: 6.1115
Description HD-ZIP family protein
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_40124_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  59.8   4.2e-19  95     150  1          56
                                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                 +++ +++t++q++++e++F+++++p+ ++r+eL + lgL+  qVk+WFqN+R+++k
  Cotton_A_40124_BGI-A2_v1.0  95 KKRYHRHTQHQIQQMEAFFKECPHPDDKQRKELGRVLGLEPLQVKFWFQNKRTQMK 150
                                 688999***********************************************999 PP

2    START     209.5  1.2e-65  278    498  1          206
                                 HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-T CS
                       START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWde 74 
                                 ela +a++elv++a+ +ep+W++s     +  n++e++++f+++ +     ++ ea++++ vv+m++  l e+l+d+  qW++
  Cotton_A_40124_BGI-A2_v1.0 278 ELAVAAMEELVRLAQMGEPLWMTSLdgstTVFNEEEYIRTFPRGIGpkptgFKCEASKETCVVIMNHISLIEILMDVQ-QWST 359
                                 57899********************88888889**********999********************************.**** PP

                                 T-S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-E CS
                       START  75 tla....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRae 146
                                  +     ka+tl+v+s+g      galq+m+ae+q++splvp R++++vRy++ + +g+w++vdvS+d+ +++p     +R++
  Cotton_A_40124_BGI-A2_v1.0 360 VFSaivsKASTLDVLSTGiagnynGALQVMTAEFQVPSPLVPtRESYYVRYCKHHAEGTWAVVDVSLDNIRPNPA----MRCR 438
                                 **************************************************************************5....**** PP

                                 ESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                       START 147 llpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                                 ++pSg+li++++ng+skvtw+ehv++++r +h+l+++lv+sg+a+gak+w atl+rqce+
  Cotton_A_40124_BGI-A2_v1.0 439 RRPSGCLIQEMPNGYSKVTWIEHVEVEDRGVHNLYKQLVSSGRAFGAKRWIATLDRQCER 498
                                 **********************************************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   1.4E-21    77     150  IPR009057    Homeodomain-like
SuperFamily      SSF46689           2.26E-18   83     152  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.196     92     152  IPR001356    Homeobox domain
SMART            SM00389            6.7E-18    93     156  IPR001356    Homeobox domain
Pfam             PF00046            9.2E-17    95     150  IPR001356    Homeobox domain
CDD              cd00086            1.59E-17   95     153  No hit       No description
PROSITE profile  PS50848            44.353     269    501  IPR002913    START domain
SuperFamily      SSF55961           4.3E-34    270    500  No hit       No description
CDD              cd08875            1.37E-124  273    497  No hit       No description
SMART            SM00234            7.2E-59    278    498  IPR002913    START domain
Pfam             PF01852            6.6E-55    279    498  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  6.7E-5     374    466  IPR023393    START-like domain
SuperFamily      SSF55961           9.42E-24   518    752  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 760 aa
MPAGVMVPAR NMPSMISGNG NIGAFGTISG LSLGQQPNNN MMEGQLHPFE MTQNTSESEI  60
ARMRDEEFDS ALKSGSENHE GVSGEDDQDP RPNKKKRYHR HTQHQIQQME AFFKECPHPD  120
DKQRKELGRV LGLEPLQVKF WFQNKRTQMK TQHERQENSQ LRAENEKLRA DNMRYREALS  180
TASCPNCGGP TAVGQMSFDE HHLRLENARL REEIDRISAI AAKYVGKPVV SYPLLSSPMT  240
PRPLDFGSQT GSGEMYGTGE LLRSINAPAE ADKPMIIELA VAAMEELVRL AQMGEPLWMT  300
SLDGSTTVFN EEEYIRTFPR GIGPKPTGFK CEASKETCVV IMNHISLIEI LMDVQQWSTV  360
FSAIVSKAST LDVLSTGIAG NYNGALQVMT AEFQVPSPLV PTRESYYVRY CKHHAEGTWA  420
VVDVSLDNIR PNPAMRCRRR PSGCLIQEMP NGYSKVTWIE HVEVEDRGVH NLYKQLVSSG  480
RAFGAKRWIA TLDRQCERLA SLMATNIPTG DAGVITNQDG RKSMLKLAER MVISFCAGVG  540
ASTAHTWTTL SGTGADDVRV MTRKSVDDPG RPPGIVLSAA TSFWLPVSPK RVFDFLRDEN  600
SRNEWDILSN GGVVQEMAHI ANGRDTGNCV SLLRVNSANS SQTNMLILQE SCTDPTASFV  660
IYAPVDIVAM NVVLNGGDPD YVALLPSGFA ILPDGSSGSS GSGVADAGGS SGGSLLTVAF  720
QILVDSVPTA KLSLGSVATV NNLIACTVER IKASLTCENA
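The Start and End columns reported for each signature domain appear to be 1-based, inclusive positions on this protein sequence. The sketch below, under that assumption, strips the spacing and running residue counts from the first three lines of the sequence block and slices out the Homeobox region (95 to 150). The helper names clean_sequence and domain_slice are hypothetical and are not part of PlantTFDB or any library.

```python
import re

# First three lines of the protein sequence block (residues 1-180),
# copied with their trailing running residue counts.
SEQUENCE_BLOCK = """\
MPAGVMVPAR NMPSMISGNG NIGAFGTISG LSLGQQPNNN MMEGQLHPFE MTQNTSESEI  60
ARMRDEEFDS ALKSGSENHE GVSGEDDQDP RPNKKKRYHR HTQHQIQQME AFFKECPHPD  120
DKQRKELGRV LGLEPLQVKF WFQNKRTQMK TQHERQENSQ LRAENEKLRA DNMRYREALS  180
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and running counts, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


seq = clean_sequence(SEQUENCE_BLOCK)
homeobox = domain_slice(seq, 95, 150)
print(len(seq), len(homeobox))
print(homeobox)
```

The extracted 56-residue string can be compared by eye with the query line of the Homeobox alignment, which is a quick way to confirm that the coordinate convention assumed here matches the record.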
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017616160.1      0.0      PREDICTED: homeobox-leucine zipper protein HDG2-like isoform X1
Swissprot  Q94C37              0.0      HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBL     A0A0B0Q126          0.0      A0A0B0Q126_GOSAR; Homeobox-leucine zipper protein HDG2
STRING     Gorai.008G172800.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G05230.4  0.0      homeodomain GLABROUS 2