PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_38798_BGI-A2_v1.0
Common Name F383_25253
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 739 aa    MW: 81647.2 Da    pI: 6.7521
Description HD-ZIP family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Cotton_A_38798_BGI-A2_v1.0 | genome | BGI | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | Homeobox | 61.1 | 1.7e-19 | 57 | 110 | 3 | 56
                                 --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                    Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                 k ++++++q++eLe++F+++++p +++r+eL+++lgL+ +q+k+WFqNrR+++k
  Cotton_A_38798_BGI-A2_v1.0  57 KFHRHNPHQIHELESFFKECPHPEEKQRRELSRRLGLESKQIKFWFQNRRTQMK 110
                                 556899*********************************************999 PP
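
The middle row of the Homeobox alignment above can be tallied to summarise how closely the query follows the profile. The sketch below assumes the HMMER-style convention in which a letter on that row marks a residue identical to the profile consensus, `+` marks a positively scoring substitution, and a blank marks any other column. The function name `tally_midline` is illustrative and is not part of PlantTFDB.

```python
from collections import Counter


def tally_midline(midline: str) -> dict:
    """Count identical, positive and other columns in an alignment midline."""
    counts = Counter()
    for ch in midline:
        if ch.isalpha():
            counts["identical"] += 1   # letter: query matches the consensus
        elif ch == "+":
            counts["positive"] += 1    # '+': positively scoring substitution
        else:
            counts["other"] += 1       # blank: neither of the above
    return {key: counts[key] for key in ("identical", "positive", "other")}


# Midline of the Homeobox alignment (query residues 57-110), copied from the record.
HOMEOBOX_MIDLINE = "k ++++++q++eLe++F+++++p +++r+eL+++lgL+ +q+k+WFqNrR+++k"

if __name__ == "__main__":
    result = tally_midline(HOMEOBOX_MIDLINE)
    identity = result["identical"] / len(HOMEOBOX_MIDLINE)
    print(result, f"identity to consensus = {identity:.2f}")
```

The same function can be applied to each midline segment of the START alignment, summing the per-segment counts.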

2 | START | 167.6 | 8.3e-53 | 260 | 479 | 2 | 203
                                 HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT CS
                       START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdet 75 
                                 +a++a++el+k+a+ ++p+W k      e +n +e+ ++f++  +     +++ea r++++v+     lv +l+d + +W e+
  Cotton_A_38798_BGI-A2_v1.0 260 VALAAMDELIKMAQMGSPLWIKGFgdgmETLNLEEYKRTFSSFIGmkpsgFTTEATRETAMVPLRGLALVDTLMDAN-CWAEM 341
                                 7899*******************9999999999999999977555999*****************************.***** PP

                                 -S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EE CS
                       START  76 la....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRael 147
                                 ++    +a t++v+ssg      +alqlm ae+q+lsplvp R + f+R+++q+++ +w+ivdvS++  +  +    ++ +++
  Cotton_A_38798_BGI-A2_v1.0 342 FPcmisRAVTIDVLSSGkgvtrhNALQLMEAEFQVLSPLVPiRQVQFIRFCKQHSDSVWAIVDVSINLSNAAN-ALMFANCRR 423
                                 ***********************************************************************99.9******** PP

                                 SSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXX CS
                       START 148 lpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrq 203
                                 lpSg++i++++n +skvtwveh +++++++h llr+l++sg  +gak+w atl+rq
  Cotton_A_38798_BGI-A2_v1.0 424 LPSGCVIQDMDNKYSKVTWVEHSEYDESTVHHLLRPLLGSGFGFGAKRWIATLRRQ 479
                                 *****************************************************997 PP

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
Gene3D | G3DSA:1.10.10.60 | 1.0E-19 | 41 | 106 | IPR009057 | Homeodomain-like
SuperFamily | SSF46689 | 6.68E-20 | 41 | 112 | IPR009057 | Homeodomain-like
PROSITE profile | PS50071 | 17.621 | 52 | 112 | IPR001356 | Homeobox domain
SMART | SM00389 | 2.1E-16 | 53 | 116 | IPR001356 | Homeobox domain
Pfam | PF00046 | 5.6E-17 | 57 | 110 | IPR001356 | Homeobox domain
CDD | cd00086 | 1.86E-17 | 59 | 112 | No hit | No description
PROSITE pattern | PS00027 | 0 | 87 | 110 | IPR017970 | Homeobox, conserved site
PROSITE profile | PS50848 | 37.198 | 250 | 485 | IPR002913 | START domain
SuperFamily | SSF55961 | 3.3E-29 | 251 | 480 | No hit | No description
CDD | cd08875 | 1.03E-108 | 254 | 479 | No hit | No description
SMART | SM00234 | 4.2E-33 | 259 | 482 | IPR002913 | START domain
Pfam | PF01852 | 1.3E-44 | 260 | 479 | IPR002913 | START domain
SuperFamily | SSF55961 | 2.47E-15 | 522 | 733 | No hit | No description
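
Several databases report overlapping hits for the same stretch of the protein, so it can help to collapse the table into distinct regions. The following is a minimal sketch that merges the Start/End pairs listed above, treating them as 1-based inclusive coordinates (an assumption about the table's convention). The names `merge_hits` and `HITS` are illustrative only.

```python
def merge_hits(hits):
    """Merge overlapping (start, end) intervals into distinct regions."""
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Start/End coordinates copied from the Protein Features table.
HITS = [
    (41, 106), (41, 112), (52, 112), (53, 116), (57, 110), (59, 112), (87, 110),
    (250, 485), (251, 480), (254, 479), (259, 482), (260, 479),
    (522, 733),
]

if __name__ == "__main__":
    for start, end in merge_hits(HITS):
        print(f"region {start}-{end} ({end - start + 1} aa)")
```

Under these assumptions the hits collapse into three regions: the homeodomain hits, the START domain hits, and the single C-terminal SuperFamily hit.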
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0005634 | Cellular Component | nucleus
GO:0008289 | Molecular Function | lipid binding
GO:0043565 | Molecular Function | sequence-specific DNA binding
Sequence
Protein Sequence    Length: 739 aa
MASHGELRLI GENYDPGFIG MMKEDDGYGS SDDFEGALGN DQDTADNGRP PRKKKKKFHR  60
HNPHQIHELE SFFKECPHPE EKQRRELSRR LGLESKQIKF WFQNRRTQMK TQLERHENVI  120
LKQENDKLRA ENDLLRQAIA SAICNNCGVP AVPDEISYEP SQLMMENSRL KDELNRARAL  180
TNKFLGRHLS SSSANPSPSP SQGLNSNVEV VVRRTGFCGL NNGSTSLPMG FEFGHGATMP  240
LMNPSFAYEM PYDKSALVDV ALAAMDELIK MAQMGSPLWI KGFGDGMETL NLEEYKRTFS  300
SFIGMKPSGF TTEATRETAM VPLRGLALVD TLMDANCWAE MFPCMISRAV TIDVLSSGKG  360
VTRHNALQLM EAEFQVLSPL VPIRQVQFIR FCKQHSDSVW AIVDVSINLS NAANALMFAN  420
CRRLPSGCVI QDMDNKYSKV TWVEHSEYDE STVHHLLRPL LGSGFGFGAK RWIATLRRQY  480
SSLALLMSPD IHGEDINTVG KKSMLKLAQR MAYNFSAGIG ASSVNKWDKL NVGNVGEDVR  540
VMTRKNVNDP GEPLGIVLSA ATSVWMPITQ QTLFNLLRNE RMRNQWDILS SGRPMQAMYS  600
VAKGPGQGNC VSILRGAAVN GSDTNMLILQ ETWSDDCGAL IVYAPVDASS IRVVMNGGDS  660
SHVALLPSGF AILPGVQTDG PSMQPDIDEN TSDGCILTVG FQILVNSVPT AKLTVESVET  720
VNHLLTCTVE KIKAALSVT
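
To work with the listing above programmatically, the position counters and spacing have to be stripped first. The sketch below does that and then slices out a region by 1-based inclusive coordinates, as used in the Signature Domain table. The helper names `parse_sequence_block` and `extract_region` are hypothetical, and only the first two lines of the sequence are embedded to keep the example short.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Remove position counters and whitespace from a numbered sequence listing."""
    return re.sub(r"[^A-Za-z]", "", block)


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


# First two lines of the protein sequence, copied from the record (residues 1-120).
FIRST_TWO_LINES = """
MASHGELRLI GENYDPGFIG MMKEDDGYGS SDDFEGALGN DQDTADNGRP PRKKKKKFHR  60
HNPHQIHELE SFFKECPHPE EKQRRELSRR LGLESKQIKF WFQNRRTQMK TQLERHENVI  120
"""

if __name__ == "__main__":
    seq = parse_sequence_block(FIRST_TWO_LINES)
    # Homeobox domain coordinates (57-110) from the Signature Domain table.
    print(len(seq), extract_region(seq, 57, 110))
```

Slicing residues 57-110 this way reproduces the query row of the Homeobox alignment, which is a quick consistency check between the sequence and the domain coordinates.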
Functional Description
Source: UniProt
Description: Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | AY338495 | 0.0 | AY338495.1 Gossypium hirsutum homeodomain protein BNLGHi6313 (bnlghi6313) mRNA, complete cds.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_017631789.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Refseq | XP_017631791.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
Refseq | XP_017631792.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
Refseq | XP_017631793.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
Swissprot | Q0WV12 | 0.0 | ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL | A0A0B0P3V7 | 0.0 | A0A0B0P3V7_GOSAR; Homeobox-leucine zipper ANTHOCYANINLESS 2-like protein
STRING | Gorai.006G047300.1 | 0.0 | (Gossypium raimondii)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
Malvids | OGEM1128 | 27 | 105
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT4G00730.1 | 0.0 | HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]