PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.002G155800.1
Common Name B456_002G155800, LOC105785443
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 322 aa    MW: 35810.2 Da    pI: 7.6791
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type      Source    Coding Sequence
Gorai.002G155800.1    genome    JGI       View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     54.7   1.7e-17  162    216  2          56
                         T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         rk+ +++keq   Lee F+++++++ +++  LAk+l+L  rqV vWFqNrRa+ k
  Gorai.002G155800.1 162 RKKLRLSKEQSAFLEESFKEQNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTK 216
                         778899***********************************************98 PP

2    HD-ZIP_I/II  123.5  1e-39    162    249  1          88
         HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLr 88 
                         +kk+rlskeq+++LEesF+e+++L+p++K +la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+ke +eLr
  Gorai.002G155800.1 162 RKKLRLSKEQSAFLEESFKEQNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRSCETLREENKRLQKELQELR 249
                         69*************************************************************************************9 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Pfam             PF04618           1.9E-9    27     129  IPR006712    HD-ZIP protein, N-terminal
Gene3D           G3DSA:1.10.10.60  6.9E-18   143    219  IPR009057    Homeodomain-like
SuperFamily      SSF46689          4.14E-18  155    219  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.2      158    218  IPR001356    Homeobox domain
SMART            SM00389           2.0E-15   160    222  IPR001356    Homeobox domain
Pfam             PF00046           5.8E-15   162    216  IPR001356    Homeobox domain
CDD              cd00086           1.81E-15  162    219  No hit       No description
PROSITE pattern  PS00027           0         193    216  IPR017970    Homeobox, conserved site
CDD              cd14686           0.00551   211    249  No hit       No description
Pfam             PF02183           1.4E-9    218    250  IPR003106    Leucine zipper, homeobox-associated
SMART            SM00340           5.7E-20   218    261  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 322 aa
MELALSLGDP SKQFSLLGNT PKLSRSKDLG FCMGLGDGFK SQHKIDAFEA QSKGGDSDEK  60
RVPSDLPLYR HLLPSSQTQL RIPCLTHKYG DGRGLDVNQL PPAAAAEEDD EESEEGAGIS  120
SPNSTVSSFR MDFGIRNGKN KGKRDLEVER VSDDDDENGS TRKKLRLSKE QSAFLEESFK  180
EQNTLNPKQK LALAKQLNLR PRQVEVWFQN RRARTKLKQT EVDCEYLKRS CETLREENKR  240
LQKELQELRV LKTCQPFYMQ SPATTLTMCP SCERLATKGS AATAATGYPP FFSSANTCDP  300
ETSPSPPGNF FKSLDVTSEK T*
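
The blocked display above (groups of ten residues with a running position at the end of each line) can be flattened into a plain string before further use. The following is a minimal Python sketch and not part of the PlantTFDB record; the helper names `parse_blocked_sequence` and `domain_slice` are hypothetical. It assumes the Start/End values in the Signature Domain table are 1-based and inclusive, which agrees with the alignment rows shown in that section.

```python
import re

# Protein sequence copied from the record, in its blocked display form.
RAW = """
MELALSLGDP SKQFSLLGNT PKLSRSKDLG FCMGLGDGFK SQHKIDAFEA QSKGGDSDEK  60
RVPSDLPLYR HLLPSSQTQL RIPCLTHKYG DGRGLDVNQL PPAAAAEEDD EESEEGAGIS  120
SPNSTVSSFR MDFGIRNGKN KGKRDLEVER VSDDDDENGS TRKKLRLSKE QSAFLEESFK  180
EQNTLNPKQK LALAKQLNLR PRQVEVWFQN RRARTKLKQT EVDCEYLKRS CETLREENKR  240
LQKELQELRV LKTCQPFYMQ SPATTLTMCP SCERLATKGS AATAATGYPP FFSSANTCDP  300
ETSPSPPGNF FKSLDVTSEK T*
"""

def parse_blocked_sequence(raw: str) -> str:
    """Drop spaces, digits and newlines; keep residue letters and the '*' stop."""
    return re.sub(r"[^A-Za-z*]", "", raw).upper()

def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

seq = parse_blocked_sequence(RAW)
residues = seq.rstrip("*")  # sequence without the trailing stop symbol

# Coordinates taken from the Signature Domain table of this record.
homeobox = domain_slice(residues, 162, 216)
hd_zip = domain_slice(residues, 162, 249)

print(len(seq), len(residues))
print(homeobox)
print(hd_zip)
```

The flattened string has 322 characters including the trailing `*`, so the reported length of 322 appears to count the stop symbol.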
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    160    166  TRKKLRL
2    210    218  RRARTKLKQ
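
The Start/End values in the NLS table can be compared against the protein sequence of this record. Below is a minimal Python sketch; the helper `locate` is hypothetical and not a PlantTFDB API. In this record the listed values line up with 0-based inclusive offsets of each motif, whereas the Signature Domain alignments use 1-based positions. This is an observation from the data shown here, not documented database behavior.

```python
# Protein sequence copied from the record, in its blocked display form.
RAW = """
MELALSLGDP SKQFSLLGNT PKLSRSKDLG FCMGLGDGFK SQHKIDAFEA QSKGGDSDEK  60
RVPSDLPLYR HLLPSSQTQL RIPCLTHKYG DGRGLDVNQL PPAAAAEEDD EESEEGAGIS  120
SPNSTVSSFR MDFGIRNGKN KGKRDLEVER VSDDDDENGS TRKKLRLSKE QSAFLEESFK  180
EQNTLNPKQK LALAKQLNLR PRQVEVWFQN RRARTKLKQT EVDCEYLKRS CETLREENKR  240
LQKELQELRV LKTCQPFYMQ SPATTLTMCP SCERLATKGS AATAATGYPP FFSSANTCDP  300
ETSPSPPGNF FKSLDVTSEK T*
"""

# Keep letters only: this drops spaces, position numbers and the '*' stop.
SEQ = "".join(ch for ch in RAW if ch.isalpha())

# (No., Start, End, Sequence) rows as listed in the NLS table.
NLS_ROWS = [
    (1, 160, 166, "TRKKLRL"),
    (2, 210, 218, "RRARTKLKQ"),
]

def locate(seq, motif):
    """Return (start, end) of the first match as 0-based inclusive offsets, or None."""
    i = seq.find(motif)
    return None if i < 0 else (i, i + len(motif) - 1)

for no, start, end, motif in NLS_ROWS:
    # Print the listed coordinates next to the offsets found in the sequence.
    print(no, (start, end), locate(SEQ, motif))
```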
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX590975  1e-51    JX590975.1 Gossypium hirsutum clone NBRI_GE25995 microsatellite sequence.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_012466991.1      0.0      PREDICTED: homeobox-leucine zipper protein HOX11-like isoform X1
Swissprot  P46665              2e-90    HAT14_ARATH; Homeobox-leucine zipper protein HAT14
TrEMBL     A0A0D2MBK3          0.0      A0A0D2MBK3_GOSRA; Uncharacterized protein
STRING     Gorai.002G155800.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4928              28           52
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G06710.1  2e-74    homeobox from Arabidopsis thaliana
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]