PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.007G235200.1
Common Name B456_007G235200, LOC105804177
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 302aa    MW: 34081.9 Da    PI: 5.5007
Description HD-ZIP family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.007G235200.1  genome  JGI     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     58.1   1.5e-18  52     105  3          56
                         --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         k+++++ +q+++Le+ Fe +++++  ++ +LA++lgL  rqV vWFqNrRa++k
  Gorai.007G235200.1  52 KKRRLNVDQVKALEKDFEVENKLDPGRKLKLAQQLGLRPRQVAVWFQNRRARWK 105
                         5568999**********************************************9 PP

2    HD-ZIP_I/II  123.7  9e-40    51     143  1          93
         HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                         ekkrrl+ +qvk+LE+ Fe e+kL+p rK +la++Lgl+prqvavWFqnrRAR+ktkqlEkdy  Lk++y++lk + ++L+++++ L +++
  Gorai.007G235200.1  51 EKKRRLNVDQVKALEKDFEVENKLDPGRKLKLAQQLGLRPRQVAVWFQNRRARWKTKQLEKDYGLLKNRYETLKLNYDSLQHDNQVLLKQI 141
                         69*************************************************************************************9999 PP

         HD-ZIP_I/II  92 ke 93 
                         +e
  Gorai.007G235200.1 142 EE 143
                         86 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50071           16.552    47     107  IPR001356    Homeobox domain
SuperFamily      SSF46689          2.65E-18  50     109  IPR009057    Homeodomain-like
SMART            SM00389           3.3E-17   50     111  IPR001356    Homeobox domain
Pfam             PF00046           7.6E-16   52     105  IPR001356    Homeobox domain
CDD              cd00086           2.58E-14  52     108  No hit       No description
Gene3D           G3DSA:1.10.10.60  2.4E-19   54     113  IPR009057    Homeodomain-like
PRINTS           PR00031           1.5E-5    78     87   IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         82     105  IPR017970    Homeobox, conserved site
PRINTS           PR00031           1.5E-5    87     103  IPR000047    Helix-turn-helix motif
Pfam             PF02183           3.7E-14   107    148  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 302 aa
MKRVGSSHSL GAMMSICPIS DDNQIYSREF QSILDGLDEE EGVEESGYVA EKKRRLNVDQ  60
VKALEKDFEV ENKLDPGRKL KLAQQLGLRP RQVAVWFQNR RARWKTKQLE KDYGLLKNRY  120
ETLKLNYDSL QHDNQVLLKQ IEEVKAKLNG KNNVSVKEEV NVTKTANRTL EQSEAPVEVK  180
YESLKNNSKG SNGAILFLDL KDGSSDSDSS AVLNEDNNNG SNYVGGSSSG ILQSQHVWMS  240
PTTASSLNFN SSSSSSSMKC FQPQQFVKME EQNFFSADEA CKFFSDEEAP SLHWYCPEHW  300
N*
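
The sequence block above is laid out in groups of ten residues with a running position count at the end of each line. The following is a minimal sketch, not part of the PlantTFDB record, of how such a block could be flattened into a single string and how the 1-based Start/End coordinates from the Signature Domain table could be used to cut out a domain. The function names parse_sequence and slice_1based are hypothetical helpers introduced only for this illustration.

```python
import re

# Sequence block copied verbatim from this record.
RAW_BLOCK = """
MKRVGSSHSL GAMMSICPIS DDNQIYSREF QSILDGLDEE EGVEESGYVA EKKRRLNVDQ  60
VKALEKDFEV ENKLDPGRKL KLAQQLGLRP RQVAVWFQNR RARWKTKQLE KDYGLLKNRY  120
ETLKLNYDSL QHDNQVLLKQ IEEVKAKLNG KNNVSVKEEV NVTKTANRTL EQSEAPVEVK  180
YESLKNNSKG SNGAILFLDL KDGSSDSDSS AVLNEDNNNG SNYVGGSSSG ILQSQHVWMS  240
PTTASSLNFN SSSSSSSMKC FQPQQFVKME EQNFFSADEA CKFFSDEEAP SLHWYCPEHW  300
N*
"""


def parse_sequence(block: str) -> str:
    """Hypothetical helper: drop whitespace and position counters,
    keeping only residue letters and the '*' stop symbol."""
    return re.sub(r"[^A-Za-z*]", "", block).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Hypothetical helper: return seq[start..end] using 1-based,
    inclusive coordinates, as in the domain table."""
    return seq[start - 1:end]


sequence = parse_sequence(RAW_BLOCK)   # residues plus trailing '*'
residues = sequence.rstrip("*")        # residues only

# Homeobox domain, Start 52 and End 105 in the Signature Domain table.
homeobox = slice_1based(residues, 52, 105)

print(len(sequence), len(residues))
print(homeobox)
```

With the trailing stop symbol included the string has 302 characters, which matches the stated length; without it there are 301 residues. The slice for positions 52 to 105 reproduces the target line of the Homeobox alignment shown in the Signature Domain section.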
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    99     107  RRARWKTKQ
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_012492121.1      0.0      PREDICTED: homeobox-leucine zipper protein ATHB-6-like
Refseq  XP_012492122.1      0.0      PREDICTED: homeobox-leucine zipper protein ATHB-6-like
TrEMBL  A0A0D2PE17          0.0      A0A0D2PE17_GOSRA; Uncharacterized protein
STRING  Gorai.007G235200.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM54528143
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G40060.1  1e-56    homeobox protein 16
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]