PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_30465_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 189aa    MW: 21481.1 Da    PI: 9.7877
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_30465_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox603.9e-193589256
                                T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                    Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                                rk+ +++keq + Lee F++n++++ +++e LA++l+L  rqV vWFqNrRa+ k
  Cotton_A_30465_BGI-A2_v1.0 35 RKKLRLSKEQSRLLEESFRQNHTLNPRQKEALASELKLRPRQVEVWFQNRRARSK 89
                                788899***********************************************98 PP

2HD-ZIP_I/II115.43.3e-3735123190
                 HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLeke 83 
                                 +kk+rlskeq++lLEesF+++++L+p++K++la+eL+l+prqv+vWFqnrRAR k+kq+E+++++Lkr+++ l+++n++L++e
  Cotton_A_30465_BGI-A2_v1.0  35 RKKLRLSKEQSRLLEESFRQNHTLNPRQKEALASELKLRPRQVEVWFQNRRARSKLKQTEMEFQYLKRWFEFLTKQNQELQSE 117
                                 69********************************************************************************* PP

                 HD-ZIP_I/II  84 veeLree 90 
                                 veeLr +
  Cotton_A_30465_BGI-A2_v1.0 118 VEELR-A 123
                                 ****9.4 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.88E-182392IPR009057Homeodomain-like
PROSITE profilePS5007117.0383191IPR001356Homeobox domain
SMARTSM003891.3E-163395IPR001356Homeobox domain
CDDcd000862.59E-153492No hitNo description
Gene3DG3DSA:1.10.10.605.7E-183589IPR009057Homeodomain-like
PfamPF000462.1E-163589IPR001356Homeobox domain
PROSITE patternPS0002706689IPR017970Homeobox, conserved site
PfamPF021832.5E-891124IPR003106Leucine zipper, homeobox-associated
SMARTSM003406.2E-1591134IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 189 aa     Download sequence    Send to blast
MNRLPSPGSD DEWIASTMEV VVDEENTTND GVVPRKKLRL SKEQSRLLEE SFRQNHTLNP  60
RQKEALASEL KLRPRQVEVW FQNRRARSKL KQTEMEFQYL KRWFEFLTKQ NQELQSEVEE  120
LRALQVGPPT VISPHSREPL PASTLTTCPR CERVTTISSR GAGLINTTTS TNNTSTTSAL  180
QSRPSSAAG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
18391RRARSKLKQ
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription repressor that binds to the DNA sequence 5'-CAAT[GC]ATTG-3'. {ECO:0000269|PubMed:10732669}.
UniProtProbable transcription repressor that binds to the DNA sequence 5'-CAAT[GC]ATTG-3'. {ECO:0000269|PubMed:10732669}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017605967.11e-136PREDICTED: homeobox-leucine zipper protein HOX3-like
SwissprotQ0JKX14e-65HOX3_ORYSJ; Homeobox-leucine zipper protein HOX3
SwissprotQ9XH384e-65HOX3_ORYSI; Homeobox-leucine zipper protein HOX3
TrEMBLA0A0D2TZ181e-133A0A0D2TZ18_GOSRA; Uncharacterized protein
TrEMBLA0A1U8J2I61e-133A0A1U8J2I6_GOSHI; homeobox-leucine zipper protein HOX3-like
TrEMBLA0A2P5RRZ81e-133A0A2P5RRZ8_GOSBA; Uncharacterized protein
TrEMBLA0A2P5X4U11e-133A0A2P5X4U1_GOSBA; Uncharacterized protein
STRINGGorai.009G331300.11e-134(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48902747
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G01430.16e-62homeobox-leucine zipper protein 17