PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.1036s0042.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family HD-ZIP
Protein Properties Length: 313 aa    MW: 35996 Da    pI: 6.9721
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Cagra.1036s0042.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     56.1   6.1e-18  117    169  4          56
                          -SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   4 RttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          + +++ eq+++Le+ Fe  +++  e++ +LAk lgL+ rq+ +WFqNrRa++k
  Cagra.1036s0042.1.p 117 KKRLNLEQVRALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWK 169
                          4568889*********************************************9 PP

2    HD-ZIP_I/II  128.4  3e-41    115    206  1          92
          HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLree 90 
                          ekk+rl+ eqv++LE+sFe  +kLeperK++la++Lglqprq+a+WFqnrRAR+ktkqlE+dy++Lk+++d lk++n++L +++++L++e
  Cagra.1036s0042.1.p 115 EKKKRLNLEQVRALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYDSLKKQFDVLKSDNDSLLAHNKKLHAE 204
                          69**************************************************************************************99 PP

          HD-ZIP_I/II  91 lk 92 
                          l 
  Cagra.1036s0042.1.p 205 LV 206
                          86 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          2.87E-19  105    173  IPR009057    Homeodomain-like
SMART            SM00389           2.6E-17   114    175  IPR001356    Homeobox domain
PROSITE profile  PS50071           16.552    115    171  IPR001356    Homeobox domain
CDD              cd00086           3.19E-16  116    172  No hit       No description
Pfam             PF00046           3.2E-15   117    169  IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60  4.0E-19   119    179  IPR009057    Homeodomain-like
PRINTS           PR00031           1.2E-5    142    151  IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         146    169  IPR017970    Homeobox, conserved site
PRINTS           PR00031           1.2E-5    151    167  IPR000047    Helix-turn-helix motif
Pfam             PF02183           4.6E-15   171    210  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 313 aa
MYMYEEGRNN ISSNQEGLRL EMAFPQHGFM FQQLHEDNAH HLPSPTSLPS CPPHLFYGGG  60
GNYMMNRSMS FTGVSDHHHN LTQKSPTTTH NMNDQDQVGE EDNLSDDGSH MMLGEKKKRL  120
NLEQVRALEK SFELGNKLEP ERKMQLAKAL GLQPRQIAIW FQNRRARWKT KQLERDYDSL  180
KKQFDVLKSD NDSLLAHNKK LHAELVALKK HDRKESAKIK RELAEASWSN NGSTENNNTS  240
DINHVSMIKD LFPSSIRTAT ATTTSTHIDQ HMVQEQDQGF CNMFNGIDET TSASYWAWPD  300
QQQQHHNHHQ FN*
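
The Start and End columns of the Signature Domain table can be checked against the sequence block above. The following is a minimal Python sketch, assuming those coordinates are 1-based and inclusive, as in the alignment display. The helper names `clean_sequence` and `region` are illustrative and are not part of PlantTFDB.

```python
import re

# Sequence block copied from the record above, including the position
# counters at the end of each line and the trailing stop symbol '*'.
RAW_BLOCK = """\
MYMYEEGRNN ISSNQEGLRL EMAFPQHGFM FQQLHEDNAH HLPSPTSLPS CPPHLFYGGG  60
GNYMMNRSMS FTGVSDHHHN LTQKSPTTTH NMNDQDQVGE EDNLSDDGSH MMLGEKKKRL  120
NLEQVRALEK SFELGNKLEP ERKMQLAKAL GLQPRQIAIW FQNRRARWKT KQLERDYDSL  180
KKQFDVLKSD NDSLLAHNKK LHAELVALKK HDRKESAKIK RELAEASWSN NGSTENNNTS  240
DINHVSMIKD LFPSSIRTAT ATTTSTHIDQ HMVQEQDQGF CNMFNGIDET TSASYWAWPD  300
QQQQHHNHHQ FN*
"""


def clean_sequence(block: str) -> str:
    """Drop counters, spaces, line breaks and the stop symbol; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive (assumption)."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW_BLOCK)
HOMEOBOX = region(SEQ, 117, 169)   # row 1 of the Signature Domain table
HDZIP = region(SEQ, 115, 206)      # row 2 of the Signature Domain table

print(HOMEOBOX)
print(HDZIP)
```

Under that assumption, the printed Homeobox region should agree with the query line of the first alignment, and the HD-ZIP_I/II region with the query lines of the second.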
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    163    171  RRARWKTKQ
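
The NLS coordinates can be compared with the numbering used in the alignment and sequence blocks. The sketch below is illustrative only: it searches for the listed motif in residues 121 to 180 (the third line of the sequence block) and reports a 1-based, inclusive span. The function name `locate` is hypothetical.

```python
from typing import Optional, Tuple

# Residues 121-180, copied from the third line of the sequence block.
FRAGMENT = "NLEQVRALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYDSL"
FRAGMENT_START = 121  # 1-based position of the first residue in FRAGMENT
NLS = "RRARWKTKQ"     # motif as listed in the NLS table


def locate(motif: str, fragment: str, fragment_start: int) -> Optional[Tuple[int, int]]:
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    idx = fragment.find(motif)
    if idx < 0:
        return None
    start = fragment_start + idx
    return start, start + len(motif) - 1


NLS_SPAN = locate(NLS, FRAGMENT, FRAGMENT_START)
print(NLS_SPAN)
```

With 1-based inclusive numbering the motif spans residues 164 to 172, one position later than the listed Start and End of 163 and 171. This would be consistent with the NLS table using 0-based coordinates, although that reading is an assumption and is not stated in the record.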
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Cagra.1036s0042.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AK229364  0.0      AK229364.1 Arabidopsis thaliana mRNA for homeobox protein, complete cds, clone: RAFL16-61-C06.
GenBank  BT044059  0.0      BT044059.1 Arabidopsis thaliana unknown protein (At5g15150) mRNA, complete cds.
Annotation -- Protein
Source     Hit ID               E-value  Description
Refseq     XP_006288253.1       0.0      homeobox-leucine zipper protein HAT7
Swissprot  Q00466               0.0      HAT7_ARATH; Homeobox-leucine zipper protein HAT7
TrEMBL     R0FGQ4               0.0      R0FGQ4_9BRAS; Uncharacterized protein
STRING     Cagra.1036s0042.1.p  0.0      (Capsella grandiflora)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM30952666
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G15150.1  1e-173   homeobox 3