PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa20g025400.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 709aa    MW: 79780.3 Da    PI: 6.9586
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
Csa20g025400.1    genome    CSGP    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    Homeobox    59.3    6.4e-19    25    80    1    56
                    TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                    rr+ +++t++q+++Le++F+++++p+  +r++L ++l+L+ +q+k+WFqN+R++ k
  Csa20g025400.1 25 RRNYHRHTNQQIQRLEAYFKECPHPDDLQRRQLGEELNLKPKQIKFWFQNKRTQAK 80
                    789999***********************************************988 PP

2    START    102.5    7.5e-33    235    450    9    206
                     HHHHHHHHC-TT-EEEE.......EXCCTTEEEEEEESSS...SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.......EEEEEEEECT CS
           START   9 elvkkalaeepgWvkss.......esengdevlqkfeeskv..dsgealrasgvvdmvlallveellddkeqWdetla.......kaetleviss 87 
                     e++ +++  e++W+kss       +  n++++  k++  k+   + e +++++vv m++ +lv  +ld+  +W + ++        + +l+  ++
  Csa20g025400.1 235 EVMNLIQ-MEELWKKSSidnrlviDPTNYEKCFGKISHFKGpsGRPESSKEVVVVQMDARNLVDMFLDTE-KWARLFPtivneakTMHVLDSMDN 327
                     4454444.4689************9***********777767767789**********************.9*9999999995555555555555 PP

                     T..EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX.H CS
           START  88 g..galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp.h 178
                     g  +   +++ ++  lsplvp R+f ++R+++q+++ +w+i+dvS + ++    +s    ++++pSg lie ++n+ skvtw+ehv++++++  h
  Csa20g025400.1 328 GrqTLARVIYEQMHILSPLVPpREFIILRSCQQMEENVWMIADVSCNLPNVEF-NSMAPICTKHPSGVLIEALPNRCSKVTWIEHVEVSDKMRpH 421
                     5*999999*************************************99998877.6777778****************************99955* PP

                     HHHHHH.HHHHHHHHHHHHHHHTXXXXXX CS
           START 179 wllrsl.vksglaegaktwvatlqrqcek 206
                      l+r l +  gl  ga++w+ tl+r ce+
  Csa20g025400.1 422 RLYRDLfLYGGLGYGARRWTVTLERMCER 450
                     ****97257789***************96 PP

Protein Features
3D Structure
Database    Entry ID    E-value    Start    End    InterPro ID    Description
SuperFamily    SSF46689    3.89E-18    6    81    IPR009057    Homeodomain-like
Gene3D    G3DSA:1.10.10.60    4.5E-19    9    82    IPR009057    Homeodomain-like
PROSITE profile    PS50071    16.844    22    82    IPR001356    Homeobox domain
SMART    SM00389    4.0E-17    24    86    IPR001356    Homeobox domain
Pfam    PF00046    1.7E-16    25    80    IPR001356    Homeobox domain
CDD    cd00086    6.43E-17    25    83    No hit    No description
PROSITE profile    PS50848    36.463    218    453    IPR002913    START domain
SuperFamily    SSF55961    1.79E-27    220    451    No hit    No description
CDD    cd08875    2.03E-83    222    449    No hit    No description
SMART    SM00234    3.7E-12    227    450    IPR002913    START domain
Gene3D    G3DSA:3.30.530.20    2.2E-10    230    413    IPR023393    START-like domain
Pfam    PF01852    8.5E-27    234    450    IPR002913    START domain
SuperFamily    SSF55961    4.05E-8    473    670    No hit    No description
Gene Ontology
GO Term    GO Category    GO Description
GO:0003677    Molecular Function    DNA binding
GO:0008289    Molecular Function    lipid binding
Sequence
Protein Sequence    Length: 709 aa
MDSSHDDSSS DERGTSTDTN NNQDRRNYHR HTNQQIQRLE AYFKECPHPD DLQRRQLGEE  60
LNLKPKQIKF WFQNKRTQAK SHSERVDNAV LRAENMRMRR ENEAMEDALK NVVCGPCSGR  120
GFGREEKQRN IQKLRAENAF LKREFERYSN FLAQQGGHSM PSVDAFTYPR GPSTYGSTSN  180
NRRASYGTSS NHLPQPSCSL RGPYARGNIS LNQPHQLSQM EKLVMFETAA KAVAEVMNLI  240
QMEELWKKSS IDNRLVIDPT NYEKCFGKIS HFKGPSGRPE SSKEVVVVQM DARNLVDMFL  300
DTEKWARLFP TIVNEAKTMH VLDSMDNGRQ TLARVIYEQM HILSPLVPPR EFIILRSCQQ  360
MEENVWMIAD VSCNLPNVEF NSMAPICTKH PSGVLIEALP NRCSKVTWIE HVEVSDKMRP  420
HRLYRDLFLY GGLGYGARRW TVTLERMCER LHLSSISDLP NNDYAGVVQT IEGRRSVLKL  480
GERMLKDFAW MIKMEDKLDF AQQSETNNSG VTIAMRLNHE AGQPPGLILC AGSSLCLPLP  540
PPQVYDFLRN LDIRHQWDVL CNGNSVTEAA RFVTGTDTNN NVNFIQASSG GDNNSKLMIL  600
QDGFIDALGG MVVYAPMDLK TAAAAISGQV DPSAIPILPS GFIISRDGRP SSAEDPDGGS  660
STLLTVAFQI LVCDPDNCTN FNLEESATTV NTVISSTVQR IKRMLNCD*
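The numbered sequence block above has to be reduced to a plain residue string before the Start/End coordinates from the Signature Domain table can be applied. The following is a minimal sketch of that step, assuming the block uses 10-residue groups with running position counts and a trailing `*` for the stop codon. The helper names `clean_sequence` and `slice_domain` are hypothetical, and only the first two lines of the record are embedded. Start and End are treated as 1-based and inclusive, which agrees with the Homeobox alignment shown in this record.

```python
import re

# First two lines of the protein sequence block, copied from the record.
SEQUENCE_BLOCK = """
MDSSHDDSSS DERGTSTDTN NNQDRRNYHR HTNQQIQRLE AYFKECPHPD DLQRRQLGEE  60
LNLKPKQIKF WFQNKRTQAK SHSERVDNAV LRAENMRMRR ENEAMEDALK NVVCGPCSGR  120
"""


def clean_sequence(block: str) -> str:
    """Drop position counts, whitespace, and a terminal '*' (hypothetical helper)."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, 1-based inclusive (assumed convention)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("domain coordinates fall outside the sequence")
    return sequence[start - 1:end]


protein = clean_sequence(SEQUENCE_BLOCK)
homeobox = slice_domain(protein, 25, 80)  # Homeobox domain: Start 25, End 80
print(len(protein), homeobox)
```

The extracted slice can be compared against the query line of the Homeobox alignment as a sanity check on the coordinate convention.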
Functional Description
Source    Description
UniProt    Probable transcription factor that binds to the DNA sequence 5'-GCATTAAATGCGCA-3'. {ECO:0000269|PubMed:16778018}.
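To illustrate how the binding sequence quoted above might be located in a nucleotide sequence, the sketch below scans both strands for exact matches. It is an illustration only: the function names `reverse_complement` and `find_motif` are hypothetical, the example fragment is made up and does not come from the database, and exact string matching is an assumption that ignores any tolerated mismatches in real binding sites.

```python
# Binding site quoted in the UniProt description (written 5'->3').
MOTIF = "GCATTAAATGCGCA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_motif(dna: str, motif: str = MOTIF):
    """Return sorted (0-based position, strand) pairs for exact matches."""
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        pos = dna.find(pattern)
        while pos != -1:
            hits.append((pos, strand))
            pos = dna.find(pattern, pos + 1)  # allow overlapping matches
    return sorted(hits)


# Made-up fragment for illustration only (not taken from the database).
example = "TTGCATTAAATGCGCAAACCTGCGCATTTAATGCGG"
print(find_motif(example))
```

Positions are reported on the forward strand, so a "-" hit marks where the reverse complement of the motif begins.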
Cis-element
Source    Link
PlantRegMap    Csa20g025400.1
Regulation -- PlantRegMap
Source    Upstream Regulator    Target Gene
PlantRegMap    Retrieve    -
Annotation -- Nucleotide
Source    Hit ID    E-value    Description
GenBank    CP002684    4e-50    CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
GenBank    F21H2    4e-50    AC007894.2 Arabidopsis thaliana chromosome 1 BAC F21H2 sequence, complete sequence.
Annotation -- Protein
Source    Hit ID    E-value    Description
Refseq    XP_010492677.1    0.0    PREDICTED: homeobox-leucine zipper protein HDG9-like
Swissprot    Q9FFI0    0.0    HDG9_ARATH; Homeobox-leucine zipper protein HDG9
TrEMBL    R0H7P3    0.0    R0H7P3_9BRAS; Uncharacterized protein
STRING    XP_010492677.1    0.0    (Camelina sativa)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM8468    15    31
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT5G17320.1    0.0    homeodomain GLABROUS 9
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]