PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.0094s0026.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family HD-ZIP
Protein Properties Length: 685aa    MW: 76165.4 Da    PI: 6.1796
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Cagra.0094s0026.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  57.8   1.8e-18  62     117  1          56
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          r k +++t++q++eLe++F+ +++p++++r eL kkl L+ +q+k+WFqNrR+++k
  Cagra.0094s0026.1.p  62 RTKYHRHTSYQIQELESFFKVCPHPNEKQRLELGKKLTLESKQIKFWFQNRRTQMK 117
                          3455678999*******************************************999 PP

2    START     146.6  2.4e-46  228    432  2          206
                          HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
                START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                          la ea++e++k+ +++ p+W + s    ++e++   e         +r+sg+v  +++ lv  l+d++ +W e+++     a+t++vis+
  Cagra.0094s0026.1.p 228 LAMEAMEEFLKLEELDNPLWNSKS----EKESMNHNE-----YRSSSRESGLVLINSVALVDALMDTN-KWAEMFEcivaVASTVKVISN 307
                          5789****************9999....444444442.....22347*********************.********************* PP

                          T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-E CS
                START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehv 170
                          g      g lqlm+ae+q++splvp +   f+Ry++q+g+g w++vdvS d +++ ++ +s+  ++++pSg++i++ +ng skvtw+eh 
  Cagra.0094s0026.1.p 308 GsdgsrnGSLQLMQAEFQVMSPLVPiKQKKFLRYCKQHGDGLWAVVDVSYDINREDENLKSYGGSKKFPSGCIIQDIGNGCSKVTWIEHL 397
                          *******************************************************999******************************** PP

                          E--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 171 dlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          +++++++ +++ +l++s++a ga +w+atlqrqce+
  Cagra.0094s0026.1.p 398 EYEESHINSVY-QLLGSSVALGATKWLATLQRQCES 432
                          *******9998.689999****************95 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  3.6E-19    46     119  IPR009057    Homeodomain-like
SuperFamily      SSF46689          6.68E-18   55     119  IPR009057    Homeodomain-like
SMART            SM00389           5.7E-15    58     123  IPR001356    Homeobox domain
PROSITE profile  PS50071           16.86      59     119  IPR001356    Homeobox domain
CDD              cd00086           1.67E-15   62     119  No hit       No description
Pfam             PF00046           3.2E-16    62     117  IPR001356    Homeobox domain
PROSITE profile  PS50848           39.207     218    435  IPR002913    START domain
CDD              cd08875           1.20E-100  223    431  No hit       No description
SuperFamily      SSF55961          3.02E-27   224    433  No hit       No description
SMART            SM00234           6.8E-33    227    432  IPR002913    START domain
Pfam             PF01852           7.0E-40    228    432  IPR002913    START domain
SuperFamily      SSF55961          1.1E-17    450    671  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 685 aa
MNGDLDVDMS RGDLNPSFFH GRLKDDEFES RSLSDDDSFD AMSGDENKQE EQRPKKKKKK  60
KRTKYHRHTS YQIQELESFF KVCPHPNEKQ RLELGKKLTL ESKQIKFWFQ NRRTQMKTQL  120
ERHENVILRQ ENEKLRVENG FLKESMRGSL CIDCGGAVIP GEVSFEQHQL RIENAKLKDE  180
LDRICALANR FIGGSISLEQ PSNGGIGSQH FPIGNGFSGG TSQMFMDLAM EAMEEFLKLE  240
ELDNPLWNSK SEKESMNHNE YRSSSRESGL VLINSVALVD ALMDTNKWAE MFECIVAVAS  300
TVKVISNGSD GSRNGSLQLM QAEFQVMSPL VPIKQKKFLR YCKQHGDGLW AVVDVSYDIN  360
REDENLKSYG GSKKFPSGCI IQDIGNGCSK VTWIEHLEYE ESHINSVYQL LGSSVALGAT  420
KWLATLQRQC ESFTSLLSSQ DHTGLSLAGT KSILKLAQRM KLNFYSGITA SSVHKWEKLN  480
AENVGQDTRI LTRKSFEPSG IVLSAATSLW LPVTQQRLFE FLCDGKCRNQ WDILSNGASM  540
EITLLVPKGQ QEGSCVSLLC AAGKDQNESS MLILQETWND ASGALVVYAP VDFPSMNVVM  600
SGGDSGYVAL LPSGFSILPD GSSLSDQIDT NGNQESKGCL LTVGFQILVN SLPTAKLNVE  660
SVETVNNLIA CTIHKIRAAL RIPA*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    55     61   KKKKKKR
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the DNA sequence 5'-GCATTAAATGC-3'. {ECO:0000269|PubMed:16778018}.
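As an illustration of how the binding sequence reported above might be used, the following minimal Python sketch scans a DNA string for exact matches of 5'-GCATTAAATGC-3' on both strands. The function names and the promoter fragment are hypothetical and are not part of PlantTFDB or UniProt. Exact string matching is also a simplifying assumption, since real binding sites usually tolerate some mismatches.

```python
def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(dna))


def find_sites(sequence: str, motif: str = "GCATTAAATGC"):
    """Yield (1-based start, strand) for each exact motif match on either strand."""
    sequence = sequence.upper()
    # The minus-strand site appears on the given strand as the reverse complement.
    targets = {"+": motif, "-": reverse_complement(motif)}
    for strand, target in targets.items():
        start = sequence.find(target)
        while start != -1:
            yield start + 1, strand
            # Step by one so overlapping matches are not skipped.
            start = sequence.find(target, start + 1)


# Hypothetical promoter fragment, made up for illustration only.
promoter = "TTGACGCATTAAATGCAAGT"
hits = list(find_sites(promoter))
print(hits)
```

The motif is not identical to its own reverse complement (GCATTTAATGC), so the two strands are searched as separate patterns.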
Binding Motif
Motif ID  Method  Source                   Motif file
MP00559   DAP     Transfer from AT5G52170  Download
Motif logo
Cis-element
Source       Link
PlantRegMap  Cagra.0094s0026.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AB025603  1e-157   AB025603.1 Arabidopsis thaliana genomic DNA, chromosome 5, BAC clone:F17P19.
GenBank  CP002688  1e-157   CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein
Source     Hit ID               E-value  Description
Refseq     XP_006282099.1       0.0      homeobox-leucine zipper protein HDG7
Swissprot  Q9LTK3               0.0      HDG7_ARATH; Homeobox-leucine zipper protein HDG7
TrEMBL     R0F0C9               0.0      R0F0C9_9BRAS; Uncharacterized protein
STRING     Cagra.0094s0026.1.p  0.0      (Capsella grandiflora)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM112827105
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52170.1  0.0      homeodomain GLABROUS 7
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]