PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.0751s0009.1.p
Organism Capsella grandiflora
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family HD-ZIP
Protein Properties Length: 825aa    MW: 91442.5 Da    PI: 5.1984
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Cagra.0751s0009.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  64.8   1.2e-20  112    167  1          56
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          +++ +++t+ q++e+e+lF++n++p+ ++r++L+++lgL+ rqVk+WFqNrR+++k
  Cagra.0751s0009.1.p 112 KKRYHRHTNRQIQEMEALFKENPHPDDKQRKRLSAELGLKPRQVKFWFQNRRTQMK 167
                          688999***********************************************998 PP

2    START     141.4  9.1e-45  320    553  1          206
                          HHHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS................SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT CS
                START   1 elaeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv...............dsgealrasgvvdmvlallveellddkeqWdet 75 
                          e+a ++ qel k+ + eep+W k  + ++g evl   ee+                 +  ea++a +vv+m++ +lv  +l+   +W+e+
  Cagra.0751s0009.1.p 320 EIAVSCVQELTKMCETEEPLWIKKKSDKIGGEVLCLNEEEYMrlfpwpvenpnnkgdFGREASKANAVVIMNSITLVDAFLNAD-KWSEM 408
                          578899*****************9955555555544444333556666678999999999************************.***** PP

                          -S....EEEEEEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-..TTS--..-TTSEE-EESSEE CS
                START  76 la....kaetlevissg.....galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvds..eqkppe.sssvvRaellpSg 151
                          +     +a+t++ issg     g l lm+aelq+lsplvp R+ +f+Ry +q  + g w+ivd  +ds  +q++p  +      ++ pSg
  Cagra.0751s0009.1.p 409 FCsivaRAKTVQIISSGvsgasGSLLLMYAELQVLSPLVPtREAYFLRYVEQnAETGNWAIVDFPIDSfhDQMQPPsTNTPHEYKRKPSG 498
                          *99999**********************************************99*********99885223333334444444558**** PP

                          EEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 152 iliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          ++i++++ng+s+v wvehv++++++ h+ +   vksg+a+ga++w+  lqrqce+
  Cagra.0751s0009.1.p 499 CIIQDMPNGYSQVKWVEHVEVDEKHLHETFADYVKSGMAFGANRWLDVLQRQCER 553
                          *****************************************************97 PP
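
The Start and End columns in the Signature Domain table appear to be 1-based, inclusive positions on the protein, since the Homeobox alignment row for Cagra.0751s0009.1.p runs from 112 to 167. A minimal Python sketch of pulling that region out of the protein follows. The helper name `domain_slice` and the constant `PREFIX_1_TO_180` are illustrative and not part of PlantTFDB; the residues are the first 180 positions copied from the Sequence section of this record.

```python
# First 180 residues of Cagra.0751s0009.1.p, copied from this record's
# Sequence section with spaces and position counters removed.
PREFIX_1_TO_180 = (
    "MLSMGDDNVMTSNNMRFASQLLPSSSSPGTIQNPNFNFIPFNSFSSIIPKEEHGMMSMMM"
    "MMGDGTVEEMMENGSAGGSFGSGSEQAEDPKFGNESDVNELQDGEQPPPAKKKRYHRHTN"
    "RQIQEMEALFKENPHPDDKQRKRLSAELGLKPRQVKFWFQNRRTQMKAHQDRTENVLLRA"
)


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive.

    Assumption: the table coordinates are 1-based inclusive; this helper
    is a hypothetical convenience, not a PlantTFDB function.
    """
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"coordinates {start}-{end} fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Homeobox domain coordinates taken from row 1 of the table (112 to 167).
homeobox = domain_slice(PREFIX_1_TO_180, 112, 167)
print(homeobox)
```

The printed string can be compared by eye with the Cagra.0751s0009.1.p row of the Homeobox alignment above.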

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          5.85E-20   101    169  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.3E-21    107    174  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.394     109    169  IPR001356    Homeobox domain
SMART            SM00389           1.9E-18    110    173  IPR001356    Homeobox domain
CDD              cd00086           3.69E-19   112    170  No hit       No description
Pfam             PF00046           3.5E-18    112    167  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0          144    167  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           43.299     311    556  IPR002913    START domain
SuperFamily      SSF55961          2.06E-30   312    555  No hit       No description
CDD              cd08875           8.41E-107  315    552  No hit       No description
SMART            SM00234           3.2E-27    320    553  IPR002913    START domain
Pfam             PF01852           2.4E-37    321    553  IPR002913    START domain
SuperFamily      SSF55961          7.69E-15   597    800  No hit       No description
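
Several databases report overlapping hits for the same stretch of the protein. One way to summarise them is to merge overlapping Start/End intervals into contiguous regions. The sketch below is an illustration only: interval merging is not something PlantTFDB is stated to do, and the name `merge_intervals` is hypothetical. The coordinates are the Start and End values from the table above.

```python
from typing import List, Tuple

# (Start, End) pairs from the Protein Features table, in table order.
FEATURE_HITS: List[Tuple[int, int]] = [
    (101, 169), (107, 174), (109, 169), (110, 173), (112, 170),
    (112, 167), (144, 167), (311, 556), (312, 555), (315, 552),
    (320, 553), (321, 553), (597, 800),
]


def merge_intervals(intervals: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Merge overlapping closed intervals into non-overlapping regions."""
    merged: List[Tuple[int, int]] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            # Starts after the previous region ends: open a new region.
            merged.append((start, end))
    return merged


regions = merge_intervals(FEATURE_HITS)
print(regions)
```

For this record the hits collapse into three regions: 101 to 174 (homeodomain hits), 311 to 556 (START domain hits), and 597 to 800 (the second SSF55961 hit).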
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048497  Biological Process  maintenance of floral organ identity
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 825 aa
MLSMGDDNVM TSNNMRFASQ LLPSSSSPGT IQNPNFNFIP FNSFSSIIPK EEHGMMSMMM  60
MMGDGTVEEM MENGSAGGSF GSGSEQAEDP KFGNESDVNE LQDGEQPPPA KKKRYHRHTN  120
RQIQEMEALF KENPHPDDKQ RKRLSAELGL KPRQVKFWFQ NRRTQMKAHQ DRTENVLLRA  180
ENDSLKSENC HLQAELRCLS CPSCGGPTVL GEIPFSELHI ENCRLREELD RVCSITSRYN  240
GRPMQSMPSS QALITPSPTL PHHQPSLELD MSVYAGNFPE QSCADMMMLP PQDTTCFFPD  300
QTANNNNMLL ADEEKVIAME IAVSCVQELT KMCETEEPLW IKKKSDKIGG EVLCLNEEEY  360
MRLFPWPVEN PNNKGDFGRE ASKANAVVIM NSITLVDAFL NADKWSEMFC SIVARAKTVQ  420
IISSGVSGAS GSLLLMYAEL QVLSPLVPTR EAYFLRYVEQ NAETGNWAIV DFPIDSFHDQ  480
MQPPSTNTPH EYKRKPSGCI IQDMPNGYSQ VKWVEHVEVD EKHLHETFAD YVKSGMAFGA  540
NRWLDVLQRQ CERIASLMAR NITDLGVISS AEARRNIMRL SQRMVRTFCV NISTAYGQSW  600
TALSETTKDT VRITTRKMCE AGQPTGVVLC AVSTTWLPFS HHQVFDLIRD QHHQSLLEVL  660
FNGNSPHEVA HIANGSHPGN CISLLRINVA SNSWHNVELM LQESSIDNSG SLIVYSTVDV  720
DSVQLAMNGE DSSNIPILPL GFSIVPVNPP EGVSVNSNSP PSCLLTVAIQ VLASNVPTAK  780
PNLSTVTTIN NHLCATVNQI TSALTSTVTP AIASSDAVSK QEVS*
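
The sequence above is laid out in groups of ten residues with a running position counter at the end of each full line and a trailing `*` marking the end of translation. A minimal Python sketch for turning such a block into one contiguous string follows. The function name `parse_sequence_block` is hypothetical, and the layout rules it relies on are inferred from this record rather than from any PlantTFDB specification.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped, counter-annotated sequence block into one string.

    Assumptions (inferred from this record's layout): residues are upper-case
    letters in space-separated groups, a full line may end with a decimal
    position counter, and a final '*' marks the end of translation.
    """
    pieces = []
    for line in block.splitlines():
        # Drop the running counter at the end of the line, if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between ten-residue groups.
        pieces.append(re.sub(r"\s+", "", line))
    # Strip the terminal stop symbol so only residue letters remain.
    return "".join(pieces).rstrip("*")


# The first and last lines of the block above, used as a small example.
example_block = (
    "MLSMGDDNVM TSNNMRFASQ LLPSSSSPGT IQNPNFNFIP FNSFSSIIPK EEHGMMSMMM  60\n"
    "PNLSTVTTIN NHLCATVNQI TSALTSTVTP AIASSDAVSK QEVS*"
)
residues = parse_sequence_block(example_block)
print(len(residues), residues[:10], residues[-4:])
```

Passing the full block instead of `example_block` gives the complete protein as a plain string, suitable for slicing by the Start and End coordinates used elsewhere in this record.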
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    137    142  DKQRKR
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Cagra.0751s0009.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AB013394  0.0      AB013394.1 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MQD22.
GenBank  CP002688  0.0      CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein
Source     Hit ID               E-value  Description
Refseq     XP_006281890.1       0.0      homeobox-leucine zipper protein HDG5
Swissprot  Q9FJS2               0.0      HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBL     R0GUB6               0.0      R0GUB6_9BRAS; Uncharacterized protein
STRING     Cagra.0751s0009.1.p  0.0      (Capsella grandiflora)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM43562548
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G46880.1  0.0      homeobox-7
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Lung SC, et al.
    Arabidopsis ACYL-COA-BINDING PROTEIN1 interacts with STEROL C4-METHYL OXIDASE1-2 to modulate gene expression of homeodomain-leucine zipper IV transcription factors.
    New Phytol., 2018. 218(1): p. 183-200
    [PMID:29288621]