PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa08g048930.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 747aa    MW: 81680.2 Da    PI: 5.7423
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa08g048930.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox61.21.6e-1963118156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++ +++t+ q++eLe++F+++++p+ ++r+eL++ l+L+  qVk+WFqN+R+++k
  Csa08g048930.1  63 KKRYHRHTQRQIQELESFFKECPHPDDKQRKELSRDLNLEPLQVKFWFQNKRTQMK 118
                     688999***********************************************998 PP

2START220.65e-692544741206
                     HHHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                     ela +a++elv++a+a +p+Wv+     e++n++e+ ++f+++ +      ++ea+r+s+vv+m++ +lve+l+d++ qW+  +     +a tle
  Csa08g048930.1 254 ELAVAAMEELVRMAQAVDPLWVSTDnsmEILNEEEYFRTFPRGIGpkplgLRSEASRESAVVIMNHINLVEILMDVN-QWSCVFSgivsRALTLE 347
                     57899******************99999**************999********************************.***************** PP

                     EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE CS
           START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvd 171
                     v+s+g      galq+m+ae+q++splvp R+ +fvRy++q+++g+w++vdvS+ds ++ +   ++ R +++pSg+li++++ng+skvtw+eh++
  Csa08g048930.1 348 VLSTGvagnynGALQVMTAEFQVPSPLVPtRENYFVRYCKQHSDGSWAVVDVSLDSLRPST---PILRTRRRPSGCLIQELPNGYSKVTWIEHME 439
                     **********************************************************976...7****************************** PP

                     --SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 172 lkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     +++r++h+++r+lv+sgla+gak+wvatl+rqce+
  Csa08g048930.1 440 VDDRSVHNMYRPLVHSGLAFGAKRWVATLERQCER 474
                     *********************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.2E-2344118IPR009057Homeodomain-like
SuperFamilySSF466898.36E-1951120IPR009057Homeodomain-like
PROSITE profilePS5007116.95760120IPR001356Homeobox domain
SMARTSM003891.0E-1761124IPR001356Homeobox domain
PfamPF000463.7E-1763118IPR001356Homeobox domain
CDDcd000863.61E-1863121No hitNo description
PROSITE patternPS00027095118IPR017970Homeobox, conserved site
PROSITE profilePS5084845.064245477IPR002913START domain
SuperFamilySSF559611.02E-35247476No hitNo description
CDDcd088753.12E-126249473No hitNo description
SMARTSM002341.3E-77254474IPR002913START domain
PfamPF018522.0E-58255474IPR002913START domain
Gene3DG3DSA:3.30.530.201.1E-7351474IPR023393START-like domain
SuperFamilySSF559615.37E-26493732No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 747 aa     Download sequence    Send to blast
MYHPNMFESH HMFDMTPKST SDNDLGITGS REDDFETKSG AEVTTENPSG EELQDPNQRP  60
NKKKRYHRHT QRQIQELESF FKECPHPDDK QRKELSRDLN LEPLQVKFWF QNKRTQMKAQ  120
SERHENQILK SDNDKLRAEN NRYKEALSNA TCPNCGGPAA IGEMSFDEQH LRIENARLRE  180
EIDRISAIAA KYVGKPLGSS FAPLAIHAPS RSLDLEVGNF GNQTGFGGEM YGTGDILRSA  240
VSIPSDTDKP IIVELAVAAM EELVRMAQAV DPLWVSTDNS MEILNEEEYF RTFPRGIGPK  300
PLGLRSEASR ESAVVIMNHI NLVEILMDVN QWSCVFSGIV SRALTLEVLS TGVAGNYNGA  360
LQVMTAEFQV PSPLVPTREN YFVRYCKQHS DGSWAVVDVS LDSLRPSTPI LRTRRRPSGC  420
LIQELPNGYS KVTWIEHMEV DDRSVHNMYR PLVHSGLAFG AKRWVATLER QCERLASSMA  480
SNIPGDLSVI TSPEGRKSML KLAERMVMSF CSGVGASTAH AWTTMSTTGS DDVRVMTRKS  540
MDDPGRPPGI VLSAATSFWI PVAPKRVFDF LRDENSRKEW DILSNGGMVQ EMAHIANGHE  600
PGNCVSLLRV NSGNSSQSNM LILQESCTDA SGSYVIYAPV DIVAMNVVLS GGDPDYVALL  660
PSGFAILPDG SVGGGDGNHQ EVVSSTSFGS CGGSLLTVAF QILVDSVPTA KLSLGSVATV  720
NSLIKCTVER IKAAVACDNG GAGGGA*
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa08g048930.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAF4245600.0AF424560.1 Arabidopsis thaliana AT4g04890/T1J1_3 mRNA, complete cds.
GenBankAY0625750.0AY062575.1 Arabidopsis thaliana Unknown protein (At4g04890; T1J1.3) mRNA, complete cds.
GenBankBT0001440.0BT000144.1 Arabidopsis thaliana Unknown protein (At4g04890) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010422328.10.0PREDICTED: homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like isoform X1
RefseqXP_010422329.10.0PREDICTED: homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like isoform X1
RefseqXP_010422330.10.0PREDICTED: homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like isoform X1
SwissprotQ93V990.0PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBLR0FD460.0R0FD46_9BRAS; Uncharacterized protein
STRINGXP_010422328.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G04890.10.0protodermal factor 2
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]