PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID MDP0000231216
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Maloideae; Maleae; Malus
Family TALE
Protein Properties Length: 777aa    MW: 87083 Da    PI: 8.1679
Description TALE family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
MDP0000231216  genome  GDR     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  27.7   4.7e-09  370    404  21         55
                    HSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHH CS
       Homeobox  21 knrypsaeereeLAkklgLterqVkvWFqNrRake 55 
                    k +yps++++  LA+++gL+++q+ +WF N+R ++
  MDP0000231216 370 KWPYPSESQKLALAESTGLDQKQINNWFINQRKRH 404
                    569*****************************885 PP

2    ELK       39.9   9e-14    324    345  1          22
            ELK   1 ELKhqLlrKYsgyLgsLkqEFs 22 
                    ELK qLlrKYsgyLgsLkqEF+
  MDP0000231216 324 ELKGQLLRKYSGYLGSLKQEFM 345
                    9********************8 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
SMART            SM01255             9.0E-19   124    168  IPR005540    KNOX1
Pfam             PF03790             1.8E-21   126    164  IPR005540    KNOX1
SMART            SM01256             1.5E-26   178    229  IPR005541    KNOX2
Pfam             PF03791             5.2E-21   182    228  IPR005541    KNOX2
PROSITE profile  PS51213             11.174    324    344  IPR005539    ELK domain
SMART            SM01188             2.8E-8    324    345  IPR005539    ELK domain
Pfam             PF03789             1.5E-10   324    345  IPR005539    ELK domain
PROSITE profile  PS50071             12.989    344    407  IPR001356    Homeobox domain
SuperFamily      SSF46689            3.25E-19  346    422  IPR009057    Homeodomain-like
SMART            SM00389             1.3E-12   346    411  IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60    1.2E-27   349    409  IPR009057    Homeodomain-like
CDD              cd00086             1.65E-11  356    407  No hit       No description
Pfam             PF05920             1.7E-16   364    403  IPR008422    Homeobox KN domain
PROSITE pattern  PS00027             0         382    405  IPR017970    Homeobox, conserved site
Gene3D           G3DSA:3.40.1370.10  9.9E-25   579    679  IPR023574    Ribosomal protein L4 domain
SuperFamily      SSF52166            1.31E-58  579    770  IPR023574    Ribosomal protein L4 domain
Hamap            MF_01328_B          25.449    588    772  IPR013005    50S ribosomal protein uL4
Pfam             PF00573             2.5E-35   594    679  IPR002136    Ribosomal protein L4/L1e
Pfam             PF00573             2.8E-13   679    769  IPR002136    Ribosomal protein L4/L1e
Gene3D           G3DSA:3.40.1370.10  1.5E-19   680    771  IPR023574    Ribosomal protein L4 domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0006412  Biological Process  translation
GO:0005634  Cellular Component  nucleus
GO:0005840  Cellular Component  ribosome
GO:0003735  Molecular Function  structural constituent of ribosome
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 777 aa
MEGGSNGTCS MMAFGESSSN GGGMCPMMMM PLMTSSHHAD HSHHHHQHHQ PMNPNPNADA  60
HNTSTTHQFH QLPQLPPSNN HHDRHNTSGG SSFVLHNPAA ASYFMDNNNN NEGVGGSSFS  120
SSSSIVKAKI MAHPHYHRLL ASYINCQKVG APPEVVARLE EACASAATIG QMVSSSSGSG  180
SLGEDPALDQ FMEAYCEMLT KYEQELSKPF KEAMIFFQRI ESQFKALTLS SSSDSAAVLV  240
TSSILDLDSL ACLSENYSVA SDKLQYQTNF EAGTFDLSII KLPFFPRKRS DVRGYGEGID  300
RNNGSSEEEV DVNNFIDPQA EDRELKGQLL RKYSGYLGSL KQEFMKKRKK GKLPKEARQQ  360
LLDWWSRHYK WPYPSESQKL ALAESTGLDQ KQINNWFINQ RKRHWKPTED MQFVVMDASH  420
PGHYYMDAAP DLPPATFLSL SHSPSFHDES TLRLEDQRPT AFPNWVLLSS SSSLCFLGTF  480
DYYAFSILCL DEGLFGQFSI TFVHLRTRDR VRRRNFCGHV PGDTLLSGEL SAVFKQIEGF  540
QPPTRLPNQV EVPFPSNLLS AKPVVASERS IGLXQDLVIP VTNFHYEDKG FTVLAGDVFD  600
VPIRKDIIHR VVLWQLAKRQ QGTHSTKTIS EVSGTGRKPW NQKGTGRARH GTLRGPQFRG  660
GCTMHGPKPR SHEIKLNKKL LVFEDFEVPT HKTKNIVNYV QQMEGSKKFL LVDGXKTEDG  720
KQMLPEKLKL ATQNLHYVNV LPAVGLNVYS ILQHDTLVMT RAAVNEIVKR MHTPINR
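The Start and End columns in the Signature Domain table can be checked against the sequence block above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is what the alignment rows in this record suggest). The helper names `clean_sequence` and `region` are hypothetical and not part of any PlantTFDB tool, and only the two sequence lines covering residues 301-420 are copied in.

```python
import re

# Two lines copied from the Protein Sequence block of this record
# (residues 301-420). Each line holds six space-separated groups of
# 10 residues followed by a running residue count.
RAW_LINES = [
    "RNNGSSEEEV DVNNFIDPQA EDRELKGQLL RKYSGYLGSL KQEFMKKRKK GKLPKEARQQ  360",
    "LLDWWSRHYK WPYPSESQKL ALAESTGLDQ KQINNWFINQ RKRHWKPTED MQFVVMDASH  420",
]
FIRST_RESIDUE = 301  # 1-based position of the first residue in RAW_LINES


def clean_sequence(lines):
    """Drop whitespace and the trailing running count from each line."""
    return "".join(re.sub(r"[\s\d]", "", line) for line in lines)


def region(seq, start, end, offset=1):
    """Return residues start..end (assumed 1-based, inclusive).

    `offset` is the 1-based position of seq[0] in the full protein.
    """
    if start < offset or end > offset + len(seq) - 1 or start > end:
        raise ValueError("coordinates fall outside the supplied fragment")
    return seq[start - offset : end - offset + 1]


fragment = clean_sequence(RAW_LINES)

# Coordinates taken from the Signature Domain table.
elk = region(fragment, 324, 345, offset=FIRST_RESIDUE)
homeobox = region(fragment, 370, 404, offset=FIRST_RESIDUE)

print("ELK     ", elk)
print("Homeobox", homeobox)
```

Under that assumption, the two extracted strings should match the MDP0000231216 rows of the ELK and Homeobox alignments shown in the Signature Domain section.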
3D Structure
Structure
PDB ID   E-value  Query Start  Query End  Hit Start  Hit End  Description
4v49_BC  3e-34    595          769        10         196      50S ribosomal protein L4
4v4a_BC  3e-34    595          769        10         196      50S ribosomal protein L4
5dm6_C   3e-34    595          769        10         196      50S ribosomal protein L4
5dm7_C   3e-34    595          769        10         196      50S ribosomal protein L4
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    339    348  LKQEFMKKRK
2    345    349  KKRKK
Expression -- UniGene
UniGene ID  E-value  Expressed in
Mdo.12918   0.0      fruit
Expression -- Description
Source   Description
UniProt  DEVELOPMENTAL STAGE: First expressed in the embryo proliferation stage, increases during early somatic embryo development and decreases thereafter.
UniProt  TISSUE SPECIFICITY: Expressed mainly in embryonic tissues. Weakly detected in stems and hypocotyl.
Functional Description
Source   Description
UniProt  Possible transcription activator involved in early embryonic development. Probably binds to the DNA sequence 5'-TGAC-3'.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  DQ364232  1e-123   DQ364232.1 Prunus dulcis homeodomain protein Kn1 mRNA, complete cds.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_009356665.1  0.0      PREDICTED: homeobox protein knotted-1-like LET6
Swissprot  P46608          1e-124   HSBH1_SOYBN; Homeobox protein SBH1
TrEMBL     A0A498K0H3      0.0      A0A498K0H3_MALDO; Uncharacterized protein
STRING     XP_009356665.1  0.0      (Pyrus x bretschneideri)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G08150.1  2e-80    KNOTTED-like from Arabidopsis thaliana
Publications
  1. Shu Y, et al.
    GmSBH1, a homeobox transcription factor gene, relates to growth and development and involves in response to high temperature and humidity stress in soybean.
    Plant Cell Rep., 2015. 34(11): p. 1927-37
    [PMID:26205508]