PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GSBRNA2T00120548001
Common NameGSBRNA2T00120548001, LOC106382489
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Brassica
Family Trihelix
Protein Properties Length: 599aa    MW: 68537.5 Da    PI: 8.4826
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GSBRNA2T00120548001genomeGenoscopeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix46.78.6e-1596177285
             trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrt...sessstcpyfdql 85 
                          W+ +evlaL+++r+ +e+++ +       We  s+k++e g++rsp +Ckek+e+ ++ry + +++ ++++    +++ ++++f+++
  GSBRNA2T00120548001  96 WCSDEVLALLRFRSTVENWFPEF-----TWELTSRKLAEVGYKRSPRECKEKFEEEERRYFNSNNNTNDHHisnYNNKGNYRMFSEV 177
                          ********************998.....9*******************************999888887732223344466666665 PP

2trihelix99.62.5e-31450546186
             trihelix   1 rWtkqevlaLiearr.......emeerlr.....rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessst 78 
                          rW+++evlaLi++rr       +++++ +     +++++ plWe++skkm e g++rs+k+Ckekwen+nk++kk+k+ +kkr + +s+t
  GSBRNA2T00120548001 450 RWPRDEVLALINIRRsissindDDHHKGGislssSSSKAVPLWERISKKMLESGYKRSAKRCKEKWENINKYFKKTKNVNKKR-PLDSRT 538
                          8**************777766533333332222224699*******************************************8.9***** PP

             trihelix  79 cpyfdqle 86 
                          cpyf+ql+
  GSBRNA2T00120548001 539 CPYFHQLT 546
                          ******98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138373.9E-1094160No hitNo description
PROSITE profilePS500905.55295147IPR017877Myb-like domain
PfamPF138371.2E-19449547No hitNo description
CDDcd122034.35E-24449526No hitNo description
PROSITE profilePS500906.829450519IPR017877Myb-like domain
Sequence ? help Back to Top
Protein Sequence    Length: 599 aa     Download sequence    Send to blast
MFDGGVPEQI DQFIASPQPP PLLPPHQPAT ERSLPFPVSF ASFNSNHQAQ HMLSLDSRKI  60
IHHHHHHGIK DGGVSSEWIG HTDHDGHNHR HFHHPWCSDE VLALLRFRST VENWFPEFTW  120
ELTSRKLAEV GYKRSPRECK EKFEEEERRY FNSNNNTNDH HISNYNNKGN YRMFSEVEEF  180
YHHGHNDDEH VSSEVGDNQN KRNNSLKGKE NVEEMGHNLL EEGKRDHQGQ GQVEESSMGN  240
KINPVDNVRN EDGAKSSSSS SLMMIMRDKK KRKRKKKERF GVLKGFCEGL VRNMIVQQEE  300
MHKKLLEDMV KKEEEKMARE EAWKKQEMER LNKEVEIRAN EQAMASDRNT SIIKFICKFT  360
GHDNHDDGNG MVQSPRPSQD SSSLVLPKTQ GRRKCQTSSS LLPQALTPHN PISLQTNDIP  420
LEPISTETLK TKTQNRKPPL SDEKSDTGKR WPRDEVLALI NIRRSISSIN DDDHHKGGIS  480
LSSSSSKAVP LWERISKKML ESGYKRSAKR CKEKWENINK YFKKTKNVNK KRPLDSRTCP  540
YFHQLTALYS QPTTTATDTS TGELETRVGS GDSVMHVDAN GAGEKSNVQF SGFDLALP*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1268276KKKRKRKKK
2269274KKRKRK
3269276KKRKRKKK
4270276KRKRKKK
5271275RKRKK
6271277RKRKKKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapGSBRNA2T00120548001
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_013677968.10.0trihelix transcription factor GTL2
SwissprotQ8H1810.0GTL2_ARATH; Trihelix transcription factor GTL2
TrEMBLA0A3P6E5R70.0A0A3P6E5R7_BRAOL; Uncharacterized protein
STRINGBo2g152980.10.0(Brassica oleracea)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM82682838
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G28300.10.0Trihelix family protein
Publications ? help Back to Top
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  2. Chalhoub B, et al.
    Plant genetics. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome.
    Science, 2014. 345(6199): p. 950-3
    [PMID:25146293]