PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bostr.20129s0126.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Boechereae; Boechera
Family Trihelix
Protein Properties Length: 613aa    MW: 68733.1 Da    PI: 5.9545
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bostr.20129s0126.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix91.11.2e-2855139187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km+e g+ r++k+Ckek+en+ k++k++keg+ ++++++  t+++fdqlea
  Bostr.20129s0126.1.p  55 RWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFDQLEA 139
                           8********************************************************************975544..6*******85 PP

2trihelix105.24.6e-33413498187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k e+ aLi++r++++++++++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Bostr.20129s0126.1.p 413 RWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 498
                           8*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.1252114IPR001005SANT/Myb domain
PfamPF138376.0E-1854140No hitNo description
CDDcd122032.70E-2254119No hitNo description
PROSITE profilePS500906.95754112IPR017877Myb-like domain
PROSITE profilePS500907.375406470IPR017877Myb-like domain
SMARTSM007170.0017410472IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.605.3E-4411469IPR009057Homeodomain-like
PfamPF138373.4E-22412499No hitNo description
CDDcd122035.95E-27413477No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 613 aa     Download sequence    Send to blast
MMQLGGTPTT TTTASTATAP PPQSNDSAAT EAAAAAVGAF EVSEEMNDRG FGGNRWPRQE  60
TLALLKIRSD MGIAFRDASV KGPLWEEVSR KMAELGYIRN AKKCKEKFEN VYKYHKRTKE  120
GRTGKSEGKT YRFFDQLEAL ETQSTTASLH HQQPQPPLRP HQNNNNNNNS SIFSTPPPVT  180
TVMPAVANIS TLPSSSIPPY TQQMNVPSFP NISGDFLSDN STSSSSSYST SSDMDIGGGG  240
GTKTNRKKRK RKWKEFFERL MKQVVDKQEE LQRKFLEAVE KREHERLVRE ESWRLQEIAR  300
INREHEILAQ ERSMSAAKDA AVMAFLQKLS EKQPNQPTVQ PQPQPQPQPQ QVQPPQMQLN  360
NNNQQQTPQP SPPSPPPPLL QPIQAIVPTS DTTKMDNGDQ NMTPASASAS SSRWPKVEIE  420
ALIKLRTNLD SKYQENGPKG PLWEEISAGM RRLGFNRNSK RCKEKWENIN KYFKKVKESN  480
KKRPEDSKTC PYFHQLDALY RERNKFHSSN NNNNIASSSS ASGLVKPDNS VPLMVQPEQQ  540
WPPAITTTAT TAVAAVQLAQ HPQPSDQNFD DEEGTDEEYD DEDEDEDEEN EEEEGGEFEL  600
VPSDNNKTTN NL*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1245250RKKRKR
2245251RKKRKRK
3246251KKRKRK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapBostr.20129s0126.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0792830.0AC079283.4 Arabidopsis thaliana chromosome 1 BAC F7O12 genomic sequence, complete sequence.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002887660.10.0trihelix transcription factor GT-2
SwissprotQ391171e-148TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLD7KTX90.0D7KTX9_ARALL; Uncharacterized protein
STRINGBostr.20129s0126.1.p0.0(Boechera stricta)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.10.0Trihelix family protein