PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.001G194600.1.p
Common Name Sb01g017120, SORBIDRAFT_01g017120
Organism Sorghum bicolor
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family Trihelix
Protein Properties Length: 808aa    MW: 85117.1 Da    PI: 6.6683
Description Trihelix family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.001G194600.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  95.9   3.6e-30  106    190  1          87
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW++qe+l+L+++r++m+ ++r++ lk+plWe+vs+k++++g++rs+k+Ckek+en++k+yk++ke++ +r  ++ +t+++f+qlea
  Sobic.001G194600.1.p 106 RWPRQETLELLKIRSDMDAAFRDATLKGPLWEQVSRKLADKGYSRSAKKCKEKFENVHKYYKRTKESRAGR--NDGKTYRFFTQLEA 190
                           8********************************************************************97..56668*******85 PP

2    trihelix  109.9  1.6e-34  482    567  1          87
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k ev+aLi++r+++++r++++  k+plWee+s+ mr+ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Sobic.001G194600.1.p 482 RWPKAEVHALIQLRSNLDTRYQEAGPKGPLWEEISAGMRRLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 567
                           8*********************************************************************8.99***********85 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           0.0021    103    165  IPR001005    SANT/Myb domain
PROSITE profile  PS50090           6.632     105    163  IPR017877    Myb-like domain
CDD              cd12203           1.32E-27  105    170  No hit       No description
Pfam             PF13837           2.5E-20   105    191  No hit       No description
PROSITE profile  PS50090           7.201     475    539  IPR017877    Myb-like domain
SMART            SM00717           0.0078    479    541  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  2.7E-4    480    538  IPR009057    Homeodomain-like
CDD              cd12203           3.94E-30  481    546  No hit       No description
Pfam             PF13837           8.9E-24   481    568  No hit       No description
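The hits listed above fall into two stretches of the protein. As a rough illustration that is not part of the PlantTFDB record, the Python sketch below merges overlapping hit coordinates into contiguous regions. The tuple layout and the function name `merge_regions` are hypothetical, and the Gene3D row assumes its fused fields read as E-value 2.7E-4, start 480, end 538.

```python
# Hit coordinates copied from the Protein Features table: (database, entry ID, start, end).
# The Gene3D coordinates (480, 538) are an assumed reading of a fused field.
HITS = [
    ("SMART", "SM00717", 103, 165),
    ("PROSITE profile", "PS50090", 105, 163),
    ("CDD", "cd12203", 105, 170),
    ("Pfam", "PF13837", 105, 191),
    ("PROSITE profile", "PS50090", 475, 539),
    ("SMART", "SM00717", 479, 541),
    ("Gene3D", "G3DSA:1.10.10.60", 480, 538),
    ("CDD", "cd12203", 481, 546),
    ("Pfam", "PF13837", 481, 568),
]


def merge_regions(hits):
    """Merge hits whose residue ranges overlap into contiguous regions.

    Hypothetical helper: sorts hits by start position, then extends the
    current region while the next hit starts inside it.
    """
    regions = []
    for database, _entry_id, start, end in sorted(hits, key=lambda h: h[2]):
        if regions and start <= regions[-1]["end"]:
            # Overlaps the current region: extend it and record the source.
            regions[-1]["end"] = max(regions[-1]["end"], end)
            regions[-1]["databases"].append(database)
        else:
            # No overlap: open a new region.
            regions.append({"start": start, "end": end, "databases": [database]})
    return regions


for region in merge_regions(HITS):
    print(region["start"], region["end"], sorted(set(region["databases"])))
```

Under these assumptions the two merged regions each enclose one of the trihelix signature domains reported earlier in the record (106-190 and 482-567).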
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 808 aa
MQQQQHGGGG GGGGGGGGGP AQQFGAQQVE MPPPFSPAGG ASQRISLAEA PSPISSRPPA  60
SSSAPAQQYD ELGASGAGAV LGFDAEGLAA AAAGEEGASG GSAGNRWPRQ ETLELLKIRS  120
DMDAAFRDAT LKGPLWEQVS RKLADKGYSR SAKKCKEKFE NVHKYYKRTK ESRAGRNDGK  180
TYRFFTQLEA LHGTGGAAPA SSVASQVPPA GPSAVRVPAE PPPAVLAGGV GMPTMGYPSF  240
STSNTEDYTD EDDSDDEGTQ ELVGGGGGGA DERGKRKRVS EGGASAAGGG SGKMMRFFEG  300
LMKQVMERQE AMQQRFLEAI EKREQDRMIR EEAWRRQEMT RLAREQEILA QERAMAASRD  360
AAVLSFIQKI TGQTIPMPSI AAPTINAMPP PPPSHPKPPP PQPHPTPIAS ASPAPPPPQP  420
PASQTPPPQQ QQQQKPPMPA STPQAPAPQQ QSMDIVMTTA ETTPRADTPV HEGSSGGATS  480
SRWPKAEVHA LIQLRSNLDT RYQEAGPKGP LWEEISAGMR RLGYNRNAKR CKEKWENINK  540
YFKKVKESNK KRPEDSKTCP YFHQLDALYR NKAALSSSGA GAVVHAVNAS SSAQPQETVT  600
VVTAAAPISQ TPPPLPPPTT QPSQSHHAAK NGGTASNAAC TGTTGNGNGA GAPVHGSRGM  660
QTQPSNGSVA ASRFVGEGGG ATPAPAKKPE DIMKEMMEQR HPQPQQQTQA VVSGYNRIDG  720
ADSDNMDEDD DEDDYDDDDD EDEDVDGNKM QYKIQFQHQQ QHHQPPQHPN TVRPNAGAGG  780
GNPPGTAAPS TAAAPTTTAG SFLGMVK*
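The sequence above is printed in blocks of ten residues with a running position counter at the end of each line. A minimal Python sketch, assuming that layout, shows how the block could be flattened and how the 1-based, inclusive coordinates from the Signature Domain table map onto it. The helper names `parse_blocked_sequence` and `domain` are hypothetical and are not part of any PlantTFDB tool.

```python
import re

# Sequence block copied from the record; trailing numbers are position counters.
RAW = """\
MQQQQHGGGG GGGGGGGGGP AQQFGAQQVE MPPPFSPAGG ASQRISLAEA PSPISSRPPA  60
SSSAPAQQYD ELGASGAGAV LGFDAEGLAA AAAGEEGASG GSAGNRWPRQ ETLELLKIRS  120
DMDAAFRDAT LKGPLWEQVS RKLADKGYSR SAKKCKEKFE NVHKYYKRTK ESRAGRNDGK  180
TYRFFTQLEA LHGTGGAAPA SSVASQVPPA GPSAVRVPAE PPPAVLAGGV GMPTMGYPSF  240
STSNTEDYTD EDDSDDEGTQ ELVGGGGGGA DERGKRKRVS EGGASAAGGG SGKMMRFFEG  300
LMKQVMERQE AMQQRFLEAI EKREQDRMIR EEAWRRQEMT RLAREQEILA QERAMAASRD  360
AAVLSFIQKI TGQTIPMPSI AAPTINAMPP PPPSHPKPPP PQPHPTPIAS ASPAPPPPQP  420
PASQTPPPQQ QQQQKPPMPA STPQAPAPQQ QSMDIVMTTA ETTPRADTPV HEGSSGGATS  480
SRWPKAEVHA LIQLRSNLDT RYQEAGPKGP LWEEISAGMR RLGYNRNAKR CKEKWENINK  540
YFKKVKESNK KRPEDSKTCP YFHQLDALYR NKAALSSSGA GAVVHAVNAS SSAQPQETVT  600
VVTAAAPISQ TPPPLPPPTT QPSQSHHAAK NGGTASNAAC TGTTGNGNGA GAPVHGSRGM  660
QTQPSNGSVA ASRFVGEGGG ATPAPAKKPE DIMKEMMEQR HPQPQQQTQA VVSGYNRIDG  720
ADSDNMDEDD DEDDYDDDDD EDEDVDGNKM QYKIQFQHQQ QHHQPPQHPN TVRPNAGAGG  780
GNPPGTAAPS TAAAPTTTAG SFLGMVK*
"""


def parse_blocked_sequence(raw: str) -> str:
    """Remove position counters and whitespace, keeping residues and '*'."""
    return re.sub(r"[\d\s]", "", raw)


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive, as in the record)."""
    return seq[start - 1:end]


seq = parse_blocked_sequence(RAW)
residues = seq.rstrip("*")  # drop the terminal stop marker

print(len(seq), len(residues))
print(domain(residues, 106, 190))  # first trihelix domain
print(domain(residues, 482, 567))  # second trihelix domain
```

Parsed this way, the block holds 807 residue letters plus the terminal `*`, which suggests that the stated length of 808 counts the stop marker. The two slices start with RWPRQ and RWPKA respectively, matching the aligned segments in the Signature Domain section.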
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Sobic.001G194600.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  BT061205  0.0      BT061205.1 Zea mays full-length cDNA clone ZM_BFb0123A12 mRNA, complete cds.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_002464376.1  0.0      trihelix transcription factor GTL1 isoform X1
Swissprot  Q39117          7e-64    TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     C5WVE4          0.0      C5WVE4_SORBI; Uncharacterized protein
STRING     Sb01g017120.1   0.0      (Sorghum bicolor)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Monocots              OGMP60638175
Representative plant  OGRP6631573
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  9e-53    Trihelix family protein