PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.006G163600.2.p
Common NameSb06g023980, SORBIDRAFT_06g023980
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family Trihelix
Protein Properties Length: 666aa    MW: 71462.5 Da    PI: 9.4002
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.006G163600.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix97.89.8e-3182166187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+++e++aLi++r+em+ ++r++ lk+plWe+vs+k+++ g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++fd+lea
  Sobic.006G163600.2.p  82 RWPREETQALIRIRSEMDATFRDATLKGPLWEDVSRKLADLGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YRFFDELEA 166
                           8*********************************************************************866665..******985 PP

2trihelix107.87e-34490575187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k ev+aLi++r +++ r++++  k+plWe++s+ mr+ g++rs+k+Ckekwen+nk+ykk+ke++kkr +e+s+tcpyf+qlea
  Sobic.006G163600.2.p 490 RWPKTEVHALIQLRMDLDMRYQETGPKGPLWEDISSGMRRLGYNRSSKRCKEKWENINKYYKKVKESNKKR-PEDSKTCPYFHQLEA 575
                           8*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.53875139IPR017877Myb-like domain
SMARTSM007171.9E-479141IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.607.1E-480139IPR009057Homeodomain-like
PfamPF138375.2E-2281167No hitNo description
CDDcd122032.53E-2881146No hitNo description
SMARTSM007170.0019487549IPR001005SANT/Myb domain
PROSITE profilePS500907.236489547IPR017877Myb-like domain
PfamPF138372.0E-23489576No hitNo description
CDDcd122032.43E-28490554No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 666 aa     Download sequence    Send to blast
MGPFSPTAGA TGGPMPLSSR PPSSAQPQPQ PQPQPQQPRS SYDELAVVSG TAGAGGFDDE  60
MMGSGGGGGG GGGGSSGASS NRWPREETQA LIRIRSEMDA TFRDATLKGP LWEDVSRKLA  120
DLGYKRSAKK CKEKFENVHK YYKRTKEGRA GRQDGKSYRF FDELEALHAA APQPQPQPQP  180
PQMQQQQLPP ATTAPAPLHA FAAPPPMSSM PPPTGPMQPA PISSAAPAVV QVHQAPVELP  240
PAAHQPLNLQ GFSFSSMSDS ESDDESEDDD MTAETGGSQD RLGKRKRGDG GGASGSSKKM  300
MTFFEGLMQQ VVDRQEEMQR RFLETMEKRE AERTAREEAW RRQEVARLNR EQEQLAQERA  360
AAASRDAAII AFLQRIGGQS VQPATAVVVP MPAPVPVHTP PPPKQQSRQQ QPPPPPSPQA  420
TPQSKPISAA PLQQQPPQKQ PKDTSSQQDA GTPRSAPPTS GASLELVPVA EHHVDSGLGG  480
GDGGAASSSR WPKTEVHALI QLRMDLDMRY QETGPKGPLW EDISSGMRRL GYNRSSKRCK  540
EKWENINKYY KKVKESNKKR PEDSKTCPYF HQLEAIYSRK HLRAAAASSN AAAAAVAPPP  600
AYPDQLNPSR HEIEGKNIND DKRNNGGSGG GTQVPSSNGE TTAPTTTPAA FDADTGMKKK  660
TSSGS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
16374SGGGGGGGGGGS
2124132KRSAKKCKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.006G163600.2.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankEU9568430.0EU956843.1 Zea mays clone 1575747 mRNA sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021319136.10.0trihelix transcription factor GT-2 isoform X2
SwissprotQ391173e-73TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A1Z5REB20.0A0A1Z5REB2_SORBI; Uncharacterized protein
STRINGSb06g023980.10.0(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G33240.15e-41GT-2-like 1