PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.004G300500.1.p
Common NameSb04g033390, SORBIDRAFT_04g033390
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family Trihelix
Protein Properties Length: 721aa    MW: 76747.9 Da    PI: 5.67
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.004G300500.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix96.72e-30109193187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+++e+laLi++r+em+ ++r++ lk+plWe+v++k++e g++rs+k+Ckek+en+ k+yk++k+ + +r++++s  +++f+qlea
  Sobic.004G300500.1.p 109 RWPREETLALIRIRTEMDADFRNAPLKAPLWEDVARKLAELGYQRSAKKCKEKFENVDKYYKRTKDARAGRQDGKS--YRFFSQLEA 193
                           8*********************************************************************866665..*******85 PP

2trihelix108.64e-34442527187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k+ev aLi++r+e++e++ ++  k+plWe++++ mr+ g++rs+k+Ckekwen+nk+ykk+ke++k+r +e+s+tcpyf+ql+a
  Sobic.004G300500.1.p 442 RWPKEEVEALIQMRNEKDEQYHDAGGKGPLWEDIAAGMRRIGYNRSAKRCKEKWENINKYYKKVKESNKRR-PEDSKTCPYFHQLDA 527
                           8********************************************************************97.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.13106168IPR001005SANT/Myb domain
PfamPF138374.9E-21108194No hitNo description
CDDcd122033.37E-28109173No hitNo description
PROSITE profilePS500906.98109166IPR017877Myb-like domain
PROSITE profilePS500908.072435499IPR017877Myb-like domain
SMARTSM007172.8E-6439501IPR001005SANT/Myb domain
PfamPF138371.2E-22441528No hitNo description
Gene3DG3DSA:1.10.10.608.3E-5441498IPR009057Homeodomain-like
SuperFamilySSF466891.7E-5441515IPR009057Homeodomain-like
CDDcd122037.07E-26442506No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 721 aa     Download sequence    Send to blast
MLHHHGGAGS PYMAPTTAGT GAGTAPLSPT PTAPAVVSVT DIPAAPPTMQ LQPAGPSANF  60
EELPVVGSGS GVGVGAGAGA AASIIQDDDM QADLGGSGAG ASGSGGHHRW PREETLALIR  120
IRTEMDADFR NAPLKAPLWE DVARKLAELG YQRSAKKCKE KFENVDKYYK RTKDARAGRQ  180
DGKSYRFFSQ LEALHAAAPP QLLLPPPPPS GMSMTTVQAG PHQPMAMAWT AGPSALGPPA  240
GAGLPDLSFS SMSGSESESD SYSDDYDYDD SDAGEEGLGR EQGLGRGECD REMMAIFEGM  300
MKQVTEKQDA MQRVFLETLE RWEAERTARE EAWRRQEVAR MNREREQLAR ERAAAASRDA  360
ALIAFLQRVG GGQGQPVARL PPHSAGVVPA PPIPDHTPSS PRRHDAAAAA TYLQQLVPTS  420
HKAVEALTWT GGEGSGSTSS SRWPKEEVEA LIQMRNEKDE QYHDAGGKGP LWEDIAAGMR  480
RIGYNRSAKR CKEKWENINK YYKKVKESNK RRPEDSKTCP YFHQLDAMYS KKHRAGGGRG  540
SSRTAPAANM ATAVTVAVAV AAAVQDNPSQ RELEGKSSND VDHRKNDELG NVQASPGNGD  600
TAPTTTTPPG DGAKNKTAED NVTETNVQHQ QQQQQGFSAD ETDSDDDINM ARDYTVYTEE  660
GNDEDKMKYK MGVQKPDVIG SSGNVPASPP APAPAPAAPV TAAAPTSSAA PTGSTFLAVQ  720
*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.004G300500.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankEU9472750.0EU947275.1 Zea mays clone 336252 mRNA sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002452839.10.0trihelix transcription factor GTL1 isoform X2
TrEMBLC5XS350.0C5XS35_SORBI; Uncharacterized protein
STRINGSb04g033390.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP60638175
Representative plantOGRP6631573
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.11e-42Trihelix family protein