PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.006G163600.3.p
Common NameSb06g023980, SORBIDRAFT_06g023980
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family Trihelix
Protein Properties Length: 748aa    MW: 80402 Da    PI: 6.1463
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.006G163600.3.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix97.51.2e-3082166187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+++e++aLi++r+em+ ++r++ lk+plWe+vs+k+++ g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++fd+lea
  Sobic.006G163600.3.p  82 RWPREETQALIRIRSEMDATFRDATLKGPLWEDVSRKLADLGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YRFFDELEA 166
                           8*********************************************************************866665..******985 PP

2trihelix107.68.3e-34490575187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k ev+aLi++r +++ r++++  k+plWe++s+ mr+ g++rs+k+Ckekwen+nk+ykk+ke++kkr +e+s+tcpyf+qlea
  Sobic.006G163600.3.p 490 RWPKTEVHALIQLRMDLDMRYQETGPKGPLWEDISSGMRRLGYNRSSKRCKEKWENINKYYKKVKESNKKR-PEDSKTCPYFHQLEA 575
                           8*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.53875139IPR017877Myb-like domain
SMARTSM007171.9E-479141IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.608.3E-480139IPR009057Homeodomain-like
PfamPF138376.1E-2281167No hitNo description
CDDcd122031.44E-2881146No hitNo description
SMARTSM007170.0019487549IPR001005SANT/Myb domain
PROSITE profilePS500907.236489547IPR017877Myb-like domain
PfamPF138372.4E-23489576No hitNo description
CDDcd122031.39E-28490554No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 748 aa     Download sequence    Send to blast
MGPFSPTAGA TGGPMPLSSR PPSSAQPQPQ PQPQPQQPRS SYDELAVVSG TAGAGGFDDE  60
MMGSGGGGGG GGGGSSGASS NRWPREETQA LIRIRSEMDA TFRDATLKGP LWEDVSRKLA  120
DLGYKRSAKK CKEKFENVHK YYKRTKEGRA GRQDGKSYRF FDELEALHAA APQPQPQPQP  180
PQMQQQQLPP ATTAPAPLHA FAAPPPMSSM PPPTGPMQPA PISSAAPAVV QVHQAPVELP  240
PAAHQPLNLQ GFSFSSMSDS ESDDESEDDD MTAETGGSQD RLGKRKRGDG GGASGSSKKM  300
MTFFEGLMQQ VVDRQEEMQR RFLETMEKRE AERTAREEAW RRQEVARLNR EQEQLAQERA  360
AAASRDAAII AFLQRIGGQS VQPATAVVVP MPAPVPVHTP PPPKQQSRQQ QPPPPPSPQA  420
TPQSKPISAA PLQQQPPQKQ PKDTSSQQDA GTPRSAPPTS GASLELVPVA EHHVDSGLGG  480
GDGGAASSSR WPKTEVHALI QLRMDLDMRY QETGPKGPLW EDISSGMRRL GYNRSSKRCK  540
EKWENINKYY KKVKESNKKR PEDSKTCPYF HQLEAIYSRK HLRAAAASSN AAAAAVAPPP  600
AYPDQLNPSR HEIEGKNIND DKRNNGGSGG GTQVPSSNGE TTAPTTTPAA FDADTGMKKP  660
EDIVRELNEQ PPREFTTEDE TDSDDMGDEY TDDGEEGEDD GKMQYRIQFQ RPNPGGTNTA  720
PAPAAATTAA PAVPTSAPTS TFLAMVQ*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
16374SGGGGGGGGGGS
2124132KRSAKKCKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.006G163600.3.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0550330.0BT055033.1 Zea mays full-length cDNA clone ZM_BFc0059A16 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002448251.20.0trihelix transcription factor GT-2 isoform X1
SwissprotQ391171e-72TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A1Z5RF210.0A0A1Z5RF21_SORBI; Uncharacterized protein
STRINGSb06g023980.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP60638175
Representative plantOGRP1696233
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G33240.11e-40GT-2-like 1