PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cc07_g17350
Common NameGSCOC_T00037442001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family Trihelix
Protein Properties Length: 498aa    MW: 56246.8 Da    PI: 6.6911
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cc07_g17350genomeCGSCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix81.98.8e-2630113186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  rW+++e+laL+++r +m+ ++r++  k+plW+ev +k+ e g++rs+ +Ckek+en+ k++k++k+ +++r  ++ +++++f+qle
  Cc07_g17350  30 RWPREETLALLKIRADMDLAFRDSTVKAPLWDEVTRKLGELGYHRSARKCKEKFENIFKYHKRTKDCRSGR--QNGKNYRFFEQLE 113
                  8********************************************************************98..45567*******8 PP

2trihelix99.33.3e-31344428186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  rW+k ev aL+++r++++ +++++ lk+plWee+s++m++ g+ rs+k+Ckekwen+nk+yk+++e+ k+r +ess+tcpyf+ le
  Cc07_g17350 344 RWPKAEVEALVRLRTNLGMQFQDNGLKGPLWEEISSAMKKLGYDRSAKRCKEKWENINKYYKRVRESHKRR-PESSKTCPYFHLLE 428
                  8********************************************************************97.9**********997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.00832789IPR001005SANT/Myb domain
PfamPF138371.2E-1729114No hitNo description
CDDcd122035.59E-222994No hitNo description
PROSITE profilePS500907.0152987IPR017877Myb-like domain
PROSITE profilePS500908.049337401IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.606.1E-5341400IPR009057Homeodomain-like
SMARTSM007173.7E-4341403IPR001005SANT/Myb domain
PfamPF138377.4E-22343430No hitNo description
SuperFamilySSF466897.69E-5343417IPR009057Homeodomain-like
CDDcd122034.07E-25343408No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 498 aa     Download sequence    Send to blast
MVGAENGGGG GGGGGGGEEE ERGKGEGGNR WPREETLALL KIRADMDLAF RDSTVKAPLW  60
DEVTRKLGEL GYHRSARKCK EKFENIFKYH KRTKDCRSGR QNGKNYRFFE QLERFDNQPS  120
LPSPPLSQIQ THVAETTQTT TIAAPTIIKV TSGSLDSMVP HPSENPNMEF VTPSTSTTSS  180
SGRESEGSVK KKRKLSDYFE KLMKEILEKQ ENLQNQLLAA LEKCERDRIA REEAWRLQQM  240
DRIRKEQEYL ANERAISAAR DATVMAFLQK ISEQAIPGQF AEAATPISEK HPDKQQVQTP  300
GPFTPGTIEN QELGTSIGRQ EDAFDVDKRG NGFGESSIQA TTSRWPKAEV EALVRLRTNL  360
GMQFQDNGLK GPLWEEISSA MKKLGYDRSA KRCKEKWENI NKYYKRVRES HKRRPESSKT  420
CPYFHLLESI YEKKSKGVEQ NAEWSGNNLE PEHILMQMMG QQEQQPQHQQ LTEDEENDNG  480
DGYELVANHP SSVASME*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1189194KKKRKL
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_027075443.10.0trihelix transcription factor GT-2-like
TrEMBLA0A068UW860.0A0A068UW86_COFCA; Uncharacterized protein
STRINGVIT_04s0044g00510.t011e-168(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA97942328
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.23e-60Trihelix family protein