PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_14977_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 581 aa    MW: 65877.4 Da    pI: 6.3639
Description Trihelix family protein
Gene Model
Gene Model ID               Type    Source
Cotton_A_14977_BGI-A2_v1.0  genome  BGI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  46.5   9.9e-15  27     88   2          68
                    trihelix  2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68
                                W+++evlaL+++r+++e+++ +       We+vs+k+++ gf+rs+ +Ckek+e+ ++++  i+ ++
  Cotton_A_14977_BGI-A2_v1.0 27 WSNDEVLALLKVRSSIENWFPEF-----TWEHVSRKLADLGFKRSADKCKEKFEEESRYFNSINCSK 88
                                ********************998.....9*******************************9988665 PP

2    trihelix  96.2   3e-30    427    515  1          86
                    trihelix   1 rWtkqevlaLiearremeerlrrgk....lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstc 79 
                                 rW+++evlaLi++r ++ ++   +k     k+plWe++s+ m e g++rs+k+Ckekwen+nk+++k+k+++kkr s +s+tc
  Cotton_A_14977_BGI-A2_v1.0 427 RWPRDEVLALINLRCSLYNNGDHEKegtaIKAPLWERISQGMLELGYKRSAKRCKEKWENINKYFRKTKDINKKR-SLDSRTC 508
                                 8**************9998888653344499*******************************************8.9999*** PP

                    trihelix  80 pyfdqle 86 
                                 pyf+ql 
  Cotton_A_14977_BGI-A2_v1.0 509 PYFHQLS 515
                                 *****95 PP

Protein Features
Database         Entry ID   E-value   Start  End  InterPro ID  Description
SMART            SM00717    17        23     80   IPR001005    SANT/Myb domain
Pfam             PF13837    6.3E-10   25     88   No hit       No description
PROSITE profile  PS50090    6.133     26     78   IPR017877    Myb-like domain
SuperFamily      SSF101447  2.62E-6   108    116  No hit       No description
SMART            SM00717    0.24      424    490  IPR001005    SANT/Myb domain
Pfam             PF13837    5.9E-19   426    515  No hit       No description
CDD              cd12203    1.41E-24  426    495  No hit       No description
PROSITE profile  PS50090    6.539     427    488  IPR017877    Myb-like domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0016021  Cellular Component  integral component of membrane
GO:0001158  Molecular Function  enhancer sequence-specific DNA binding
GO:0005516  Molecular Function  calmodulin binding
Sequence
Protein Sequence    Length: 581 aa
MNMEINGRDQ RSIAEPIDNL HHHHHPWSND EVLALLKVRS SIENWFPEFT WEHVSRKLAD  60
LGFKRSADKC KEKFEEESRY FNSINCSKNY RIFSELEELY QAENPPPPPP PPPPPPPPHH  120
HSQQQQVAVV ADENNKNVEK SREDEDNMGQ NLEADSRNID ELYQTSPANN TAMSSDQDNK  180
KVVENKANYD DNKAAAAANN NKKRKRVKKL ELFKGFCEDI VNKLMIQQEE MHNKLIEDMV  240
KRDEEKVARE EAWKKQELDR INQELELRAK EQAIAGDRQA TIIKFLSKFS QTGSSKKQCF  300
GRVNEDVVKV PSECSNPPIA SSSPLVAVAE NPNPIVTDQN KVDQVSTTSP SSMNLAHQNK  360
QSMPISMTES QAPQNPNPET PDTSSLAPQN PNSVSAESNP LPPTSPLTVN KAPQNPTSNE  420
KEDLGKRWPR DEVLALINLR CSLYNNGDHE KEGTAIKAPL WERISQGMLE LGYKRSAKRC  480
KEKWENINKY FRKTKDINKK RSLDSRTCPY FHQLSTLYSQ GTLIAPSDGP ENRSPLPENH  540
SKLPETGKDS CQRGDKDSTV HVSGGNETNM VIQVPGFEFE F
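The sequence block above mixes residues with spacing and position markers, so it needs cleaning before the domain coordinates from the Signature Domain table can be applied to it. The following is a minimal sketch of that step, assuming the table's Start/End values are 1-based and inclusive (the alignment rows starting at residues 27 and 427 are consistent with this). The helper names `clean_sequence` and `region` are illustrative and are not part of PlantTFDB.

```python
import re

# Sequence block copied from this record; the trailing numbers are position markers.
RAW = """
MNMEINGRDQ RSIAEPIDNL HHHHHPWSND EVLALLKVRS SIENWFPEFT WEHVSRKLAD  60
LGFKRSADKC KEKFEEESRY FNSINCSKNY RIFSELEELY QAENPPPPPP PPPPPPPPHH  120
HSQQQQVAVV ADENNKNVEK SREDEDNMGQ NLEADSRNID ELYQTSPANN TAMSSDQDNK  180
KVVENKANYD DNKAAAAANN NKKRKRVKKL ELFKGFCEDI VNKLMIQQEE MHNKLIEDMV  240
KRDEEKVARE EAWKKQELDR INQELELRAK EQAIAGDRQA TIIKFLSKFS QTGSSKKQCF  300
GRVNEDVVKV PSECSNPPIA SSSPLVAVAE NPNPIVTDQN KVDQVSTTSP SSMNLAHQNK  360
QSMPISMTES QAPQNPNPET PDTSSLAPQN PNSVSAESNP LPPTSPLTVN KAPQNPTSNE  420
KEDLGKRWPR DEVLALINLR CSLYNNGDHE KEGTAIKAPL WERISQGMLE LGYKRSAKRC  480
KEKWENINKY FRKTKDINKK RSLDSRTCPY FHQLSTLYSQ GTLIAPSDGP ENRSPLPENH  540
SKLPETGKDS CQRGDKDSTV HVSGGNETNM VIQVPGFEFE F
"""


def clean_sequence(raw: str) -> str:
    """Remove whitespace and position markers, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
print(len(seq))              # 581, matching the stated protein length

# Trihelix domain coordinates taken from the Signature Domain table.
domain_1 = region(seq, 27, 88)
domain_2 = region(seq, 427, 515)
print(domain_1)
print(domain_2)
```

The extracted regions can be compared against the ungapped query rows of the two alignments as a quick sanity check on the coordinate convention.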
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    201    208  KKRKRVKK
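As a rough check on the NLS coordinates, the motif can be located directly in the protein sequence. The sketch below uses only residues 181-240 from the Sequence section and a hypothetical helper, `find_motif`, which reports 1-based inclusive positions. Under that numbering the motif falls at residues 202-209, so the Start/End values of 201/208 listed here may follow a different convention, such as a 0-based start. That reading is an assumption, not something the record states.

```python
# Residues 181-240 of the protein, copied from the Sequence section of this record.
SEGMENT = "KVVENKANYDDNKAAAAANNNKKRKRVKKLELFKGFCEDIVNKLMIQQEEMHNKLIEDMV"
SEGMENT_START = 181  # 1-based position of the first residue in SEGMENT


def find_motif(seq: str, motif: str, offset: int = 1) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) for every occurrence of motif.

    offset is the 1-based position of seq[0] in the full-length protein.
    """
    hits = []
    pos = seq.find(motif)
    while pos != -1:
        start = offset + pos
        hits.append((start, start + len(motif) - 1))
        pos = seq.find(motif, pos + 1)  # continue searching, allowing overlaps
    return hits


hits = find_motif(SEGMENT, "KKRKRVKK", offset=SEGMENT_START)
print(hits)  # [(202, 209)]
```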
Binding Motif
Motif ID  Method  Source
MP00011   PBM     Transfer from AT5G28300
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX963696  1e-101   JX963696.1 Gossypium hirsutum clone NBRI_TRANS-026 microsatellite sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_017630950.1      0.0      PREDICTED: trihelix transcription factor GTL2
TrEMBL  A0A2P5Y837          0.0      A0A2P5Y837_GOSBA; Uncharacterized protein
STRING  Gorai.007G283800.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM8268              28           38
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G28300.1  1e-106   Trihelix family protein