PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_15314_BGI-A2_v1.0
Common Name F383_01517
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 492aa    MW: 55992 Da    PI: 9.0462
Description Trihelix family protein
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_15314_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  91.5   8.8e-29  59     143  1          87
                    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83 
                                 rW++qe+laL+++r++m+ ++r+++ k+plWeevs+k++e g++rs+k+Ckek+en+ k++k++k+g++++   + +t+++ d
  Cotton_A_15314_BGI-A2_v1.0  59 RWPRQETLALLKIRSDMDLTFREASVKGPLWEEVSRKLAELGYHRSAKKCKEKFENVYKYHKRTKDGRSGK--ADGKTYRFCD 139
                                 8********************************************************************96..56668***** PP

                    trihelix  84 qlea 87 
                                 qlea
  Cotton_A_15314_BGI-A2_v1.0 140 QLEA 143
                                 *985 PP

2    trihelix  100.2  1.7e-31  316    400  1          86
                    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83 
                                 rW+k e+ aLi++r+++++++++++ k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++k+r + +s+tcpyf+
  Cotton_A_15314_BGI-A2_v1.0 316 RWPKVEIEALIKIRTSLDSKYQDNSPKGPLWEEISNEMKKLGYNRNAKRCKEKWENINKYFKKVKESNKRR-PVDSKTCPYFH 397
                                 8********************************************************************97.9999******* PP

                    trihelix  84 qle 86 
                                  l+
  Cotton_A_15314_BGI-A2_v1.0 398 LLD 400
                                 986 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   0.0015    56     118  IPR001005    SANT/Myb domain
Pfam             PF13837   1.1E-18   58     143  No hit       No description
PROSITE profile  PS50090   6.911     58     116  IPR017877    Myb-like domain
CDD              cd12203   4.60E-23  58     123  No hit       No description
SMART            SM00717   0.0011    313    375  IPR001005    SANT/Myb domain
Pfam             PF13837   9.1E-22   315    400  No hit       No description
CDD              cd12203   3.04E-26  315    380  No hit       No description
PROSITE profile  PS50090   7.131     315    373  IPR017877    Myb-like domain
Sequence
Protein Sequence    Length: 492 aa
MLGGGGTTAS VSSVGGRNGR NEAAAAVAVF DTNDGNNNNT GEDDRSKVDE GDRSFGGNRW  60
PRQETLALLK IRSDMDLTFR EASVKGPLWE EVSRKLAELG YHRSAKKCKE KFENVYKYHK  120
RTKDGRSGKA DGKTYRFCDQ LEAFQNQPSI QWPPRPPMTA AATINQSISA VQMSNSISSS  180
TSSDLELQGR KKRKRKWMDF FERLMKEVIQ KQQVMQKTFL EAIEKHERER IVRDEAWKVQ  240
EMSRLNTERE ILAQERSIAA AKDAAIMAFL QKLSEKQNLG QSQNGPLPPP AVVPAAVAPP  300
PDNGNQIQTH TSSSSRWPKV EIEALIKIRT SLDSKYQDNS PKGPLWEEIS NEMKKLGYNR  360
NAKRCKEKWE NINKYFKKVK ESNKRRPVDS KTCPYFHLLD VLYREKNKHD CSSKSNPLMV  420
RPEKQWPPPL EPHQQHHDTI MEDMMESDQN DDEEEDEGGS YELVASKPVS MGTAEGCQKE  480
TTEISFAHMI VR
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    189    194  RKKRKR
2    189    195  RKKRKRK
3    190    195  KKRKRK
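The coordinates in this record can be cross-checked against the protein sequence. The following is a minimal Python sketch, not part of PlantTFDB itself: the helper names `clean_sequence` and `find_motif` are hypothetical, and the sequence block is copied from the Sequence section above. It strips the grouping spaces and running counters, locates the three NLS motifs, and slices out the two trihelix domains using the Start and End values from the Signature Domain table.

```python
import re
from typing import Tuple

# Sequence block copied from the record (10-residue groups with running counters).
RAW = """
MLGGGGTTAS VSSVGGRNGR NEAAAAVAVF DTNDGNNNNT GEDDRSKVDE GDRSFGGNRW  60
PRQETLALLK IRSDMDLTFR EASVKGPLWE EVSRKLAELG YHRSAKKCKE KFENVYKYHK  120
RTKDGRSGKA DGKTYRFCDQ LEAFQNQPSI QWPPRPPMTA AATINQSISA VQMSNSISSS  180
TSSDLELQGR KKRKRKWMDF FERLMKEVIQ KQQVMQKTFL EAIEKHERER IVRDEAWKVQ  240
EMSRLNTERE ILAQERSIAA AKDAAIMAFL QKLSEKQNLG QSQNGPLPPP AVVPAAVAPP  300
PDNGNQIQTH TSSSSRWPKV EIEALIKIRT SLDSKYQDNS PKGPLWEEIS NEMKKLGYNR  360
NAKRCKEKWE NINKYFKKVK ESNKRRPVDS KTCPYFHLLD VLYREKNKHD CSSKSNPLMV  420
RPEKQWPPPL EPHQQHHDTI MEDMMESDQN DDEEEDEGGS YELVASKPVS MGTAEGCQKE  480
TTEISFAHMI VR
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def find_motif(seq: str, motif: str) -> Tuple[int, int]:
    """Return (start, end) of the first match as zero-based inclusive offsets."""
    start = seq.find(motif)
    if start < 0:
        raise ValueError(f"{motif} not found in sequence")
    return start, start + len(motif) - 1


seq = clean_sequence(RAW)

# Locate each NLS motif listed in the NLS table.
nls_hits = {m: find_motif(seq, m) for m in ("RKKRKR", "RKKRKRK", "KKRKRK")}

# Signature Domain coordinates are 1-based inclusive, so shift the start before slicing.
domain1 = seq[59 - 1:143]
domain2 = seq[316 - 1:400]

print(len(seq))
print(nls_hits)
print(domain1[:12], domain2[:12])
```

For this record, the offsets returned by `find_motif` coincide with the Start and End values in the NLS table, while the domain slices only line up with the alignments when the Signature Domain values are read as 1-based. That suggests the two tables use different numbering conventions; this is an inference from the data, not something the page states.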
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JQ013092  0.0      JQ013092.1 Gossypium hirsutum trihelix transcription factor (GT7) mRNA, complete cds.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017618378.1      0.0      PREDICTED: trihelix transcription factor GT-2-like
Swissprot  Q39117              2e-77    TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     A0A0B0PJ77          0.0      A0A0B0PJ77_GOSAR; Trihelix transcription factor GT-2-like protein
STRING     Gorai.002G231500.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4849              25           53
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  2e-70    Trihelix family protein