PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG66249.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 981aa    MW: 105062 Da    PI: 5.4383
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG66249.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix31.83.8e-10296369268
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                 W+ +e l+L++ +re e +l        + + k  +W++++k+m++ g  + +  C +kw+nl + ykki++ +
  GBG66249.1 296 WSPEEQLQLVRCKREQEMHLAglghnygQMRTKDWKWDDIAKRMASAGKPKDADDCMKKWDNLFQNYKKIQRFQ 369
                 **************77777763333222567999***********************************98755 PP

Sequence ? help Back to Top
Protein Sequence    Length: 981 aa     Download sequence    
MAGMQGRAGV GVGQGGIRPA TAEPPRRYDL SMYAHLQSWE TPLPPSDEEP ETEELPTLPL  60
ASGSTQLWSQ TVRAGGSGCN EGGEYTSLLQ QGLGDDDDGG LDLRFGSCSG GSKEASRTLI  120
IDTDPSPRGV EQGGSQRTDQ TTLCARASVA ASVGPSAVLR RQGSTAPSAD RSTSNWPARN  180
GAAAGSPRVE CARPSMPERT TPTQSDVRDA GACRPPVRPV PTVENITRGV SNMRAHSDGG  240
DDDGVGGDDA DKRLMEDVDA GDDDEDIPVR PLGKTGGRGE HAAGVLDDAG KSVTYWSPEE  300
QLQLVRCKRE QEMHLAGLGH NYGQMRTKDW KWDDIAKRMA SAGKPKDADD CMKKWDNLFQ  360
NYKKIQRFQN ASGRPDFFGL TNEERKEHNF KFRMERALYN EIHAGMLGNH TNFPPNIADT  420
GNPDGRQLPR RGGGGGESVG SEAGGEGFPE ERSSARESEN NAGSGAGGGK RKNARQQVSE  480
SIADVMDRHG ELMSSTIESS SKQQCSIFTR QCDILEQEVA VQKEHYAASN ETQRMMCHAL  540
MEIAAKRLRS DDASTRVPLG GAQGWAAEVG GDYDDDFATG EDQAQATTSG VRESRGQRQS  600
HHSASKRLMT PPPDTKQVRA RDRRTDNADV HDVGDDDDEP LERRHLRSHT AGTPPPAASG  660
HALAGETTVT GRLPAITAAP RPRNTAAEGG SVQRGGGGVV AAHAGAVGVE AAGAAGAVAG  720
VSGSAPPVPA AKEEGAVVAM ARDDARGEKR SDREGGEGGS NRPRRGVLSN DLVDCVVLSV  780
DDKPFWTMGE GRRLYNIVHE TREYFVAIAS GLPPPAIPRS VVMPKSITRI ARIADPSQLQ  840
QATSHASAVE NIALCVLHGW VFKSVNRPRG YNLAFQYALE SVATDIVRAM WYGEEWCNVV  900
SPAVCAHTID LSMDLPLWFA GVNVENRPDD DDMAAYQEST IICVAHAFRK AVQMGAVIDG  960
EFISHDRLCR VADCFRLLLV A