PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG84583.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family CPP
Protein Properties Length: 1489aa    MW: 148826 Da    PI: 5.9867
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG84583.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR40.65.1e-1310411079240
         TCR    2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40  
                  ++k+CnCkkskClk+YC+Cfaag++C+ +C C++C N  
  GBG84583.1 1041 SCKRCNCKKSKCLKLYCDCFAAGSYCAASCCCQNCLNLP 1079
                  689*********************************975 PP

2TCR47.53.6e-1511711209139
         TCR    1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39  
                  ++k+gCnC++s+ClkkYCeC++a++ Cs +CkC++C N 
  GBG84583.1 1171 RHKRGCNCRRSHCLKKYCECYQAEVPCSVSCKCSGCGNP 1209
                  589*********************************995 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1489 aa     Download sequence    
MVDIKMNLDG GGGLSSRQES SLERDLEGDP EGGDLGGCGA AKSCAAGGED GSHLFGEEDE  60
GAPATRSPAR SNPAARYAPE DSPFFKYACS LSPIKLSKTV HIQTYNELCP RYSPPPTVFK  120
SPCSRFARQW RTSDVEQRGG GGPGAAAGVG AGVGAGGGAG GGNVDGGRLV AGGGMVMTTM  180
GSFTPAIATG SFVPPCATVM PTTTAAEMQI PILVGGGGGG GGGGGVKLAL GAVRSSNGSA  240
MATPSASTVA MAGGGSAAPG NAVKISSTCS SSEIVMKREK CVLLDASHVS VSAVTAATDM  300
RGAEEDACTS VSEGRVGSST GFSGSDVSAT SDERSSSLLR AGGLREAAAG PMCRPLTLAF  360
QPLPAQTAEV AARESDKADV HSVMDEGATA TTAAAIAAAS ADASGGASAA AGIAATVNLP  420
PWFSNNVVLT GLSITKSMKN HDDSSRGFRR RRCLDFEPGA GQKRGALAAG EVVVGERGGE  480
ERAEERGERE GEKGGEREET VEVGGRVAVG GAEGGGEGEE VKVKRECESS AEEPEGDVKF  540
VRPSVVTPGG RSMELGTVEE VSGGSNAAPT AMRRTGEEEM VSTSPLKTAS AMEATSALDA  600
SDAVSTVMVC DKTPPVAQVS VPRVTAGAGG EGCGEGAKRL GRGDGGRGSY LDAGRPPGAL  660
PIASSASAAD AAAHFVTSGD LAQRGGGGGG GAEGAYGRCN EASTVESGSM RIPLSPVGLA  720
NLQQGVNDGS GIRWGGKGGL VDLCNKKVNT FSPAIQSHYR KTVAGEGEGE STTLGSSGCS  780
ESQGEVMVDG GGFQQPCLAS SAVREALSSG AVAGNRVSMQ EKAMSIEKER EERKTTAVQK  840
KTEKKGGANA NVKSSCIKRA AEDRLMGGKR SGGGGGGAAS GGCEELMGGG GGLNRGGGIT  900
WARAPTWGPQ SAIRLVRSDP VSAGELSLAA ENQGSVSGPV QLWDYDFYHH HIQQQQHNHH  960
HQQQQQQLQQ QHQQHRHQQQ QHQQQRRRLL PRVEGPDLQQ IGERSADASE SEGAGRLPIL  1020
KRRKSSGSSG ITGEEGTPSS SCKRCNCKKS KCLKLYCDCF AAGSYCAASC CCQNCLNLPE  1080
YEEIIQDTRA GIEARDPLAF TPKIIMAGGT GGGGGGSAGG ATAAAAALSA PGGGGGGGGG  1140
GGGMADAGRG EGDNPAAAQE EEFVESSASA RHKRGCNCRR SHCLKKYCEC YQAEVPCSVS  1200
CKCSGCGNPF GTKVGVGGKE QGAEEREDAK AAAVGAGLVV AEIETTERSV NGAAVVVASG  1260
RRSAIGGMEF RKQQQQQQHH FVGQSGVLPS PMSLVGESGG GGSAAVAGAI APVGGGGNRI  1320
PILFSPGSFA TSGSAEIGSH SWTSNPHFDF PASSPGGGGL LTPTSALASA QAQPSLLGAG  1380
MSGVGGDPAG CQLAMSATSV DGERRESLSS SISGGLAIAA GGAVVGGEGG GGGGVGEGGQ  1440
GGDTKQIGVG EDGAGSGLCS VPVAPCSVGI SQTVSILPPT VAYQGLLPR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1166174GGRLVAGGG
2867875GGRLVAGGG
310211051KRRKSSGSSGITGEEGTPSSSCKRCNCKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-23CPP family protein