PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID TEY88799.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Lamiales; Lamiaceae; Nepetoideae; Mentheae; Salvia; Calosphace; core Calosphace
Family CPP
Protein Properties Length: 653 aa    MW: 71709 Da    pI: 6.5563
Description CPP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
TEY88799.1     genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     50.7   3.5e-16  400    438  2          40
         TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                 ++k+CnCkkskClk+YCeCfaag++C e C C dC+Nk 
  TEY88799.1 400 SCKRCNCKKSKCLKLYCECFAAGVYCVEPCACIDCFNKP 438
                 689**********************************96 PP

2    TCR     51.4   2.1e-16  486    524  1          39
         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                 ++k+gCnCkks ClkkYCeC++ g+ Cs +C+Ce+CkN 
  TEY88799.1 486 RHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNA 524
                 589***********************************7 PP

Sequence
Protein Sequence    Length: 653 aa
MMEEGCKNSI GEGEIMETPE RSKNSRIASS IAKFEDSPVF NFLNNLSPIK PVHFSQTINP  60
LSFASLPSVF SSPQLASLRE SRLLRRHSLS DPSKPEFSSD PSTRASVNLD EQRENLDPNF  120
KNDGEPSRLA NCESSIEKSN EDVLGTCAAL VEGQRSVNMS EANVDVLGNA NVNSWDSLIN  180
DASNLLAFES PNAEAYNKPV DPATAFYRSI RNAIQNVCGE RGNAAEDMPC QPDGEIENEN  240
LSGICRGMRR RCLVYEMAGG RRKRVEDGSA DTPLLLTTSG SASSDKQTAR TNTENDCYRI  300
LPGIGLHLNA LAATPKDFKI FNHESSASGR LLIGPSSSAN FRQDLLRNYS LERENDALEN  360
DGFVMEDHSW ESGHVANEDI NPSSPKKRRR RSEQGGDGES CKRCNCKKSK CLKLYCECFA  420
AGVYCVEPCA CIDCFNKPIH EDVVLATRKQ IESRNPLAFA PKVIRTSDSP SEIGGDDLTN  480
TPASSRHKRG CNCKKSGCLK KYCECYQGGV GCSINCRCEG CKNAFGRKDG SCASAAEHEH  540
EEDETDSISL YNARIHDDLD SQQSHRSFSK KRPPRAPFPP IGSYAFPPPT NMRPRCEEED  600
EMPHFLVGGG SPIKTSSPNK KRVSPPQTLP GLRSSRKLIL QSIPSFPSLA PSQ
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    249    264  RRRCLVYEMAGGRRKR
2    386    391  KKRRRR
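
The Start and End columns in this record appear to be 1-based, inclusive positions in the protein sequence. The following is a minimal Python sketch, not part of PlantTFDB itself, that checks this reading by slicing the sequence at the reported NLS and TCR domain coordinates. The names `RAW`, `SEQ`, `clean` and `region` are hypothetical helpers introduced only for illustration.

```python
import re

# Protein sequence as displayed in this record; the grouping spaces and
# the running position counters are stripped by clean() below.
RAW = """
MMEEGCKNSI GEGEIMETPE RSKNSRIASS IAKFEDSPVF NFLNNLSPIK PVHFSQTINP  60
LSFASLPSVF SSPQLASLRE SRLLRRHSLS DPSKPEFSSD PSTRASVNLD EQRENLDPNF  120
KNDGEPSRLA NCESSIEKSN EDVLGTCAAL VEGQRSVNMS EANVDVLGNA NVNSWDSLIN  180
DASNLLAFES PNAEAYNKPV DPATAFYRSI RNAIQNVCGE RGNAAEDMPC QPDGEIENEN  240
LSGICRGMRR RCLVYEMAGG RRKRVEDGSA DTPLLLTTSG SASSDKQTAR TNTENDCYRI  300
LPGIGLHLNA LAATPKDFKI FNHESSASGR LLIGPSSSAN FRQDLLRNYS LERENDALEN  360
DGFVMEDHSW ESGHVANEDI NPSSPKKRRR RSEQGGDGES CKRCNCKKSK CLKLYCECFA  420
AGVYCVEPCA CIDCFNKPIH EDVVLATRKQ IESRNPLAFA PKVIRTSDSP SEIGGDDLTN  480
TPASSRHKRG CNCKKSGCLK KYCECYQGGV GCSINCRCEG CKNAFGRKDG SCASAAEHEH  540
EEDETDSISL YNARIHDDLD SQQSHRSFSK KRPPRAPFPP IGSYAFPPPT NMRPRCEEED  600
EMPHFLVGGG SPIKTSSPNK KRVSPPQTLP GLRSSRKLIL QSIPSFPSLA PSQ
"""


def clean(raw: str) -> str:
    """Drop everything except uppercase residue letters."""
    return re.sub(r"[^A-Z]", "", raw)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


SEQ = clean(RAW)

print(len(SEQ))               # 653
print(region(SEQ, 249, 264))  # RRRCLVYEMAGGRRKR (NLS 1)
print(region(SEQ, 386, 391))  # KKRRRR (NLS 2)
print(region(SEQ, 400, 438))  # first TCR domain hit
print(region(SEQ, 486, 524))  # second TCR domain hit
```

Under that assumption, the slices reproduce the two NLS sequences and the query lines of the two TCR domain alignments shown in this record.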
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22780.1  1e-101   CPP family protein