PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Ro04_G26085
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rubus
Family CPP
Protein Properties Length: 795aa    MW: 87139 Da    PI: 6.066
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Ro04_G26085genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR512.9e-16500538240
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                   +k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Ro04_G26085 500 ACKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 538
                  589**********************************96 PP

2TCR50.63.9e-16586624139
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCkks+ClkkYCeC++ g+ Cs  C+Ce+CkN 
  Ro04_G26085 586 RHKRGCNCKKSNCLKKYCECYQGGVGCSIGCRCEGCKNA 624
                  589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 795 aa     Download sequence    
MDTPERNQIG TPKAKFEDSP VFNYINSLSP IKPVKSIHFT QTFSSLSFAS LPSVFTSPHV  60
SSHKESRFLK RHNPSDLSKP EFSPESGNKV SVSEDAAQLY NNSSEIQEDG VPVVPIGEPS  120
VESPSEHSKF VIELPRNLNY DCGSPDCNPT SRCGIIKEDC DSELADFSAS LAPYVQETSD  180
KGSSDDEAHQ QGICQSAQRK EGTGCDWESL ISDAADILIF DSPNSTEAFK ELMQHSLDPM  240
TRFSSSIVPH LLQNNFSDEQ LVQVIDTVGS GQQLEIEDPS SQNGEASEMK ETEQRQNHLN  300
EYMIDNPTVK EDNNAETSMQ LTCKPVINLH RGLRRRCLDF EMTGTRRKSF DNVSNYSSSM  360
VSQSNEKVTT NDKQLVSMKP GGESSRCILP GIGLHLNALA TTSKDYKIIK NENLASGREL  420
NFPSSNASIH SPTAGQGTVY ESFPSASSER DMDGTESGVQ LLQDASQASA LLANEDLNQN  480
SPKKKRHVLR RTEHTGEGEA CKRCNCKKSK CLKLYCECFA AGVYCIEPCS CQDCFNKPIH  540
EDTVLATRKQ IESRNPLAFA PKVIRNSDSA APEYGDESSK TPASARHKRG CNCKKSNCLK  600
KYCECYQGGV GCSIGCRCEG CKNAFGTKEG SIIGTEAELD EEETEATEKS VADRDQPKNE  660
IQKNEEQTSA LPSTPLRLSR QLVPVSFSSK SKPPRSSVFS IGSSSGLYAS QKLGKPNILR  720
PQAKFVQRHN QTVPEDEMPE ILQGDCSPTT GVKTASPNSK RVSPPCTFGS SPGRRSGRKL  780
ILQSIPSFPS LTPQH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1483510KKKRHVLRRTEHTGEGEACKRCNCKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-167CPP family protein