PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Rmu_sc0001299.1_g000020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family CPP
Protein Properties Length: 927 aa    MW: 100947 Da    pI: 6.2144
Description CPP family protein
Gene Model
Gene Model ID              Type    Source  Coding Sequence
Rmu_sc0001299.1_g000020.1  genome  GDR     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     41.4   2.9e-13  677    714  1          38
                        TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkN 38 
                                ++k+gCnCkkskClkkYCeCf+a++ Cs  C+C+ C+N
  Rmu_sc0001299.1_g000020.1 677 RHKRGCNCKKSKCLKKYCECFQANVGCSGACRCDCCQN 714
                                589**********************************9 PP

Sequence
Protein Sequence    Length: 927 aa
MDGSPETNRV AAAAASSPAQ ASSPAVQVTI RFLISKCAVN AVVLESPFSN FLSNLTPINT  60
VKSDSYSQRL VGAFFPTPPV VFTSPHIDLE RETSFLERND IVEAGYKECN TTIFQNPSFE  120
KEVQLCSASG CIDEYLADPA KVDCTGSADI QSSINAGKES NTEVHEVIDG STKSEPLPTL  180
NQAEVGLPLP SLTQIETTII EKVDKNSVDL VTRAQSRGQN TAKESPFSNF LSNLTPINTV  240
KSDSYSQRLV GAFFPTPPVV FTSPHIDLER ETSFLERNDI VEAGYKECNT TIFQNPSFEK  300
EVQLCSASGC IDEYLADPAK VDCTGSADIQ SSINAGKESN TEVHEVIDGS TKSEPLPTLN  360
QAEVGLPLPS LTQIETTIIE KVDKNSVDLV TRAQSRGQNT AKEVSQHHRG IRRHLQFETA  420
VAQKSSNFGN HRSLCSLTHD TDNLRSQSKL TNLKTLASSH LDSKAFSSPQ GVSCDTSQLS  480
SCLCESVRSS QIGGNSSTSA PIRSGIGLHL NSITRSTSMS SDVLLSRKAT GCLSTPDQML  540
EHDNRNNIEH ESSSSFISVV LGQNYSSVGN DQQESQTAAH TTFTFYSTDG VPYLPCDSMH  600
QILVDQHSVP SGVFCLDSCA CENCYNKPEF EDTVFDARQQ IESRNPLAFA PKVVKHEINS  660
SPNIMEEADW TTPSSARHKR GCNCKKSKCL KKYCECFQAN VGCSGACRCD CCQNPCGTKA  720
DHEYNRAEKW ETHPAEKLDT IKGGNDCIKA LDMDPYSPWE GLSVISNLTP LSNPCSSTRV  780
SSASSSMRNR TKISQAQLQS SRLQPSGTGH LKRGRSPVIF TPQVFESKGP SQLSSDGASY  840
ERMDDDIPEL LKEPSIPPKV VKGSPNQKRV SPPQSRAEKL RSRSPKGLRS GRKFILQAMP  900
SFPPLSPYSD SKAINDTRND HKGTQNN
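
The TCR domain coordinates reported for this entry (residues 677 to 714) can be checked against the sequence listing. The following is a minimal sketch, not part of PlantTFDB: the helper names `parse_sequence_block` and `slice_1based` are hypothetical, and it assumes the listing uses space-separated groups of ten residues followed by a running position counter, with 1-based inclusive coordinates. Only the row covering residues 661 to 720 is copied in, so an offset of 660 is passed.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped sequence listing into one string.

    Drops spaces, line breaks, and the trailing position counters,
    keeping only the residue letters.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede `seq` in the full
    protein, so a fragment can be sliced with full-length coordinates.
    """
    return seq[start - 1 - offset : end - offset]


# Row covering residues 661-720, copied from the sequence listing.
row = "SPNIMEEADW TTPSSARHKR GCNCKKSKCL KKYCECFQAN VGCSGACRCD CCQNPCGTKA  720"

fragment = parse_sequence_block(row)
domain = slice_1based(fragment, 677, 714, offset=660)
print(domain)
```

The printed segment should match the query line of the TCR alignment in the Signature Domain section. Passing the whole listing to `parse_sequence_block` with `offset=0` works the same way.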
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22780.1  3e-43    CPP family protein