PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.005G044800.2
Common Name B456_005G044800
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties Length: 755 aa    MW: 82420.2 Da    pI: 5.2681
Description CPP family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.005G044800.2  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     50.9   3.1e-16  458    496  2          40
                 TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                         ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Gorai.005G044800.2 458 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 496
                         689**********************************96 PP

2    TCR     52.3   1.1e-16  543    581  1          39
                 TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                         ++k+gCnCkks+ClkkYCeCf+ g+ Cs +C+Ce+CkN 
  Gorai.005G044800.2 543 RHKRGCNCKKSNCLKKYCECFQGGVGCSINCRCEGCKNA 581
                         589***********************************7 PP
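
The two hits above each list protein coordinates, profile (HMM) coordinates, and a printed alignment. As a minimal sketch of how these can be cross-checked, the Python below tests that an ungapped hit covers the same number of positions in the protein, in the profile, and in the alignment text. The `DomainHit` class and its method names are hypothetical helpers introduced for illustration; they are not part of PlantTFDB or HMMER.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the Signature Domain table (hypothetical helper type)."""
    start: int      # first matched residue in the protein, 1-based
    end: int        # last matched residue in the protein, inclusive
    hmm_start: int  # first matched position in the TCR profile
    hmm_end: int    # last matched position in the TCR profile
    aligned: str    # protein row of the printed alignment

    def query_span(self) -> int:
        return self.end - self.start + 1

    def hmm_span(self) -> int:
        return self.hmm_end - self.hmm_start + 1

    def is_ungapped_and_consistent(self) -> bool:
        """True when the alignment has no gaps and all three lengths agree."""
        return (
            "-" not in self.aligned
            and len(self.aligned) == self.query_span() == self.hmm_span()
        )


# Values copied from the table rows and alignment blocks above.
HITS = [
    DomainHit(458, 496, 2, 40, "SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP"),
    DomainHit(543, 581, 1, 39, "RHKRGCNCKKSNCLKKYCECFQGGVGCSINCRCEGCKNA"),
]

for hit in HITS:
    print(hit.start, hit.end, hit.query_span(), hit.is_ungapped_and_consistent())
```

A gapped alignment would need the profile row as well, because gaps make the two spans differ; this sketch only covers the ungapped case seen in this entry.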

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.5E-18  457    498  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   37.365   458    583  IPR005172    CRC domain
Pfam             PF03638   1.0E-11  460    495  IPR005172    CRC domain
SMART            SM01114   2.1E-17  543    584  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   4.6E-12  545    581  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 755 aa
MMDTPEKAQI SSSLSKFEDS PVFNYINSLS PIKPVKSIHV TQTFNPLSFA SLPSIFTSPH  60
VSSHKESRFL KSYTDPLKPE SSSADGTKVR TNEEAGADAQ ENFDQGVSRG ETSFEMPNEP  120
SRIAIGLPQT LKYDCGSPDC DATPCVIKTT CVSDTSLAIV PFVQEASEKG LSDGVEIRDT  180
FQVEQKRDTI GSEWESLISD TSDLLIFNSP NDSEAFRGVI QKSLDPGVLI SQFSQDDINE  240
ACQTTVDLDK YKDQTEGAVE MNEMNPVNES FEDASVTNFI SGSLTDYMET RMSAPYSFKP  300
DSNLHRGFRR RCLDFEMLAA RRKNLDGGST TNSSTDNKLV PGKPDSDSPR CIVPGIGLHL  360
NALAIASRDN KNMKLETLSS GTQKLSFPSL NSPRTGGAET AYESLPSAST ERESDAVENG  420
VQLAEDASQA SAYLVNEEFN QNSPKKKRRL EQAGEGESCK RCNCKKSKCL KLYCECFAAG  480
VYCIEPCSCQ DCFNKPIHED TVLATRKQIE SRNPLAFAPK VIRTSDSVPE VRDDLITTPS  540
SARHKRGCNC KKSNCLKKYC ECFQGGVGCS INCRCEGCKN AFGRKDGSAI VETDGEPGEE  600
EMDPSEKNAL DKNFEKPDIL NNEEQNPASA LPTTPLQLCR PLVQLPFSSK SKPPRSFIAI  660
GSSSALYTGQ RYGKPSIIRP QNIIEKHFQT IAEDETPEIL RGNSSPGTGI KTSSPNSKRI  720
SPPQCELGST PGGRSGRKLI LQSIPSFPSL TPKH*
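
The sequence block above is printed in groups of ten residues with a running position count at the end of each full line. A minimal Python sketch for turning it back into a plain string and slicing out the Signature Domain coordinates follows; `parse_sequence_block` and `region` are hypothetical helper names, and the sketch assumes the domain coordinates are 1-based and inclusive.

```python
import re

# Sequence block copied as printed: 10-residue groups, running counts, final "*".
RAW_BLOCK = """
MMDTPEKAQI SSSLSKFEDS PVFNYINSLS PIKPVKSIHV TQTFNPLSFA SLPSIFTSPH  60
VSSHKESRFL KSYTDPLKPE SSSADGTKVR TNEEAGADAQ ENFDQGVSRG ETSFEMPNEP  120
SRIAIGLPQT LKYDCGSPDC DATPCVIKTT CVSDTSLAIV PFVQEASEKG LSDGVEIRDT  180
FQVEQKRDTI GSEWESLISD TSDLLIFNSP NDSEAFRGVI QKSLDPGVLI SQFSQDDINE  240
ACQTTVDLDK YKDQTEGAVE MNEMNPVNES FEDASVTNFI SGSLTDYMET RMSAPYSFKP  300
DSNLHRGFRR RCLDFEMLAA RRKNLDGGST TNSSTDNKLV PGKPDSDSPR CIVPGIGLHL  360
NALAIASRDN KNMKLETLSS GTQKLSFPSL NSPRTGGAET AYESLPSAST ERESDAVENG  420
VQLAEDASQA SAYLVNEEFN QNSPKKKRRL EQAGEGESCK RCNCKKSKCL KLYCECFAAG  480
VYCIEPCSCQ DCFNKPIHED TVLATRKQIE SRNPLAFAPK VIRTSDSVPE VRDDLITTPS  540
SARHKRGCNC KKSNCLKKYC ECFQGGVGCS INCRCEGCKN AFGRKDGSAI VETDGEPGEE  600
EMDPSEKNAL DKNFEKPDIL NNEEQNPASA LPTTPLQLCR PLVQLPFSSK SKPPRSFIAI  660
GSSSALYTGQ RYGKPSIIRP QNIIEKHFQT IAEDETPEIL RGNSSPGTGI KTSSPNSKRI  720
SPPQCELGST PGGRSGRKLI LQSIPSFPSL TPKH*
"""


def parse_sequence_block(text: str) -> str:
    """Remove whitespace and running position counts; keep residues and '*'."""
    return re.sub(r"[\s\d]", "", text)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_sequence_block(RAW_BLOCK)
print(len(seq))               # characters including the trailing '*'
print(len(seq.rstrip("*")))   # residue letters only
print(region(seq, 458, 496))  # first TCR hit
print(region(seq, 543, 581))  # second TCR hit
```

Run on the block as printed, this gives 754 residue letters plus the terminal `*`, which suggests that the stated length of 755 counts the `*` as well. Both slices reproduce the protein rows of the two TCR alignments.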
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  1e-16    462          580        14         120      Protein lin-54 homolog
5fd3_B  1e-16    462          580        14         120      Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    443    449  PKKKRRL
2    444    448  KKKRR
3    444    449  KKKRRL
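
The NLS coordinates do not seem to follow the same convention as the Signature Domain table. The Python sketch below compares the two readings against residues 421-480 of the sequence block (the line whose running count is 480). The function names are hypothetical, and the conclusion is an observation about this entry only, not a documented PlantTFDB rule.

```python
# Residues 421-480, copied from the sequence block with spaces removed.
SEGMENT = "VQLAEDASQASAYLVNEEFNQNSPKKKRRLEQAGEGESCKRCNCKKSKCLKLYCECFAAG"
SEGMENT_FIRST_POS = 421  # 1-based position of SEGMENT[0] in the full protein

# (start, end, sequence) rows from the NLS table.
NLS_ROWS = [(443, 449, "PKKKRRL"), (444, 448, "KKKRR"), (444, 449, "KKKRRL")]


def slice_one_based(start: int, end: int) -> str:
    """Read start/end as 1-based inclusive positions."""
    offset = SEGMENT_FIRST_POS
    return SEGMENT[start - offset:end - offset + 1]


def slice_zero_based(start: int, end: int) -> str:
    """Read start/end as 0-based inclusive indices."""
    offset = SEGMENT_FIRST_POS - 1
    return SEGMENT[start - offset:end - offset + 1]


for start, end, expected in NLS_ROWS:
    print(
        expected,
        slice_one_based(start, end) == expected,
        slice_zero_based(start, end) == expected,
    )
```

For all three rows only the 0-based reading reproduces the listed sequence, so in 1-based terms the signals sit one residue later than the table suggests (for example, PKKKRRL at 444-450).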
Binding Motif
Motif ID  Method  Source                   Motif file
MP00624   PBM     Transfer from PK22848.1  Download
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_012481904.1      0.0      PREDICTED: protein tesmin/TSO1-like CXC 2
TrEMBL  A0A0D2RFH5          0.0      A0A0D2RFH5_GOSRA; Uncharacterized protein
STRING  Gorai.005G044800.1  0.0      (Gossypium raimondii)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-150   TESMIN/TSO1-like CXC 2
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]