PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.013G121300.1
Common NameB456_013G121300, LOC105783718
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties Length: 671aa    MW: 74188.6 Da    PI: 7.3023
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.013G121300.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.39.7e-16384421340
                 TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                         +k+CnCk+skClk+YC+Cfaag +C e C+C+dC+Nk 
  Gorai.013G121300.1 384 CKRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKP 421
                         79**********************************96 PP

2TCR52.41.1e-16470509140
                 TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                         ++k+gCnCk+s+ClkkYCeCf+ag+ Cs +C+Ce+CkN+ 
  Gorai.013G121300.1 470 RHKRGCNCKRSSCLKKYCECFQAGVGCSLSCRCEGCKNSF 509
                         589***********************************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011142.2E-17382423IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163436.442383510IPR005172CRC domain
PfamPF036384.6E-11385420IPR005172CRC domain
SMARTSM011141.2E-17470511IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.0E-12472508IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 671 aa     Download sequence    Send to blast
MDTPDKTQIT PTSNLCKFED SPVFKYIDSL SPIELAKSRQ TDNGFSSLAF LSPSSLFPSP  60
QITCHRESRF SVKRHHFLEP LNSRVARSGH ESNTSEGASK VDEHLGCLNN DSSYKETSSD  120
QVDEQPNLAT DLPRTLKYDC RSPDGDFEPC DEILKKTNVE VAGQERSPFQ CNRDKWEERQ  180
QSFQNERNLR KICGIKRSEE SAGSDWDWRT TNSGTTSFIS NILQFPLDKA NNSENAESGD  240
PSGSCKQSKS GEPVMDQMHG ILSTCLPDEL VVNDSGLKKD DKEGNCNQSN HQQFSIRRRC  300
LVFEKSPGFG LHLNSLPNIP KGQSPLSKST LSSMNRGEVP DDNKGVVTEN SYEMPATFGG  360
NEADHNSPEK KRQKFELVEE SVACKRCNCK RSKCLKLYCD CFAAGLYCIE PCSCQDCFNK  420
PIHENVVLET RRQIESRNPL AFAPKVIRTT NGVSDSMVQG EINKTPASAR HKRGCNCKRS  480
SCLKKYCECF QAGVGCSLSC RCEGCKNSFG RKDGGCESES DGDNLEACEK NASEKSSHDT  540
VISKGQEHPN LSVPSPDISR LPFAYTGKLA AFFPHSIKSS PQLCSTQEQG SSDTSSCKPK  600
LESNLDGIPE NGTPEIPKHK CFTLVSNPTS PNCKRVFSTP NHDSTSSSTK WRSRKLILRS  660
VPSFPSFSPP *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-1938550712120Protein lin-54 homolog
5fd3_B1e-1938550712120Protein lin-54 homolog
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818422e-33LN681842.1 Cucumis melo genomic scaffold, anchoredscaffold00022.
GenBankLN7132592e-33LN713259.1 Cucumis melo genomic chromosome, chr_5.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012464782.10.0PREDICTED: protein tesmin/TSO1-like CXC 2 isoform X5
TrEMBLA0A0D2VE360.0A0A0D2VE36_GOSRA; Uncharacterized protein
STRINGGorai.013G121300.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM17096510
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.17e-86Tesmin/TSO1-like CXC domain-containing protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]