PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa19g032010.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family CPP
Protein Properties Length: 1391aa    MW: 153000 Da    PI: 7.8169
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa19g032010.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.21e-15336374240
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     ++k+CnCkkskClk+YCeCfaag++C e C+C dC+Nk 
  Csa19g032010.1 336 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCMDCFNKP 374
                     689**********************************96 PP

2TCR49.96.2e-16421459139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  Csa19g032010.1 421 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNA 459
                     589***********************************7 PP

3TCR28.82.4e-0910091035228
             TCR    2 ekkgCnCkkskClkkYCeCfaagkkCs 28  
                      ++k+CnCkkskClk+YCeCfaag++C 
  Csa19g032010.1 1009 SCKRCNCKKSKCLKLYCECFAAGVYCX 1035
                      689***********************5 PP

4TCR49.39.8e-1610941132240
             TCR    2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40  
                      ++k+CnCkkskClk+YCeCfaag++C e C+C dC+Nk 
  Csa19g032010.1 1094 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKP 1132
                      689**********************************96 PP

5TCR50.15.6e-1611791217139
             TCR    1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39  
                      ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  Csa19g032010.1 1179 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNV 1217
                      589***********************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011145.8E-19335376IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163437.301336461IPR005172CRC domain
PfamPF036382.8E-11338373IPR005172CRC domain
SMARTSM011143.7E-17421462IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.4E-11423459IPR005172CRC domain
SMARTSM011143.3E-610081041IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS516349.13810091034IPR005172CRC domain
PfamPF036384.2E-610111035IPR005172CRC domain
SMARTSM011142.4E-1910931134IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163437.57810941219IPR005172CRC domain
PfamPF036381.5E-1110961131IPR005172CRC domain
SMARTSM011141.9E-1711791220IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.0E-1111811216IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 1391 aa     Download sequence    Send to blast
MDTPQKSITQ IGTPVSKLKS EDSPVFNYIS NLSPIKTLKS IPIAQTLSSL NFTSPPSVFT  60
SPHAVSHEES RFRSQINKDV VASVGEEVKE EALIGNEPSQ NVKNDFSTPK VSNDVRGGGN  120
GSCEDGGMDL LKMCDNVKKK SDTPDWETLI AATTELIYGS PSESEAFSCL LKRRSNSGAR  180
VRGSGVTMST SEEPVSSTVV ANNETESFDA NSILHRGVRR RCLDFEVPGN NQQTLGESSS  240
SCVVPSIGLH LNAIAMSSKE NNNVPNEYSL SGNVKVGLQS STSLVLHSQQ DAGQAVEEFP  300
KSLALVKMKP ASPKKKRQVC CPETCYKQIN GEGESSCKRC NCKKSKCLKL YCECFAAGVY  360
CIEPCSCMDC FNKPIHEDVV LATRKQIESR NPLAFAPKVI RNSDSIIEVG DDTSKTPASA  420
RHKRGCNCKK SNCLKKYCEC YQGGVGCSIN CRCEGCKNAF GRKDGSLFEQ DEENEVSAKR  480
VTAKTQQNVE LFNPVAPPST PIPYRQPLSQ LAISSNNRLL PPVSHFNHGA SGSSSSGIYN  540
IRKPDMSMVS HSRIETITED IDDDDMHENL LHHSPVTNMK AVVSPNSKRV SLPHLDSSEE  600
TPWRRNGGRK LIHSIPTFDS CWKMDKPQKN PTTSQIGTST PKSKFEDSPV FNYISNLSPI  660
ESVKSISTAQ TFSSLSFTSP PPVFTSPHVN SHRESRFFRC HNSIDRTKPL EDLNGSVYKE  720
DVVVPVIEDL NKEAPLEDED ETSVETSSEL PQILKSDSQT PDRSDSPCTD DVTLEAPSDI  780
PRGEGGSSSE DVKMGMLNVR EVNDTPDCGR LISNATELLV FRSPNDSEAF RCLVDKISSS  840
ERRFCAGVKS TKRPDINKDV PANGSSNETE PSVVLPNESV FSLNRGGMRR RCLDFEMPGK  900
RKKEIADDQQ SMCDNKAAGE SSSGCVVPGI GLHLNAIALS ARDSNINVVH DYSISGEIHK  960
NFSGSTTPIH SQDIVQETSD QAENEPVEEV PRALVFPERV SEQAGEGESC KRCNCKKSKC  1020
LKLYCECFAA GVYCXCVTIR RLVNLPRGVL YLALPLQFTP KTLCKKLRTK QKMNLSKKFP  1080
EHWRVSEQAG EGESCKRCNC KKSKCLKLYC ECFAAGVYCI EPCSCIDCFN KPVHEETVLA  1140
TRKQIESRNP LAFAPKVIRS ADSIMEAGDD ASKTPASARH KRGCNCKKSN CLKKYCECYQ  1200
GGVGCSINCR CEGCKNVFGR KEGSLLVIME SKQEEDQETY EKRRTKIQHK IEVSREVEQN  1260
PSSDQPSTPL PPYRHLVVHQ PFLSKNRLPP TQFFIGMGSS SFRKQDSDLT QSHNGKKPVD  1320
PVTEDKTEIM PEILLNSPIA NIKAISPNSK RVSPPQLGSS ESGSNLRRRG NGRKLILRSI  1380
PAFPSLNPNQ *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A2e-16338121712121Protein lin-54 homolog
5fd3_B2e-16338121712121Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1888902RRRCLDFEMPGKRKK
Functional Description ? help Back to Top
Source Description
UniProtPlays a role in development of both male and female reproductive tissues. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa19g032010.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAF2040590.0AF204059.1 Arabidopsis thaliana CXC domain protein TSO1 (TSO1) mRNA, complete cds.
GenBankAF2063240.0AF206324.1 Arabidopsis thaliana putative DNA binding protein (tso1) mRNA, tso1-3 allele, complete cds.
GenBankAY0460190.0AY046019.1 Arabidopsis thaliana putative DNA binding protein (At3g22780) mRNA, complete cds.
GenBankAY1426420.0AY142642.1 Arabidopsis thaliana putative DNA binding protein (At3g22780) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010488308.10.0PREDICTED: protein tesmin/TSO1-like CXC 3
SwissprotQ8L5480.0TCX3_ARATH; Protein tesmin/TSO1-like CXC 3
TrEMBLM4CCF40.0M4CCF4_BRARP; Uncharacterized protein
STRINGXP_010488308.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM14672891
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22760.10.0Tesmin/TSO1-like CXC domain-containing protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]