PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.17G208500.1.p
Common NameGLYMA_17G208500
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family CPP
Protein Properties Length: 905aa    MW: 98476.3 Da    PI: 6.0831
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.17G208500.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.32e-15480518341
                  TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                          +k+CnCkkskClk+YC+Cfaag++C++ C C++C N+ e
  Glyma.17G208500.1.p 480 CKRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPE 518
                          89**********************************876 PP

2TCR49.48.8e-16565604140
                  TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                          ++k+gCnCk+s ClkkYCeC++a++ Cs+ C+Ce+CkN +
  Glyma.17G208500.1.p 565 RHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGCKNVH 604
                          589***********************************75 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011147.9E-16478519IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163435.276479605IPR005172CRC domain
PfamPF036381.3E-11481516IPR005172CRC domain
SMARTSM011142.2E-17565606IPR033467Tesmin/TSO1-like CXC domain
PfamPF036381.7E-11567603IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 905 aa     Download sequence    Send to blast
MMDSPEPSKN NNGSSSSAST LNNNNNNDDA PSSESPQVQE SPFLRFVKTL SPIPTKASHM  60
TQGCVGLSSP PLVFKSPRIS HRETQLTKRP QGTQSFGGVI PQSVNEGNRL GEAPGDSRTS  120
NSHQSLPERF INDTQQVFDF KNDENTQYYS SPSCIDKYLV DPGDIDQMYS ADQDVQQQST  180
DAAETSLSDQ THSKNNILNF DRKDGPGDKV EESLPLSEDF NKVHLEKAAY GEEPEKMEGE  240
KNDVEWSSQE PAKLESILAA DGFDKRYSHG PLPQWMPNPL QDVKGCEDYN EMVPTSHVTA  300
ENILQDGSEA TLKHHGIRRR CLQFGEAASN ALGRNVKLNA ASNTMITVKP SELVTSLCPR  360
RGSGNFPSTS PKPSGIGLHL NSIINAIPID QAATTGVRLS DSSQGMKSTS SIRLQRMENV  420
KRSILSSNVD GRSLVDTRTE SHEIDDTVAT DTGNSEDLNQ PPSPCKKKKK TSVTADDNGC  480
KRCNCKKSKC LKLYCDCFAA GTYCTDPCAC QGCLNRPEYV ETVVETKQQI ESRNPIAFAP  540
KIVQPTTDIS SHMDDENLTT PSSARHKRGC NCKRSMCLKK YCECYQANVG CSSGCRCEGC  600
KNVHGKKEDY VAFGHTSSKE RVSSIVEEGS DCTFHNKLEM VASKTVYDLH CLSPITPSLQ  660
CSDQGKEDAK SRVISGNYLP SPESDVNMLA SCTNYTKSSE NLHGSEALLD TNEMLGNTPY  720
DSQIECSDAA LLQLTPLPNP EQSGTSSFSS VPNECAKITH SRLSHGCIRQ LPGGSLRWRS  780
SPLTPSTRVG EAQYLQCSES DSKLFDILEN ETPDILKEAS TPMTSVKVNS PTQKRVSPPQ  840
SCHIGIGSSS SGGLRSGRKF ILKAVPTFPS LSPCINSKSN GDEDSCNSPS KSPLKANECP  900
QRES*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A3e-1948360314121Protein lin-54 homolog
5fd3_B3e-1948360314121Protein lin-54 homolog
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.84700.0cotyledon| hypocotyl| meristem| seed coat| somatic embryo| stem
Expression -- Microarray ? help Back to Top
Source ID E-value
GEO42181860.0
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.17G208500.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAJ0101650.0AJ010165.1 Glycine max mRNA for cysteine-rich polycomb-like protein (gpp1 gene).
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_025982093.10.0cysteine-rich polycomb-like protein isoform X1
RefseqXP_025982094.10.0cysteine-rich polycomb-like protein isoform X1
TrEMBLA0A0R0FP020.0A0A0R0FP02_SOYBN; Uncharacterized protein
TrEMBLA0A445G9H30.0A0A445G9H3_GLYSO; Protein tesmin/TSO1-like CXC 3 isoform A
STRINGGLYMA17G31399.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF85652841
Representative plantOGRP9931755
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.16e-57Tesmin/TSO1-like CXC domain-containing protein