PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.001G159500.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family CPP
Protein Properties Length: 748aa    MW: 81124.9 Da    PI: 7.8531
Description CPP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.001G159500.1.p  genome  JGI     View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     43.4   6.9e-14  448    487  2          41
                   TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                           ++++C+CkkskClk+YC Cfaa+++Cse C+C++C+N++ 
  Sobic.001G159500.1.p 448 SCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHM 487
                           689**********************************986 PP

2    TCR     50.2   5.1e-16  533    572  1          40
                   TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                           ++k+gCnC+ks+ClkkYCeCf++g+ Cs +C+Ce+CkN+ 
  Sobic.001G159500.1.p 533 RHKRGCNCRKSSCLKKYCECFQSGVGCSISCRCESCKNSF 572
                           589***********************************85 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   3.7E-16  447    488  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   34.978   448    573  IPR005172    CRC domain
Pfam             PF03638   3.4E-10  450    485  IPR005172    CRC domain
SMART            SM01114   4.9E-15  533    574  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   9.1E-12  535    571  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 748 aa
MDAPESPPAG RPSSPAVALY EHSPIFDFIN SLSPIATPKP LGSTQNVQLK SLNLPPLSSI  60
FTSPQVNQRK ESKSTIRDVK LSQELNPNCQ RNQMGTFSCI ELSGSATLAS ENCISYEDAT  120
NSPSKWLQST PFGSETLGDA KKQDTDGKTN HTEDVEQVKQ SSTYSDQNGL DQVDSSTSGR  180
IVQENELAKQ DRNDLAPCSL NHLITNCGTG NSVISISDLA LEAQQRSWKL RGDNVISSTS  240
VLAVDRNFEN SPREHFVEPF GSYIQSAADD THVYCADAAA GVATNHDQEI LPAVIQNQLV  300
LNDYNFDTFK ANTDGTAISQ QQCGTHRRNL FKDKVGPSNK RVQNNSNIHH ASTCGNNYLK  360
PVKPGIGLHL NTLPLNLSNM PLTINPPLLP EQTSPATVIS SSETALYGSE VCTHVDDYSQ  420
KTMTNADKSD QQSHKKRRRK LQNDDGKSCR HCSCKKSKCL KLYCACFAAK VYCSEFCSCQ  480
GCSNNHMHEE AVSHIRKQTE SRNPLAFAPT VTRKCGSVSE LGDDSNNTPA SARHKRGCNC  540
RKSSCLKKYC ECFQSGVGCS ISCRCESCKN SFGKREGVLL LTTEKLKKGG EAKGTHGKEE  600
KLAFDKHHMI SQSGDLAATE NLLATPSLEP YRSSFLLPST CSKLPPSTVG LSSGLHNPRS  660
PMKSDNLLSP FETRAAPMIL GNDFSDIQEL GLSCTTSVKV VSPNKKRVAP LHIGTALSPM  720
GSSSRKLVLK SIPSFPSLTG DADSEPH*
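The numbered sequence block can be cross-checked against the Signature Domain coordinates. The following is a minimal sketch, not part of the database record: it copies three lines of the block (residues 421 to 600), uses the running residue count at the end of each line to recover 1-based positions, and extracts the two TCR regions. The helper names `parse_numbered_block` and `region` are hypothetical and introduced only for illustration.

```python
# Three lines copied verbatim from the Protein Sequence block (residues 421-600).
BLOCK = """\
KTMTNADKSD QQSHKKRRRK LQNDDGKSCR HCSCKKSKCL KLYCACFAAK VYCSEFCSCQ  480
GCSNNHMHEE AVSHIRKQTE SRNPLAFAPT VTRKCGSVSE LGDDSNNTPA SARHKRGCNC  540
RKSSCLKKYC ECFQSGVGCS ISCRCESCKN SFGKREGVLL LTTEKLKKGG EAKGTHGKEE  600
"""


def parse_numbered_block(block):
    """Map each 1-based residue position to its one-letter code.

    Assumes every line ends with the position of its last residue,
    as in the block above; lines without that trailing count are skipped.
    """
    residues = {}
    for line in block.splitlines():
        tokens = line.split()
        if len(tokens) < 2 or not tokens[-1].isdigit():
            continue
        letters = "".join(tokens[:-1])
        end = int(tokens[-1])
        start = end - len(letters) + 1
        for offset, letter in enumerate(letters):
            residues[start + offset] = letter
    return residues


def region(residues, start, end):
    """Return the residues from start to end (1-based, inclusive)."""
    return "".join(residues[pos] for pos in range(start, end + 1))


residues = parse_numbered_block(BLOCK)
tcr1 = region(residues, 448, 487)  # first TCR domain from the table
tcr2 = region(residues, 533, 572)  # second TCR domain from the table
print(tcr1)
print(tcr2)
```

Both extracted strings should match the query lines shown in the two TCR alignments, which indicates that the domain Start and End values are 1-based and inclusive.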
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  1e-12    452          570        14         120      Protein lin-54 homolog
5fd3_B  1e-12    452          570        14         120      Protein lin-54 homolog
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    434    439  KKRRRK
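The NLS coordinates can be checked against the protein sequence in the same way as the domain coordinates. The sketch below is an illustration only: it takes residues 421 to 480 from the sequence block (spaces removed), locates the motif, and reports its position in both 1-based and zero-based form. The function name `locate` is hypothetical.

```python
# Residues 421-480 copied from the Protein Sequence block, spaces removed.
LINE_START = 421  # 1-based position of the first residue in SEGMENT
SEGMENT = "KTMTNADKSDQQSHKKRRRKLQNDDGKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQ"
MOTIF = "KKRRRK"  # NLS sequence from the table


def locate(segment, motif, line_start):
    """Return (start, end) of the first motif hit, 1-based and inclusive."""
    index = segment.find(motif)
    if index < 0:
        raise ValueError("motif not found in segment")
    start = line_start + index
    return start, start + len(motif) - 1


start, end = locate(SEGMENT, MOTIF, LINE_START)
print("1-based:", start, end)          # 435 440
print("0-based:", start - 1, end - 1)  # 434 439
```

In this sketch the motif falls at 1-based positions 435 to 440, which correspond to zero-based offsets 434 to 439. That suggests the NLS table may use zero-based coordinates while the domain tables are 1-based; this is an inference from the data shown here, not something the page states.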
Cis-element
Source       Link
PlantRegMap  Sobic.001G159500.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021301675.1  0.0      protein tesmin/TSO1-like CXC 3 isoform X1
TrEMBL  C5WRA4          0.0      C5WRA4_SORBI; Uncharacterized protein
STRING  Sb01g013900.1   0.0      (Sorghum bicolor)
Orthologous Group
Lineage               Orthologous Group ID / Taxa Number / Gene Number
Monocots              OGMP147741921
Representative plant  OGRP9931755
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22780.1  1e-63    Tesmin/TSO1-like CXC domain-containing protein