PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Lus10039358
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Linaceae; Linum
Family CPP
Protein Properties Length: 1177aa    MW: 128490 Da    PI: 5.6625
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Lus10039358genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.25.2e-16427465240
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Lus10039358 427 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 465
                  689**********************************96 PP

2TCR31.92.6e-108769021440
          TCR  14 lkkYCeCfaagkkCseeCkCedCkNke 40 
                  +k YCeCfaag++C e C+C+dC+Nk 
  Lus10039358 876 KKSYCECFAAGVYCIEPCSCQDCFNKP 902
                  599**********************96 PP

3TCR49.87e-16950988139
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  Lus10039358 950 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKNA 988
                  589***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.5E-18426467IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163413.5427500IPR005172CRC domain
PfamPF036381.7E-11429464IPR005172CRC domain
PROSITE profilePS5163431.482867990IPR005172CRC domain
SMARTSM011147.7E-5875904IPR033467Tesmin/TSO1-like CXC domain
PfamPF036384.2E-6877901IPR005172CRC domain
SMARTSM011142.9E-17950991IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.2E-11952988IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 1177 aa     Download sequence    Send to blast
MDTPQKPKAT PPALSKFEES PVFSYINSLS PIKPVKSLNI TQTFHSLSFS TLPSIFSSPH  60
ISSHHKETRF FKRHSNCPDP SRPEHSSEKK QDGGEEIEPQ ENFNQVVFIG DASVDLPKSS  120
KYDCGSPAAQ QNPQQHGAST DCMSEAVAAS EPVDQFTGEA SHKCQPQGLG NYHTQQTKDG  180
AECSWDSLID AADLLLFNSP TITEAFKAAH KHTMDRASGI STSLIEANDP TDSLYQTEIE  240
GPSTQPGDTL LPKEADSDSC MPSNSEKVHN EAASNLYRGL RRRCLDFEMV GMHRKNFTYE  300
QMQAKENNAY EDEQLVPFNS SSDSSRCIVP GMGLHLNSLA RSLNTRGSTN ETLSSGISLD  360
PFNSPIIGQE LLESLNIGTS DGNLDPSENN VKLAEDISPA SAYLVNEEFG LGSPKKKRLK  420
LEGEAESCKR CNCKKSKCLK LYCECFAAGV YCIEPCSCQD CFNKPIHEDT VLSTPEQPLL  480
QLFAMDTPQK PKATPPALSK FEEFQGLLRG LPIQCFQFHS PHPPETRFFK RHSNCPDPSR  540
PEHSSEKKQD GGKEIAVEPQ ENFNQVVFIG DASVDLPKSS KYDCGSPEAQ QNPQQHGAST  600
DCMSEAVAAS EPVDQFTGEA SHKCQPQGLG NYHTQQTKDG AECSWDSLID AADLLLFNSP  660
TITEAFKAAH KHTIDRASGI ATSLIEANDP TDSLNQTEMG GPSTQPSDTL LPKEADSDSC  720
MPSNSDKVHN EAASNLYRGL RRRCLDFEMV GMHRKNFTYE QMQAKENNAY EDEQLVPFNS  780
SSDSSRCIVP GMGLHLNSLA RSLNTRGSTN ETLSSGISLD PFNSPIIGQE LLESLNIGTS  840
DGNLDPSENN VKLAEDISPA SAYLVNEEFG LGSPKKKSYC ECFAAGVYCI EPCSCQDCFN  900
KPIHEDTVLS TRKQIESRNP LAFAPKVIRT SEPGGAETGD EPMKTPASAR HKRGCNCKKS  960
SCLKKYCECY QGGVGCSINC RCEGCKNAFG RKDGSTLPET EAEQEEDTEA NEKNRLETSV  1020
QQIDIHNSEE EQNHNHGLPA TPLQISRPLV QLPFSSSSAI KPPRSLLGIS STGLFPGHKY  1080
GKPSIIRAQS KLEKRPFQSS GEEDMPEILR GGGSFSRGMG IKVSSPNSKR VSPPHQNNNK  1140
NNKMGSSTTS PGRKSGRKLI LHSIPSFPSL TPPPLK*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A2e-1190498749120Protein lin-54 homolog
5fd3_B2e-1190498749120Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1413420PKKKRLKL
Cis-element ? help Back to Top
SourceLink
PlantRegMapLus10039358
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012069685.10.0protein tesmin/TSO1-like CXC 2 isoform X2
TrEMBLA0A067KVT50.0A0A067KVT5_JATCU; Uncharacterized protein
STRINGLus100393580.0(Linum usitatissimum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF26193375
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-113TESMIN/TSO1-like CXC 2