PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_22103_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties Length: 488aa    MW: 55094.4 Da    PI: 5.0091
Description CPP family protein
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_22103_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     43.1   8.1e-14  227    266  2          41
                         TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                                  +k+CnC++s+Clk+YCeCfaag +C++ C Ce+C Nk +
  Cotton_A_22103_BGI-A2_v1.0 227 ACKHCNCRRSRCLKLYCECFAAGIYCEDCCACENCVNKPD 266
                                 589**********************************975 PP

2    TCR     47     5.1e-15  313    352  1          40
                         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                                 ++k+gC+Ck+skClkkYCeC+ a++ Cs+ C+Ce+C N+ 
  Cotton_A_22103_BGI-A2_v1.0 313 RHKRGCKCKRSKCLKKYCECYRAKVGCSDGCHCENCDNSF 352
                                 589***********************************85 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.3E-14  226    267  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   31.717   227    353  IPR005172    CRC domain
Pfam             PF03638   5.5E-10  229    264  IPR005172    CRC domain
SMART            SM01114   1.3E-14  313    354  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   5.9E-11  315    351  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 488 aa
MESYRNDIEA SPTTSPSEQV LTPCEKKSPV SEFLNDLSPI MPSRARASLY KQRLYETSFT  60
SNSSLFNSPH LNIHQRPINL NFLQRWCFFT ENIFRSQLIT LFITKDCFVY ANSDEIGASS  120
SNGCYQDDFM IKHSEYHTQH DEQCGACFNA SEVTIGKTSD NANDGVVEIS DDERLDKMLD  180
ISSKNIREHV ESDEFMEHVG QGLDSKLTFL GGNQAMDRKT SPDETEACKH CNCRRSRCLK  240
LYCECFAAGI YCEDCCACEN CVNKPDYEDI VLDIRHQIEL RNPLAFAPPI VNPSNDSPNV  300
TGDENLMNTP SARHKRGCKC KRSKCLKKYC ECYRAKVGCS DGCHCENCDN SFGKKSESMI  360
QRVEKQQNQS HEMLNTTQVM SDSTLVGITN PVSSIWEKLA DNNHLTISTH PYSRDMRDCQ  420
NVSQVESEKG TSFHCYSSAL SPKQPCQSKE IDDIYEIMAD GFPDYLMETS NPINTVNSGS  480
SLVLYNGE
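
The domain coordinates listed under Signature Domain can be cross-checked against the protein sequence. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive; the helper names `clean_sequence` and `region` are hypothetical and are not part of PlantTFDB.

```python
import re

# Sequence block copied from this record: 10-residue groups with running counts.
RAW = """
MESYRNDIEA SPTTSPSEQV LTPCEKKSPV SEFLNDLSPI MPSRARASLY KQRLYETSFT  60
SNSSLFNSPH LNIHQRPINL NFLQRWCFFT ENIFRSQLIT LFITKDCFVY ANSDEIGASS  120
SNGCYQDDFM IKHSEYHTQH DEQCGACFNA SEVTIGKTSD NANDGVVEIS DDERLDKMLD  180
ISSKNIREHV ESDEFMEHVG QGLDSKLTFL GGNQAMDRKT SPDETEACKH CNCRRSRCLK  240
LYCECFAAGI YCEDCCACEN CVNKPDYEDI VLDIRHQIEL RNPLAFAPPI VNPSNDSPNV  300
TGDENLMNTP SARHKRGCKC KRSKCLKKYC ECYRAKVGCS DGCHCENCDN SFGKKSESMI  360
QRVEKQQNQS HEMLNTTQVM SDSTLVGITN PVSSIWEKLA DNNHLTISTH PYSRDMRDCQ  420
NVSQVESEKG TSFHCYSSAL SPKQPCQSKE IDDIYEIMAD GFPDYLMETS NPINTVNSGS  480
SLVLYNGE
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and running position counts, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
print("length:", len(seq))

# Start/End pairs taken from the two TCR rows of the Signature Domain table.
for start, end in [(227, 266), (313, 352)]:
    print(start, end, region(seq, start, end))
```

With the sequence above, the cleaned length is 488 aa, and the two slices reproduce the query rows of the two TCR alignments.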
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  3e-13    229          350        12         120      Protein lin-54 homolog
5fd3_B  3e-13    229          350        12         120      Protein lin-54 homolog
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_012483597.1  0.0      PREDICTED: uncharacterized protein LOC105798177 isoform X1
Refseq  XP_012483599.1  0.0      PREDICTED: uncharacterized protein LOC105798177 isoform X2
Refseq  XP_012483600.1  0.0      PREDICTED: protein tesmin/TSO1-like CXC 7 isoform X3
TrEMBL  A0A2P5YC79      0.0      A0A2P5YC79_GOSBA; Uncharacterized protein
STRING  EOY26555        4e-80    (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM15696813
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-53    TESMIN/TSO1-like CXC 2