PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A11G2223
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties  Length: 595 aa    MW: 64986.1 Da    pI: 5.9629
Description CPP family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_A11G2223    genome  NAU-NBI  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     45.8   1.2e-14  466    502  4          41
          TCR   4 kgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                  k+CnC +s+C+k+YCeCfaag +C++ C C++C N+ +
  Gh_A11G2223 466 KHCNCIRSRCVKLYCECFAAGIYCDN-CACQNCLNNPD 502
                  89***********************9.********975 PP

2    TCR     45.3   1.7e-14  550    588  2          40
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  +++gC+Ck++kClkkYC+C+ a++ Cs  C+CedC N+ 
  Gh_A11G2223 550 HRRGCKCKRTKCLKKYCDCYRAEVGCSAICNCEDCDNSF 588
                  899**********************************85 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   6.1E-15  463    503  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   28.825   464    589  IPR005172    CRC domain
Pfam             PF03638   2.5E-11  466    500  IPR005172    CRC domain
SMART            SM01114   8.0E-14  549    590  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   8.4E-11  552    587  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 595 aa
MESSRNAIEA SSPATSPLEK ETTPATSPPE KVTTAATSPP EKGTTPATSP PENETTPATS  60
PPEKETTPVT FPPEKVTTSS TSPLEKVTTP ATATSPPEKV TTPATATSPP AKETTPATSP  120
PAKETTPATS PLEKVTTSEK KSPVSEFLNR LSPIMPSRAR ASLRNQRLSE FSFTSISSLV  180
NSPPLNAHQR PICLSFVERD EIGGSSSNGH NQDDSMIKNG EYWAPHGETR GACFNTSEVT  240
TEKTKDNAND GVVEISGDER WRRMLDFSNG HNQGDSMIKN GEYWAPHDET RGACFNASEV  300
TTEKTKDNAN DGVVEISGDE RWCRMLDFSS KNIPEHVESD EFLEHVSQGF DSQPTFLGGN  360
QGMSIPLPSY ILPLNHHPFI SPAGGVVPLN HHPFISPAGR VSDHVDSDHQ GGRVPEKMED  420
SQEMSFLHLL RSKQFADRVD QQSKSQVSSK GERNKETFPV KIPAWKHCNC IRSRCVKLYC  480
ECFAAGIYCD NCACQNCLNN PDYDDTVLDT RQQIELRDPL AFASPVVFPS NDSPNVTGHG  540
NLMNAGSPRH RRGCKCKRTK CLKKYCDCYR AEVGCSAICN CEDCDNSFGK KPGNF
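
The sequence block above carries position counters and block spacing, so it has to be cleaned before the coordinates in the Signature Domain table can be applied to it. The following is a minimal sketch in Python, assuming the table uses 1-based, inclusive coordinates; the names `RAW_BLOCK`, `clean_sequence`, and `region` are hypothetical helpers introduced here for illustration and are not part of PlantTFDB.

```python
import re

# Sequence block copied verbatim from this record, including the
# position counters at the end of each full line.
RAW_BLOCK = """
MESSRNAIEA SSPATSPLEK ETTPATSPPE KVTTAATSPP EKGTTPATSP PENETTPATS  60
PPEKETTPVT FPPEKVTTSS TSPLEKVTTP ATATSPPEKV TTPATATSPP AKETTPATSP  120
PAKETTPATS PLEKVTTSEK KSPVSEFLNR LSPIMPSRAR ASLRNQRLSE FSFTSISSLV  180
NSPPLNAHQR PICLSFVERD EIGGSSSNGH NQDDSMIKNG EYWAPHGETR GACFNTSEVT  240
TEKTKDNAND GVVEISGDER WRRMLDFSNG HNQGDSMIKN GEYWAPHDET RGACFNASEV  300
TTEKTKDNAN DGVVEISGDE RWCRMLDFSS KNIPEHVESD EFLEHVSQGF DSQPTFLGGN  360
QGMSIPLPSY ILPLNHHPFI SPAGGVVPLN HHPFISPAGR VSDHVDSDHQ GGRVPEKMED  420
SQEMSFLHLL RSKQFADRVD QQSKSQVSSK GERNKETFPV KIPAWKHCNC IRSRCVKLYC  480
ECFAAGIYCD NCACQNCLNN PDYDDTVLDT RQQIELRDPL AFASPVVFPS NDSPNVTGHG  540
NLMNAGSPRH RRGCKCKRTK CLKKYCDCYR AEVGCSAICN CEDCDNSFGK KPGNF
"""


def clean_sequence(block: str) -> str:
    """Drop position counters and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW_BLOCK)
TCR_1 = region(SEQ, 466, 502)  # first TCR hit in the Signature Domain table
TCR_2 = region(SEQ, 550, 588)  # second TCR hit

print(len(SEQ))
print(TCR_1)
print(TCR_2)
```

Under these assumptions the cleaned sequence has 595 residues, matching the stated length, and the two slices reproduce the Gh_A11G2223 rows of the domain alignments once the single gap character in the first alignment is removed.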
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  5e-12    468          586        14         120      Protein lin-54 homolog
5fd3_B  5e-12    468          586        14         120      Protein lin-54 homolog
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017629828.1  0.0      PREDICTED: uncharacterized protein LOC108472783
TrEMBL  A0A2P5XD42      0.0      A0A2P5XD42_GOSBA; Uncharacterized protein
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM15696813
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  7e-43    TESMIN/TSO1-like CXC 2