PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_20237_BGI-A2_v1.0
Common NameF383_13519
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 353aa    MW: 39195.8 Da    PI: 10.2099
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_20237_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix51.52.6e-1647130186
                    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstc 79 
                                 +W++  v  L+ea+++++   +r+klk+++We+v++++      ++  ++++qCk+k+e+++kry+ + +++ +      s++
  Cotton_A_20237_BGI-A2_v1.0  47 EWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARYVsaraNCTKSPKTQTQCKNKIESMKKRYRSESATADG------SSW 123
                                 5*************************************844444455556679****************99997......469 PP

                    trihelix  80 pyfdqle 86 
                                 p++ +l+
  Cotton_A_20237_BGI-A2_v1.0 124 PLYPRLD 130
                                 9999986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138377.0E-2045130No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 353 aa     Download sequence    Send to blast
MEKETNQENP SLLSNNNISI TKEDSSPKKH PGNTAAAGGG DRLKRDEWSE GAVSSLLEAY  60
ENKWVLRNRA KLKGHDWEDV ARYVSARANC TKSPKTQTQC KNKIESMKKR YRSESATADG  120
SSWPLYPRLD LLLRGSTAPP PPPLLPPQLQ PSAVPQAATP ISTNPPLMTL PEPSMMVVLQ  180
QQQQHPPPPP PPHLAPQLPG TTQNSHGSNG IDRIPKEDGA GTKLSGHLSD KIAMETDSST  240
PALYSDKERP RSKKAKMKIE TMATMMKKKK KRRKEECEVG ESIQWLAQVV LKSEQARMET  300
MKEIEKMRVE AEAKRGEMDL KRTEIIAKTQ LEIARLFAGS NKGVDSSLRI GRN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1267272KKKKRR
2267273KKKKRRK
3268272KKKRR
4268273KKKRRK
5269273KKRRK
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHQ5274986e-94HQ527498.1 Gossypium herbaceum clone NBRI_C_EYT27PB01AP1QB simple sequence repeat marker, mRNA sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017612301.10.0PREDICTED: trihelix transcription factor ASIL1-like
TrEMBLA0A0B0MA230.0A0A0B0MA23_GOSAR; Uncharacterized protein
STRINGGorai.012G018400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM98592533
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G54390.11e-100sequence-specific DNA binding transcription factors