PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_24495_BGI-A2_v1.0
Common Name F383_35822
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 381 aa    MW: 41831.4 Da    pI: 10.0345
Description Trihelix family protein
Gene Model
Gene Model ID                 Type      Source
Cotton_A_24495_BGI-A2_v1.0    genome    BGI
Signature Domain
No.   Domain     Score   E-value   Start   End   HMM Start   HMM End
1     trihelix   51.1    3.4e-16   50      133   1           86
                    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstc 79 
                                 +W++  v  L+ea+++++   +r+klk+++We+v++++      ++  ++++qCk+k+e+++kry+ + +++++      s++
  Cotton_A_24495_BGI-A2_v1.0  50 EWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARYVsaraNCTKSPKTQTQCKNKIESMKKRYRSESATAEG------SSW 126
                                 5*************************************844444455556679****************99997......469 PP

                    trihelix  80 pyfdqle 86 
                                 p++ +l+
  Cotton_A_24495_BGI-A2_v1.0 127 PLYPRLD 133
                                 9999986 PP

Protein Features
3D Structure
Database   Entry ID   E-value   Start   End   InterPro ID   Description
Pfam       PF13837    1.3E-19   48      133   No hit        No description
Sequence
Protein Sequence    Length: 381 aa
MDKETNNQEN PSLLSNNNTN KEDCSPKKHP GSSTVTGGGG GSNDRLKRDE WSEGAVSSLL  60
EAYENKWVLR NRAKLKGHDW EDVARYVSAR ANCTKSPKTQ TQCKNKIESM KKRYRSESAT  120
AEGSSWPLYP RLDLLLRGNA AAAAAAAAPS PPPSQPLPPP SQPLPPPPQQ LHLTAMVQPQ  180
PPGPLFTNLP LTLPEASTLV VLQQQQQQQQ PPPPLPIAAP PPALAPQGLG TAQNSHGSNG  240
FEKIPKDDGA GTKVSDHLSD KVAIETDSST PGLYSDKEKL RSKKLKMKTM ENKKKKRRKK  300
DEYREIGESI RILAEVVLKS EESRMETLRE IEKMRIEAET KRGEMELKRT EIIANTQLEI  360
AKLLAGSSNK GIDPSLRIGR S
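
The sequence block above is printed in groups of ten residues with a running position counter at the end of each full row. As a minimal sketch of how to turn it back into a plain string, the Python below strips the spacing and counters, then slices out residues 50 to 133 (the span reported for the trihelix alignment earlier in this record), reading those coordinates as 1-based and inclusive. The names `RAW_BLOCK`, `parse_sequence_block` and `subsequence` are hypothetical helpers introduced for illustration; they are not part of PlantTFDB.

```python
import re

# Sequence block copied from this record, including the space-separated
# groups of ten residues and the trailing position counters.
RAW_BLOCK = """
MDKETNNQEN PSLLSNNNTN KEDCSPKKHP GSSTVTGGGG GSNDRLKRDE WSEGAVSSLL  60
EAYENKWVLR NRAKLKGHDW EDVARYVSAR ANCTKSPKTQ TQCKNKIESM KKRYRSESAT  120
AEGSSWPLYP RLDLLLRGNA AAAAAAAAPS PPPSQPLPPP SQPLPPPPQQ LHLTAMVQPQ  180
PPGPLFTNLP LTLPEASTLV VLQQQQQQQQ PPPPLPIAAP PPALAPQGLG TAQNSHGSNG  240
FEKIPKDDGA GTKVSDHLSD KVAIETDSST PGLYSDKEKL RSKKLKMKTM ENKKKKRRKK  300
DEYREIGESI RILAEVVLKS EESRMETLRE IEKMRIEAET KRGEMELKRT EIIANTQLEI  360
AKLLAGSSNK GIDPSLRIGR S
"""


def parse_sequence_block(raw: str) -> str:
    """Keep only residue letters, dropping whitespace and position counters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_sequence_block(RAW_BLOCK)
domain = subsequence(protein, 50, 133)

print(len(protein))  # residue count, to compare with the stated length
print(domain)        # residues covered by the trihelix alignment
```

The parsed length can be compared with the 381 aa stated in the record, and the extracted span with the query line of the alignment (which begins `EWSEGAVSSLL` and ends `PLYPRLD`).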
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     292     297   KKKKRR
2     292     299   KKKKRRKK
3     293     297   KKKRR
4     293     298   KKKRRK
5     293     299   KKKRRKK
6     294     298   KKRRK
7     294     299   KKRRKK
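
The NLS entries above can be checked against the protein sequence. The sketch below uses only the sequence row covering residues 241 to 300 and tests two readings of the Start/End columns: 0-based inclusive indices and 1-based inclusive positions. For this record the motifs line up under the 0-based reading, which would put `KKKKRR` at residues 293 to 298 in the 1-based numbering used by the sequence block. This is an observation from this one record, not documented database behaviour, and the names `ROW`, `NLS_ROWS`, `slice_zero_based` and `slice_one_based` are hypothetical.

```python
# Residues 241-300 as printed in the sequence block (one 60-residue row).
ROW = "FEKIPKDDGA GTKVSDHLSD KVAIETDSST PGLYSDKEKL RSKKLKMKTM ENKKKKRRKK"
ROW_FIRST_RESIDUE = 241  # 1-based position of the first residue in ROW

# (Start, End, Sequence) triples copied from the NLS table.
NLS_ROWS = [
    (292, 297, "KKKKRR"),
    (292, 299, "KKKKRRKK"),
    (293, 297, "KKKRR"),
    (293, 298, "KKKRRK"),
    (293, 299, "KKKRRKK"),
    (294, 298, "KKRRK"),
    (294, 299, "KKRRKK"),
]

fragment = ROW.replace(" ", "")


def slice_zero_based(start: int, end: int) -> str:
    """Read start/end as 0-based inclusive indices into the full protein."""
    offset = ROW_FIRST_RESIDUE - 1  # 0-based index of fragment[0]
    return fragment[start - offset:end - offset + 1]


def slice_one_based(start: int, end: int) -> str:
    """Read start/end as 1-based inclusive positions in the full protein."""
    offset = ROW_FIRST_RESIDUE  # 1-based position of fragment[0]
    return fragment[start - offset:end - offset + 1]


for start, end, motif in NLS_ROWS:
    zero_ok = slice_zero_based(start, end) == motif
    one_ok = slice_one_based(start, end) == motif
    print(start, end, motif, "0-based:", zero_ok, "1-based:", one_ok)
```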
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   HQ527503   1e-175    HQ527503.1 Gossypium herbaceum clone NBRI_B_5023 simple sequence repeat marker, mRNA sequence.
Annotation -- Protein
Source   Hit ID               E-value   Description
Refseq   XP_017649844.1       0.0       PREDICTED: trihelix transcription factor ASIL2
TrEMBL   A0A0B0N7C1           0.0       A0A0B0N7C1_GOSAR; Uncharacterized protein
STRING   Gorai.011G216400.1   0.0       (Gossypium raimondii)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM9859               25            33
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT3G54390.1   1e-56     sequence-specific DNA binding transcription factors