PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_14075_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family bHLH
Protein Properties Length: 1245aa    MW: 138057 Da    PI: 9.8344
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_14075_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH48.81.2e-1510521099355
                                  HHHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
                         HLH    3 rahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55  
                                  + hn  ErrRRd+iN+++  L+el+P++      K++Ka++L +A+eY+k Lq
  Cotton_A_14075_BGI-A2_v1.0 1052 EVHNLSERRRRDKINKKMRALKELIPNC-----NKVDKASMLDEAIEYLKTLQ 1099
                                  57*************************8.....6******************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.280.105.8E-2010461106IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamilySSF474594.58E-2010461111IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
CDDcd000835.67E-1610471103No hitNo description
PROSITE profilePS5088818.43810491098IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
PfamPF000103.5E-1310521099IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SMARTSM003531.3E-1710551104IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0007126Biological Processmeiotic nuclear division
GO:0005694Cellular Componentchromosome
GO:0005730Cellular Componentnucleolus
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 1245 aa     Download sequence    Send to blast
MSDYRSLGSN FHSSSQSRKI SIGVILDSLV ERKLGDIKED ECKQSNTERI KPDNGIYAEG  60
KNKGEAATTP KRKQTEHAEQ VKSPWITPRK SLAARTASSN LGQKKHKKAK DVSVTYSVQF  120
FSNKTFNAQN ARSKQNSFDS FIDDLTYKRK ERNDGNSQKV EFNLADAGKV PESDKLVLES  180
KANKTQNKPT ETLKMKLQEL LGNVSSPESQ LSRSQDQEAD ANNLTPQISV DHMGHTVVKP  240
RQSSDTIETD SENPDQIIKR PVTNSLTRKR APAKVQTNKT KVGLSSKQKH RERFVSFREG  300
RSTKLDGAVN TASKLSRKKK IQKKSSKIDS RKICFAEEGN VDEIKQTSYR SETPVPAGKT  360
SVLGNKMENS LSFFSEKRRE NFERVQENHF FSSPVTNKNQ PANFENPTSP EKGDKQEDFG  420
NISLWNVVNT QDNFPSPTFG FRTPILNTSP SPTPKTMERE QVVCSPVPSE RGFITGNIRS  480
FRNFQVSRPV CNKSTAQAHS PVSLTIFQDS DMGIEHVKRY AHSKPSSEER LSESFEDFSP  540
TIKRYNCHTE NPISLDTGVF EKPNFNRCPI KRISECTPTT ASQKVVSSLF FSQGARIGES  600
VRFHEPLEQD QEDELTRAVT LFASALETFK RKMDSTTSKK SSEILVSISE EINSLLLNAQ  660
SEIEYDVGKL TSLNKTKKKR LETRLQEQQE QLKLILQNFK EDIHQLLLDC NSILEGMEAH  720
QIELKGIMKK QKVSHQKLLM DVKAAAEIQL NNAERRITSV HEALSKGKDA PAETCNSRVL  780
DRQRDPDFME LVWENGQVLI RGLSSKVTQK SFPFSSRSTA NGSKDGGVAD STFAHPISGL  840
SSLSKLDRHG VDANIVPVNN SNRLKPSYVP NQLVEDEVPC SSLQQFKGSK EEKDGVNFSI  900
LRSNHASSGA MRTPGLAAGA EDLQGNNVRS APPRSSNIPF GNEGDFMAKR MRPTLPDSEP  960
LKESFPDEQS EAVPNARLAP DDTAFKGNPD QMVASSSLCS RGASNCPTYT LKRRYEDTDL  1020
SKNTTMEEAE GTTKAAPVPR GSKGAKRKRK TEVHNLSERR RRDKINKKMR ALKELIPNCN  1080
KVDKASMLDE AIEYLKTLQL QVQMMSIGNG VYMPPMMVPL PSAMQHINAQ HLGGYSPMAL  1140
GMGMRTQMGL GCGAPPQFPT SLMTGAPAVP GNPEARLIML GFPDQMLLRS MSHSPFLSSA  1200
ASFTPQSFQP PAAVVSQSAA PPAAQVDLLG GANPLSTSKD SYPTH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110471061KRKTEVHNLSERRRR
210571062ERRRRD
310581063RRRRDK
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017613367.10.0PREDICTED: uncharacterized protein LOC108458465 isoform X1
TrEMBLA0A0B0PIT80.0A0A0B0PIT8_GOSAR; Uncharacterized protein
STRINGGorai.001G148200.10.0(Gossypium raimondii)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G09530.23e-40phytochrome interacting factor 3