PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Neem_1512_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HD-ZIP
Protein Properties Length: 1300 aa    MW: 147586 Da    pI: 7.1986
Description HD-ZIP family protein
Gene Model
Gene Model ID   Type    Source
Neem_1512_f_1   genome  NGD
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  53.9   3.1e-17  54     119  1          56
                    TT--SS--HHHHHHHHHH..........HHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
       Homeobox   1 rrkRttftkeqleeLeel..........FeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                    ++k +++t++q++eLe++          F+++++p++++r eL+k+lgL+ +q+k+WFqNrR+++k
  Neem_1512_f_1  54 KKKYHRHTPHQIQELEAYgllisfsfnfFKECPHPDEKQRSELSKRLGLESKQIKFWFQNRRTQMK 119
                    78999***********865555555444889********************************999 PP

2    START     160.4  1.4e-50  271    536  2          206
                    HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEE CS
          START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlev 84 
                    la++a++el+k+a+a++p+W ks     + +n++e++++f++  +     + +ea+r+++vv+ ++  l e+l+d + +W e+++    +a+t++ 
  Neem_1512_f_1 271 LALTAMDELIKMAQADAPLWIKSLdgerDVLNREEYMRTFTPCIGmkpngFATEASRETAVVIINSSALIETLMDAN-RWAEMFPcmiaRAATIDM 365
                    6899********************999999***********998899******************************.****************** PP

                    ECTT................................EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EE CS
          START  85 issg................................galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRael 147
                    issg                                    +m+ae+q+lsplvp R + f+R+++q+ +g+w++vdvS+d +++  +   ++ +++
  Neem_1512_f_1 366 ISSGvtgtkngalqvvffsticaftasnwykvmmqtDHTLQMFAEFQVLSPLVPvRQVKFLRFCKQHAEGVWAVVDVSIDTNREGLNANAFASCRR 461
                    ***********************************877889********************************************999******** PP

                    SSEEEEEEEECTCE................EEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
          START 148 lpSgiliepksngh................skvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                    lpSg++i++++n++                 kvtwve+ +++++ +h+l r+l+ sg+ +ga++w atlqrqce+
  Neem_1512_f_1 462 LPSGCVIQDMPNNYcktyflhltcdidslcIKVTWVERSEYDESAVHNLCRPLLTSGIGFGAQRWIATLQRQCEC 536
                    ***************************99559*****************************************96 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End   InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  4.4E-20    40     123   IPR009057    Homeodomain-like
SuperFamily      SSF46689          2.09E-16   41     121   IPR009057    Homeodomain-like
PROSITE profile  PS50071           14.398     51     121   IPR001356    Homeobox domain
SMART            SM00389           9.0E-14    53     125   IPR001356    Homeobox domain
CDD              cd00086           3.27E-16   54     122   No hit       No description
Pfam             PF00046           1.2E-14    54     119   IPR001356    Homeobox domain
PROSITE pattern  PS00027           0          96     119   IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           37.517     261    539   IPR002913    START domain
SuperFamily      SSF55961          1.35E-25   262    371   No hit       No description
CDD              cd08875           3.39E-108  265    535   No hit       No description
SMART            SM00234           2.6E-23    270    536   IPR002913    START domain
Pfam             PF01852           9.2E-42    271    536   IPR002913    START domain
SuperFamily      SSF55961          1.35E-25   405    536   No hit       No description
SuperFamily      SSF55961          2.88E-9    555    676   No hit       No description
Pfam             PF11955           7.3E-111   824    1160  IPR021099    Plant organelle RNA recognition domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1300 aa
MDAHGDMGLL GEHFDPSIVG RIREDGYESR SGSDNVEGAS GDDLETNDDG PPRKKKYHRH  60
TPHQIQELEA YGLLISFSFN FFKECPHPDE KQRSELSKRL GLESKQIKFW FQNRRTQMKT  120
QMERHENVIL RQEHDKLRAE NDMLKEAMKN PICNNCGGPA VPGNVSNYEL QQLRIENARY  180
KDELGRICIL ANKFLGRPLT SSANTMAPQC LDSSLELAVG RNGFGSITNI SGSMMPGIEF  240
VEGPVMSLMK PPITGMMGNE MPYERNMLID LALTAMDELI KMAQADAPLW IKSLDGERDV  300
LNREEYMRTF TPCIGMKPNG FATEASRETA VVIINSSALI ETLMDANRWA EMFPCMIARA  360
ATIDMISSGV TGTKNGALQV VFFSTICAFT ASNWYKVMMQ TDHTLQMFAE FQVLSPLVPV  420
RQVKFLRFCK QHAEGVWAVV DVSIDTNREG LNANAFASCR RLPSGCVIQD MPNNYCKTYF  480
LHLTCDIDSL CIKVTWVERS EYDESAVHNL CRPLLTSGIG FGAQRWIATL QRQCECLAVL  540
VSSTLPGQDH SGINPMGRKS MLKLAQRMTY NFCSGICASS VRKWDKLCVG NVGEDVRVLT  600
RKNINDPGEP PGVVLCAATS VWIPVTRQRV FDFMRNEQMR SEWDILSNGG PMQEMVHIAK  660
GQEHDNCVSL LRAGAMNAND SEGCRVKNEP QVENFWWTVG LINRGYVSSS SSGSGIDSVD  720
QRAAQLAESI LVTFSFSLKM LSVFLFGESA SSNQVKTAAL SISSAINGTI PRATRTSCLF  780
KLTDILPKPG SLDIDVFRNQ RGVFGLAKHI SQKCTSIPKR QQRVRDHAFD NYMEIGKKMR  840
KVVKFQSLIL SQHNQTLPIS RLDFLSRRIG FKRLEAGKFL LKFPHVFEIY EHPVQRILYC  900
RPTRKALQQI EQENQALNAQ IPEAVTRLRK LLMMSNSGRL RLEHVRIARS EFGLPEDFEY  960
SVILKNPQFF RLFDAEETRN KYIELVERDS RLAVCAIENV REKEYRERGI DAEDIRFSFI  1020
VNFPPGFKIG KYYRIAVWKW QRVPYWSPYE DVSGYDLRSL EAQKRMEKRA VATIHELLSL  1080
TVEKKITMER IAHFRLAMNL PKKLKEFLLQ HQGIFYISTR GNHGKLHTVF LRETYRKGEL  1140
IEPNDLYLAR RKLGELVLLS PRTAKMDGDL VSYRWDRDDH DMGRDRRRVY SENVFENFGV  1200
EDNVGGDGKG DDNLDSDLGS DVESDITGED TDSDEIVDTE EDALAVSIVL GRNSPCRKAS  1260
TGMRFKELLC ISLFFNFEEE LTTMEDLCMP AWCPLCTPVI
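
The domain coordinates in the Signature Domain table are 1-based and inclusive, and the sequence above is printed in groups of ten residues with a running position counter at the end of each line. As a minimal sketch (the helper names `parse_grouped_sequence` and `extract_domain` are illustrative and not part of PlantTFDB), the following Python snippet joins the grouped text into one string and cuts out the Homeobox region (residues 54 to 119) from the first two lines of this entry:

```python
import re


def parse_grouped_sequence(text: str) -> str:
    """Join a sequence printed in 10-residue groups with trailing counters."""
    # Keeping letters only drops spaces, newlines and the counters (60, 120, ...).
    return re.sub(r"[^A-Za-z]", "", text).upper()


def extract_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("domain coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First two lines of the protein sequence from this entry (residues 1-120).
first_lines = """
MDAHGDMGLL GEHFDPSIVG RIREDGYESR SGSDNVEGAS GDDLETNDDG PPRKKKYHRH  60
TPHQIQELEA YGLLISFSFN FFKECPHPDE KQRSELSKRL GLESKQIKFW FQNRRTQMKT  120
"""

prefix = parse_grouped_sequence(first_lines)
homeobox = extract_domain(prefix, 54, 119)
print(len(prefix), len(homeobox))
print(homeobox)
```

The same two helpers can be applied to the full 1300-residue sequence to extract the START region (residues 271 to 536) or any of the ranges listed under Protein Features.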
Functional Description
Source   Description
UniProt  Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_006490345.1  0.0      homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot  Q0WV12          0.0      ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL     A0A067EY32      0.0      A0A067EY32_CITSI; Uncharacterized protein
STRING     XP_006490345.1  0.0      (Citrus sinensis)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]