PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_9748_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HD-ZIP
Protein Properties Length: 683aa    MW: 76389.2 Da    PI: 6.9894
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_9748_f_1genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox59.55.6e-192879556
                   SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
       Homeobox  5 ttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                   +++t++q++ Le++F+ +++p++++r++L+++lgL+ +q+k+WFqN+R++ k
  Neem_9748_f_1 28 HRHTAYQIQSLETFFKDCPHPDENQRRQLSRELGLDPKQIKFWFQNKRTQTK 79
                   6789********************************************9987 PP

2START90.73.1e-2930942094206
                    EEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX.HHHHHHHHHH CS
          START  94 mvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp.hwllrslvks 187
                    m  ++  lsplvp R+++f+R+++q + g wvivdvS d  +++       R+ +lpSg++i++++n  s+vtwvehv++++++  h l+r l+  
  Neem_9748_f_1 309 MHEQMHILSPLVPpREYYFLRHCQQIEPGLWVIVDVSYDWLRENIA---PSRCWRLPSGCMIQEMPNSCSNVTWVEHVEVDDKTLtHRLYRDLICG 401
                    66788899***********************************973...5699*****************************9988********** PP

                    HHHHHHHHHHHHTXXXXXX CS
          START 188 glaegaktwvatlqrqcek 206
                    + a ga +wv tlqr ce+
  Neem_9748_f_1 402 SSAYGAERWVLTLQRMCER 420
                    *****************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.602.8E-20775IPR009057Homeodomain-like
SuperFamilySSF466892.42E-181082IPR009057Homeodomain-like
PROSITE profilePS5007116.6652181IPR001356Homeobox domain
SMARTSM003892.1E-162185IPR001356Homeobox domain
CDDcd000862.86E-162482No hitNo description
PfamPF000461.6E-162879IPR001356Homeobox domain
PROSITE patternPS0002705679IPR017970Homeobox, conserved site
PROSITE profilePS5084824.946223423IPR002913START domain
SuperFamilySSF559611.65E-26225421No hitNo description
CDDcd088756.22E-83227419No hitNo description
SMARTSM002344.9E-10232420IPR002913START domain
Gene3DG3DSA:3.30.530.203.6E-4309384IPR023393START-like domain
PfamPF018529.4E-24310420IPR002913START domain
SuperFamilySSF559612.06E-14441656No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 683 aa     Download sequence    Send to blast
MDFVARNGSN SGDEQEHSNS RKGKKTFHRH TAYQIQSLET FFKDCPHPDE NQRRQLSREL  60
GLDPKQIKFW FQNKRTQTKA QNERADNSIL RAENERIQCE NLAIREALKN VICPACGGPP  120
FGEEERQRSL EKLQLENAQL KEEASTPSFH EKVSNLLAKY IGKPISQINP LMPVPISTPT  180
SGPVVDHPVP VPAVFPIQDM DLDLNSSGNS ENIPLTFQLK GISDVDKALM VESVNSAMDQ  240
LIRLLQINEP LWIKSSSDGR YTIHRDSYEK IFPRVSHFKT STSRLESSKC SGMVTMNAMQ  300
LVDVFLDSMH EQMHILSPLV PPREYYFLRH CQQIEPGLWV IVDVSYDWLR ENIAPSRCWR  360
LPSGCMIQEM PNSCSNVTWV EHVEVDDKTL THRLYRDLIC GSSAYGAERW VLTLQRMCER  420
LGFSIPENSR AHHKFAGVIN LPEGRKSMLK LAHRMVKNFC AVLSMSGKLD FPQLSEVNNS  480
GVRVSVRKSI EPGQPSGMIV SAATSLWLPL PSENVFNFFK NEKMRVQWDV LSNGNPVHEI  540
ARISTGAHPG NCISIIRPFI PTESNMVMLQ ESCIDILGSM MVYAPIDIPS MNLAISGEDS  600
STIAILPSGF VISGDGRQRD INGASTSTNT ASVGSLLTVA FQILVSSPAS SKDFNMESVA  660
TVNTLISSTV QRIKATLNCS NLD
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006421085.10.0homeobox-leucine zipper protein ROC8
SwissprotQ69T580.0ROC8_ORYSJ; Homeobox-leucine zipper protein ROC8
TrEMBLA0A2H5PLL60.0A0A2H5PLL6_CITUN; Uncharacterized protein
TrEMBLV4S6730.0V4S673_9ROSI; Uncharacterized protein
STRINGXP_006421085.10.0(Citrus clementina)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM30042665
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73360.10.0homeodomain GLABROUS 11