PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Niben101Scf05034g04004.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae; Nicotiana
Family HD-ZIP
Protein Properties Length: 733 aa    MW: 80760.5 Da    pI: 5.056
Description HD-ZIP family protein
Gene Model
Gene Model ID             Type    Source  Coding Sequence
Niben101Scf05034g04004.1  genome  BTI     -
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  27.4   5.7e-09  74     99   31         56
                              HHHHHHCTS-HHHHHHHHHHHHHHHH CS
                  Homeobox 31 eeLAkklgLterqVkvWFqNrRakek 56
                               eL kkl+++ +qVk+WFqNrR+++k
  Niben101Scf05034g04004.1 74 LELGKKLSMDSKQVKFWFQNRRTQMK 99
                              5899*******************998 PP

2    START     165.4  4.1e-52  249    471  2          205
                               HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S CS
                     START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla 77 
                               la++a++el+ +a+ +ep+Wv+s     e +n +e+ ++f++  +     + +ea ra+g v+ ++ +lve+l+d + +W e++ 
  Niben101Scf05034g04004.1 249 LALAAMNELLGLAEIGEPLWVRSLdgggETLNLEEYARSFTSCTGmkpghFATEATRATGTVIVNSLTLVETLMDMS-RWVEMFS 332
                               6899************************99**********87666999999**************************.******* PP

                               ....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEE CS
                     START  78 ....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSg 151
                                   k ++++vi         + lql+++e+q +s lvp R+  f+R+++q+ +g+w++vdvSvd   + ++ + +  +++lpSg
  Niben101Scf05034g04004.1 333 civgKTSVVNVIPGStcgswsSNLQLIQTEFQIISDLVPaREMKFLRFCKQQAEGVWAVVDVSVDTVHESSQPHDIGNCRRLPSG 417
                               ************9999*************************************************9999998899********** PP

                               EEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                     START 152 iliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                               +++++++ng+skvtw+eh+++ ++++h l+r+lv++gl +ga++w+atlqrq e
  Niben101Scf05034g04004.1 418 CIVQDMPNGYSKVTWIEHMEYYENVVHHLYRPLVRNGLGFGAQRWMATLQRQSE 471
                               ***************************************************975 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          7.7E-10    43     100  IPR009057    Homeodomain-like
SMART            SM00389           0.0022     56     105  IPR001356    Homeobox domain
CDD              cd00086           3.07E-7    57     101  No hit       No description
Gene3D           G3DSA:1.10.10.60  7.0E-7     70     100  IPR009057    Homeodomain-like
Pfam             PF00046           1.8E-6     74     99   IPR001356    Homeobox domain
PROSITE profile  PS50071           10.56      75     101  IPR001356    Homeobox domain
PROSITE profile  PS50848           39.501     239    475  IPR002913    START domain
SuperFamily      SSF55961          1.92E-30   240    471  No hit       No description
CDD              cd08875           9.19E-105  243    471  No hit       No description
SMART            SM00234           3.0E-39    248    472  IPR002913    START domain
Pfam             PF01852           2.0E-44    249    470  IPR002913    START domain
SuperFamily      SSF55961          9.71E-11   497    727  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 733 aa
MESHSDMSEK GESSNNVVTG RPKEEDESES RSIGDNFLNG VASEDESDSV LSNSSRKRKY  60
SRHTANQILE LETLELGKKL SMDSKQVKFW FQNRRTQMKS QLERHENGML KQENDRLRIE  120
HIAMQEAMKH PICNRCRSQA IIADINVEEH QTKIEHERLK EEVKRIGVLA DKLLGPLSSL  180
EGSMASVTAN PAFGLPEGIN GFGGINYATA ASPMGLDFDN GLSSPPPVVI STSLANVDVS  240
YDKSMLMDLA LAAMNELLGL AEIGEPLWVR SLDGGGETLN LEEYARSFTS CTGMKPGHFA  300
TEATRATGTV IVNSLTLVET LMDMSRWVEM FSCIVGKTSV VNVIPGSTCG SWSSNLQLIQ  360
TEFQIISDLV PAREMKFLRF CKQQAEGVWA VVDVSVDTVH ESSQPHDIGN CRRLPSGCIV  420
QDMPNGYSKV TWIEHMEYYE NVVHHLYRPL VRNGLGFGAQ RWMATLQRQS EFLAMLMSSV  480
DSPVFCSSGQ TSMAMLAQRM TRNFCAGVCA TIHKWESIQQ ANGEDAKLMM RKNIGDPGEP  540
IGVVLSATKT IWIPVKQQRL LDFLLNEQTR SQWDILFNSG PMQQMVHIAK GQNIDNSISL  600
FRANGDANSD SENNMLILQD TCTDTTGSLI VYATIDAADM DVVMNGGDSS SVAFLPSGIA  660
IVPDYFQDFS GANVETSWEK DNGFSGTGSL VTIGFQVLVN SSPAEKLSME SVQKVNNLIS  720
HTIHGIKAAF KSK
Functional Description
Source   Description
UniProt  Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_009771833.1  0.0      PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot  Q0WV12          0.0      ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL     A0A1U7VZN6      0.0      A0A1U7VZN6_NICSY; homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
STRING     XP_009771833.1  0.0      (Nicotiana sylvestris)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA1626              24           65
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]