PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Carubv10015764m
Common Name CARUB_v10015764mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family HD-ZIP
Protein Properties Length: 688aa    MW: 77499.2 Da    PI: 5.855
Description HD-ZIP family protein
Gene Model
Gene Model ID      Type      Source    Coding Sequence
Carubv10015764m    genome    JGI       View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  58.4   1.2e-18  28     79   5          56
                     SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox  5 ttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                     ++++++q+++Le++F+++++p+ ++r++L ++l+L+ +q+k+WFqN+R++ k
  Carubv10015764m 28 HRHSPQQIQRLEAYFKECPHPDGSQRRQLCQELKLEANQIKFWFQNKRTQCK 79
                     5789********************************************9988 PP

2    START     120.6  2.1e-38  201    421  3          206
                      HHHHHHHHHHHHHHC-TT-EEEE.......EXCCTTE...EEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
            START   3 aeeaaqelvkkalaeepgWvkss.......esengde...vlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                      a +a +el ++ laee++Wvkss       +se +++   +++ f ++   ++e ++++++v+ ++++l e +ld+  +W+e ++    ka+t+
  Carubv10015764m 201 AACAVEELKRLFLAEEQFWVKSSidgtyviDSESYEKfsrCVKHFRSTS-AHVESSKDVTMVPIEATTLIEMFLDSE-KWKEFFPtivnKAKTI 292
                      6789999****************88866664444443222555554444.6**************************.**************** PP

                      EEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE CS
            START  83 evissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwve 168
                        + s        ++l +m  +l  lsplvp R+f++vR++++ ++g w+i+dvS ++  +   ++s+    ++pSg+li++++n hskv w+e
  Carubv10015764m 293 HELGSElpirencNVLLVMWEQLHILSPLVPpREFMIVRCCQEIQKGLWIIADVSHNVGFDFV-NASC---YKRPSGCLIQSLPNAHSKVMWIE 382
                      **********************************************************99888.4555...559******************** PP

                      -EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
            START 169 hvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                      hv+++ +l  h ++r + + g+  gak+w  tl+r ce+
  Carubv10015764m 383 HVEVDHKLDsHKMYREVSSGGTGYGAKRWIVTLERMCER 421
                      ********88***************************96 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689            2.34E-17  8      82   IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60    5.3E-19   8      83   IPR009057    Homeodomain-like
PROSITE profile  PS50071             15.807    21     81   IPR001356    Homeobox domain
SMART            SM00389             2.3E-14   23     85   IPR001356    Homeobox domain
CDD              cd00086             4.10E-16  24     82   No hit       No description
Pfam             PF00046             4.3E-16   28     79   IPR001356    Homeobox domain
PROSITE profile  PS50848             37.027    190    424  IPR002913    START domain
SuperFamily      SSF55961            2.43E-25  191    422  No hit       No description
CDD              cd08875             1.82E-88  195    420  No hit       No description
SMART            SM00234             1.9E-19   199    421  IPR002913    START domain
Pfam             PF01852             4.3E-32   201    421  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20   1.4E-5    240    390  IPR023393    START-like domain
SuperFamily      SSF55961            2.9E-8    443    651  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 688 aa
MEIDGSGSSS NEQYASEDSN QDEKRVSHRH SPQQIQRLEA YFKECPHPDG SQRRQLCQEL  60
KLEANQIKFW FQNKRTQCKA QEERYTNMQL RGENENLQNE NKAMLEVLRN VKCPKCGGPS  120
FGKEDREPNL QKLRLENALL KEQHLSKRPM VEEDSFSSVP SQQSFHHNSS HGINPSNMFE  180
PSSSFGPPTP PMDINLLSET AACAVEELKR LFLAEEQFWV KSSIDGTYVI DSESYEKFSR  240
CVKHFRSTSA HVESSKDVTM VPIEATTLIE MFLDSEKWKE FFPTIVNKAK TIHELGSELP  300
IRENCNVLLV MWEQLHILSP LVPPREFMIV RCCQEIQKGL WIIADVSHNV GFDFVNASCY  360
KRPSGCLIQS LPNAHSKVMW IEHVEVDHKL DSHKMYREVS SGGTGYGAKR WIVTLERMCE  420
RMALNSIQTL PASDWSDVVT TGEKRRRVMK LGGRMIQDFN GMLTMSGKVD FPKQSKCGVR  480
VSIRLNQEQG QPPGLIVSAA SSLSIPLTPS QVFDFLLKLD NRHQWDVLSY GTAVSEIARI  540
ATGSNESNCL TILRVHPMQE ESDDKSMMER DPEEGDMLML QDSYMDALGG MIVYAPMDMA  600
TMNMALSTNV EAPNIPILPS GFIISSDRRP STTEDGGTLL TVAFQILVSG KTTSSIEVNE  660
RSVNIVSALI SSTVQKIKEL LNCPPCE*
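The sequence above is displayed in blocks of ten residues with running position counters and a trailing stop marker, so it needs cleaning before a domain can be cut out by coordinates. The following is a minimal sketch, assuming the Start/End values in the Signature Domain table are 1-based and inclusive (which agrees with the Homeobox alignment shown for positions 28 to 79). The helper names `clean_sequence` and `slice_region` are hypothetical and not part of any PlantTFDB tooling, and only the first two display lines are used as sample input.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop whitespace, position counters and a trailing '*' stop marker."""
    # Keep letters and '*' only; digits and spaces are display formatting.
    letters = re.sub(r"[^A-Za-z*]", "", block)
    return letters.rstrip("*").upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First two display lines of the protein sequence (residues 1-120).
sample = """
MEIDGSGSSS NEQYASEDSN QDEKRVSHRH SPQQIQRLEA YFKECPHPDG SQRRQLCQEL  60
KLEANQIKFW FQNKRTQCKA QEERYTNMQL RGENENLQNE NKAMLEVLRN VKCPKCGGPS  120
"""

seq = clean_sequence(sample)
homeobox = slice_region(seq, 28, 79)  # Homeobox hit: Start 28, End 79
print(len(seq))
print(homeobox)
```

With this sample input, the extracted region matches the query row of the Homeobox alignment in the Signature Domain section.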
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    443    447  KRRRV
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Carubv10015764m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_023642294.1  0.0      homeobox-leucine zipper protein HDG8 isoform X1
Swissprot  Q9M9P4          0.0      HDG8_ARATH; Homeobox-leucine zipper protein HDG8
TrEMBL     R0HRS6          0.0      R0HRS6_9BRAS; Uncharacterized protein
STRING     XP_006299586.1  0.0      (Capsella rubella)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM127911530
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G03260.1  0.0      homeodomain GLABROUS 8