PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Mapoly0213s0014.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Marchantiophyta; Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia
Family HB-other
Protein Properties    Length: 1567 aa    MW: 177689 Da    pI: 8.9982
Description HB-other family protein
Gene Model
Gene Model ID        Type    Source
Mapoly0213s0014.1.p  genome  JGI
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    Homeobox  25.4   2.5e-08  1077   1109  22         54
                           SSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHH CS
             Homeobox   22 nrypsaeereeLAkklgLterqVkvWFqNrRak 54  
                           ++yp+  ++e LA+ +gLt +qV++WF N R +
  Mapoly0213s0014.1.p 1077 HPYPKDVDKESLASATGLTRSQVSNWFINARVR 1109
                           89*****************************87 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
PROSITE profile  PS50071            11.758    1050   1113  IPR001356    Homeobox domain
SMART            SM00389            4.9E-9    1053   1117  IPR001356    Homeobox domain
CDD              cd00086            1.32E-8   1053   1114  No hit       No description
SuperFamily      SSF46689           1.84E-15  1054   1120  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   6.6E-25   1056   1119  IPR009057    Homeodomain-like
Pfam             PF05920            6.3E-16   1070   1109  IPR008422    Homeobox KN domain
SuperFamily      SSF51161           7.9E-9    1241   1386  IPR011004    Trimeric LpxA-like
Gene3D           G3DSA:2.160.10.10  1.7E-11   1242   1387  No hit       No description
SuperFamily      SSF51161           6.64E-5   1396   1478  IPR011004    Trimeric LpxA-like
Gene3D           G3DSA:2.160.10.10  3.5E-6    1396   1478  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048481  Biological Process  plant ovule development
GO:0005634  Cellular Component  nucleus
GO:0005829  Cellular Component  cytosol
GO:0009506  Cellular Component  plasmodesma
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1567 aa
MKSEGDPDAL SERTSRFLLT GILGEKVVPL TKLLEDITAE SNEETAYSHL LRKGKATPAC  60
LNQIRSSANR PMCPAERRGS LVQKEGHCLD SQTHCLDSQT ESGFPAKHQP AKSAGRGGGL  120
PTHDSDHGKR FHARGDPEQH IHQSQEVRPS ITSRSDAREF QTERKVATED QGRHGEHQTQ  180
RQPLRRDLNV QNEGISVTPA AHQQAGREAF QGGCSTSTQH NEEIHRKCQK QASADQDRGQ  240
HINAEFLAVL MAAHYQEAYL KASLAAKYLQ EYQNAKQSQH LPAGSQKAHV AERFSEAYQV  300
ALELSQLPVN PESQKSLMEE FYTNLSAAPE AASGGKFSAS PPTPYLHPFY EGSATPQGGS  360
FPAQCEPSHG PLVPSSPLPS PPTGPEQDRS DSTGGQSQLP HGLGQTFLHF SQLPSTAEHA  420
KTVTEEGSDR VIDERDQKPA ERSEVFPHHP HMSFACQGKV MTRGAKVESE APFVAIFPLY  480
KPPPQPQPKP EPHYGGNVGK QVVQETGDHA RVMTSQGPHW LYSSGPSIAQ CQSRPPNETK  540
DARIQREQEA PPGTQGVGSE RETYCDGSDG KRSTGFKRWS APQNGDGRAQ NPIQFAVANQ  600
APQGDGPPHQ RAPPHPLTFV DLGDASEDGE SVQTPLPEDK ESDSEPESER FAGQPRDEDT  660
EVGARVERNV EESKSGAETL QESGDDDAGE LEDEEREREG EGEGEVKSER RFYFFDFTKI  720
CDNLYPPDVT TKMLKDFGLR YYPMRFREDE KIREIEALFR EVLSFSRPKN VPEIPIVPFD  780
FLRSQRYFTE MAQRPAARKS ASARSSQSTA AGTSHPAAQL WAALIAQRAR AFRRECSIRI  840
ENQGIDLADS FTADQLDYKP VHRGRSRDVM AWLRKPRSRG VTLHDRKKMF RAYIDEIDDW  900
YYEYRSTLEN INDETDDSFG HFIGNIYTEY PLMELSRRLR RYRDTFLEHL VCVKKVLGDS  960
TAVPDFDKNI RDRLIACRVR AAKRMARKAL MVIRRAHAPY KKRMINGCHK MIVKLEPKQE  1020
NRIAEAVRIA GTNNGPRRQQ RMYSGRLNLN STWRPQRGLP DRAVSVLKAW LFTNFLHPYP  1080
KDVDKESLAS ATGLTRSQVS NWFINARVRV WKPMIEAMYL MDFPEDRHRF ITMSRARKVD  1140
HGQELGDRKP GTSRSRFFPC VDGRTRRSRK RQCIYRPASH SLALGSRVLP SFLRHAQASS  1200
RMRHQASRAF APYSREMLAQ FLRNASVWRA QLGESRQNRA EVEHRFAIQG RAEVGNRFTI  1260
QNRTQVEPKF TIQNRAQVEP RLTVQDRAQV ESRFPIQNRA QVEPRLIVQD RAQVEPRFPT  1320
QNRAQVEPRV TVQDRTQVEH SFIIQNQGQV EPRLTVQDRD QVETRFPIQN RAQVEPSVTV  1380
QDRAQVEHSF ISQNRAPVEP RFTIQDRAQA EPKFTFPSQA KVEPRLTIQD RAQLEPRFTI  1440
QDRAQVGTRF TIQDRAPVET RFSIQDRAKV DTTLTIKDLR AELEPRLTSE KIDYDNHISL  1500
ELQLGSERTK ARPESTGPRL RFQDGRLLSE RMRIGHHTEP PKGEIPMDIL KFEDLERIRK  1560
GKGIVE*
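
The Signature Domain coordinates (Start 1077, End 1109) can be checked against the numbered sequence block above. The following is a minimal sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive, and that the number at the end of each sequence line is the position of the last residue on that line. The helper names `parse_numbered_block` and `region` are hypothetical, and only the two sequence lines covering positions 1021-1140 are copied in.

```python
import re

def parse_numbered_block(text):
    """Map 1-based positions to residues for lines that end in a running count.

    Assumption: the trailing number is the position of the last residue on
    that line. Lines without a trailing count (such as the final partial
    line of the record) are skipped in this sketch.
    """
    residues = {}
    for line in text.strip().splitlines():
        m = re.match(r"^([A-Z* ]+?)\s+(\d+)\s*$", line.strip())
        if not m:
            continue
        chunk = m.group(1).replace(" ", "")
        end = int(m.group(2))
        start = end - len(chunk) + 1
        for offset, aa in enumerate(chunk):
            residues[start + offset] = aa
    return residues

def region(residues, start, end):
    """Return the residues from start to end, both inclusive (1-based)."""
    return "".join(residues[i] for i in range(start, end + 1))

# Two lines copied from the sequence block (positions 1021-1140).
excerpt = """
NRIAEAVRIA GTNNGPRRQQ RMYSGRLNLN STWRPQRGLP DRAVSVLKAW LFTNFLHPYP  1080
KDVDKESLAS ATGLTRSQVS NWFINARVRV WKPMIEAMYL MDFPEDRHRF ITMSRARKVD  1140
"""

residues = parse_numbered_block(excerpt)
homeobox = region(residues, 1077, 1109)
print(homeobox)  # HPYPKDVDKESLASATGLTRSQVSNWFINARVR
```

The printed segment is 33 residues long and matches the query line of the Homeobox alignment in the Signature Domain section.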
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    874    887  KPRSRGVTLHDRKK
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A2R6W049  0.0      A0A2R6W049_MARPO; Uncharacterized protein
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Representative plantOGRP13316172
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G41410.1  7e-32    TALE family protein