PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID NNU_002333-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; stem eudicotyledons; Proteales; Nelumbonaceae; Nelumbo
Family Trihelix
Protein Properties Length: 496aa    MW: 55957.8 Da    PI: 6.2131
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
NNU_002333-RA  genome  CAS     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  75.2   1.1e-23  110    211  1          86
       trihelix   1 rWtkqevlaLiearremeerlrrgk...............lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcp 80 
                    +Wt+++v++Li ++ +++++ +++                +kk++W++vs++m+e+gf +sp+qC++k+++lnkryk+++++ +++ +++++++ +
  NNU_002333-RA 110 KWTDNMVRLLIMVVFYIGDDGGSEGndpsgagkkkpggllQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVTDILGRGtACKVVENQS 205
                    7***************999999643367788889999999**********************************************55******** PP

       trihelix  81 yfdqle 86 
                    ++d +e
  NNU_002333-RA 206 LLDTME 211
                    ***998 PP

Protein Features
Database  Entry ID  E-value  Start  End  InterPro ID  Description
Pfam      PF13837   4.8E-21  108    237  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0010629  Biological Process  negative regulation of gene expression
GO:1900037  Biological Process  regulation of cellular response to hypoxia
GO:0005634  Cellular Component  nucleus
Sequence
Protein Sequence    Length: 496 aa
MEVNGMAGGM ISGLSPGMLG LEMPLHQAQQ QHHPHQHHHQ QQHPQMVSFA GQDADHHSQS  60
QSVKQHVYPP YATKAKPQQP SLSDEEEQGF AGEDGGADGK KRMSPWQRMK WTDNMVRLLI  120
MVVFYIGDDG GSEGNDPSGA GKKKPGGLLQ KKGKWKSVSR AMMEKGFYVS PQQCEDKFND  180
LNKRYKRVTD ILGRGTACKV VENQSLLDTM EQLSPKTKDE VRKLLNSKHL FFREMCAYHN  240
SCGGGGLGGG AGGGHHPSEV AAESANHQQQ QHCFHSSEGP SVISNSKGLS RGETEGLKMR  300
TGSGREEEED CDDDDDDDDD DDDDDDDDEE SMENGGRGHS SEHGGAEGGG GREEEEDNEK  360
LGSRKRKRKR RGMSSTPPLI QQLNSEMMNV LQDGAKSPWE QREWMRTKSV QLEEQRVSYQ  420
CQAVELEKQR FKWIKFSGKK EREMERLKLD NERMRLENER MVLLLRQKEL ELINLHQNQQ  480
QQQQQHSSNK RPDQST
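
The blocked layout above (groups of ten residues with a running count at the end of each full line) has to be flattened before the sequence can be used programmatically. The following is a minimal Python sketch of that step, assuming the sequence is pasted exactly as shown. It checks the stated length of 496 aa and extracts the trihelix region from the Signature Domain table, reading Start 110 and End 211 as 1-based inclusive positions, which agrees with the alignment. The helper names `flatten_blocked_sequence` and `slice_1based` are illustrative and not part of PlantTFDB.

```python
# Protein sequence copied from this record, including the running counts.
BLOCKED = """
MEVNGMAGGM ISGLSPGMLG LEMPLHQAQQ QHHPHQHHHQ QQHPQMVSFA GQDADHHSQS  60
QSVKQHVYPP YATKAKPQQP SLSDEEEQGF AGEDGGADGK KRMSPWQRMK WTDNMVRLLI  120
MVVFYIGDDG GSEGNDPSGA GKKKPGGLLQ KKGKWKSVSR AMMEKGFYVS PQQCEDKFND  180
LNKRYKRVTD ILGRGTACKV VENQSLLDTM EQLSPKTKDE VRKLLNSKHL FFREMCAYHN  240
SCGGGGLGGG AGGGHHPSEV AAESANHQQQ QHCFHSSEGP SVISNSKGLS RGETEGLKMR  300
TGSGREEEED CDDDDDDDDD DDDDDDDDEE SMENGGRGHS SEHGGAEGGG GREEEEDNEK  360
LGSRKRKRKR RGMSSTPPLI QQLNSEMMNV LQDGAKSPWE QREWMRTKSV QLEEQRVSYQ  420
CQAVELEKQR FKWIKFSGKK EREMERLKLD NERMRLENER MVLLLRQKEL ELINLHQNQQ  480
QQQQQHSSNK RPDQST
"""


def flatten_blocked_sequence(text: str) -> str:
    """Keep letters only, dropping spaces, newlines and running counts."""
    return "".join(ch for ch in text if ch.isalpha())


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    return seq[start - 1:end]


protein = flatten_blocked_sequence(BLOCKED)
# Assumption: Start/End in the Signature Domain table are 1-based inclusive.
domain = slice_1based(protein, 110, 211)

print(len(protein))              # 496
print(len(domain))               # 102
print(domain[:11], domain[-6:])  # KWTDNMVRLLI LLDTME
```

The first and last residues of the extracted region match the first and last aligned residues of NNU_002333-RA in the trihelix alignment (KWTDNMVRLLI at 110, LLDTME ending at 211).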
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    364    370  KRKRKRR
2    365    370  RKRKRR
3    438    447  KKEREMERLK
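
Comparing the NLS rows with the protein sequence in this record suggests that Start and End are 0-based and inclusive here, unlike the domain coordinates, which read as 1-based. The Python sketch below checks that reading against all three rows. The convention is inferred from this record alone, not from PlantTFDB documentation, and the helper name `segment_0based_inclusive` is hypothetical.

```python
# Protein sequence from this record, in blocks of ten, without running counts.
PROTEIN = "".join("""
MEVNGMAGGM ISGLSPGMLG LEMPLHQAQQ QHHPHQHHHQ QQHPQMVSFA GQDADHHSQS
QSVKQHVYPP YATKAKPQQP SLSDEEEQGF AGEDGGADGK KRMSPWQRMK WTDNMVRLLI
MVVFYIGDDG GSEGNDPSGA GKKKPGGLLQ KKGKWKSVSR AMMEKGFYVS PQQCEDKFND
LNKRYKRVTD ILGRGTACKV VENQSLLDTM EQLSPKTKDE VRKLLNSKHL FFREMCAYHN
SCGGGGLGGG AGGGHHPSEV AAESANHQQQ QHCFHSSEGP SVISNSKGLS RGETEGLKMR
TGSGREEEED CDDDDDDDDD DDDDDDDDEE SMENGGRGHS SEHGGAEGGG GREEEEDNEK
LGSRKRKRKR RGMSSTPPLI QQLNSEMMNV LQDGAKSPWE QREWMRTKSV QLEEQRVSYQ
CQAVELEKQR FKWIKFSGKK EREMERLKLD NERMRLENER MVLLLRQKEL ELINLHQNQQ
QQQQQHSSNK RPDQST
""".split())

# (No., Start, End, Sequence) rows as listed in the NLS table.
NLS_ROWS = [
    (1, 364, 370, "KRKRKRR"),
    (2, 365, 370, "RKRKRR"),
    (3, 438, 447, "KKEREMERLK"),
]


def segment_0based_inclusive(seq: str, start: int, end: int) -> str:
    """Return seq[start..end], assuming 0-based inclusive coordinates."""
    return seq[start:end + 1]


results = []
for no, start, end, motif in NLS_ROWS:
    found = segment_0based_inclusive(PROTEIN, start, end)
    results.append(found == motif)
    print(no, found, found == motif)

# Reading row 1 as 1-based inclusive gives a different segment.
one_based_row1 = PROTEIN[364 - 1:370]
print(one_based_row1)  # RKRKRKR
```

Under the 0-based reading all three rows reproduce the listed motifs, so converting to 1-based positions for comparison with the domain table means adding 1 to both Start and End.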
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_010243252.1  0.0      PREDICTED: uncharacterized protein LOC104587365
TrEMBL  A0A1U7YYP9      0.0      A0A1U7YYP9_NELNU; uncharacterized protein LOC104587365
STRING  XP_010243252.1  0.0      (Nelumbo nucifera)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G10040.1  5e-72    sequence-specific DNA binding transcription factors