PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Tp1g18840
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassicaceae incertae sedis; Schrenkiella
Family Trihelix
Protein Properties Length: 441aa    MW: 50468 Da    PI: 6.2874
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Tp1g18840genomethellungiellaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix85.19e-27113207185
   trihelix   1 rWtkqevlaLiearremeerlrrgk.........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdql 85 
                +Wt+++v++Li+a++++++++  ++         +kk++W++vsk+m+erg+++sp+qC++k+++lnkrykk++++ +++ +++++++ +++d++
  Tp1g18840 113 KWTDKMVKLLITAVSYIGDDSTMDSgsrrkfavlQKKGKWKSVSKVMAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGtSCQVVENPVLLDSI 207
                7**************8877777654456677888**********************************************569999988888765 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.2E-24111234No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 441 aa     Download sequence    Send to blast
MDGNFPQGGV VRGGVSSFGG FDLQGSMRVH PPDSMNQHRQ NPNSRPLHEG LPFTMVTGQT  60
CDHHHMSMTE QHKGEREKNS VSDDEEPSFN EEGGDGQNEA NKSAKGSPWQ RVKWTDKMVK  120
LLITAVSYIG DDSTMDSGSR RKFAVLQKKG KWKSVSKVMA ERGYHVSPQQ CEDKFNDLNK  180
RYKKLNDMLG RGTSCQVVEN PVLLDSIGYL NDKEKDDVRK IMSSKHLFYE EMCSYHNGNR  240
LHLPHDLALQ RSLQLALRNR DDHDNDDSRK LQMEDLDDED HDGEGDEHDE YEEQHYSHGD  300
CRGVHYGGGG LGGGPLKKTR MSHSHEDVDH PSHVNSLECN KVSLAQMPFP QADASQGGAE  360
SGRAASVQKQ WIESRTLQLE EQKLQIQVEL LELEKQRFRW QRFSKKRDQE LERMRMENER  420
MKLENDRMGL ELKQRELGVE L
Cis-element ? help Back to Top
SourceLink
PlantRegMapTp1g18840
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY0458500.0AY045850.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBankAY0913780.0AY091378.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
GenBankF16F40.0AC036104.3 Sequence of BAC F16F4 from Arabidopsis thaliana chromosome 1, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006307528.10.0uncharacterized protein LOC17897295
TrEMBLR0GX820.0R0GX82_9BRAS; Uncharacterized protein
STRINGCagra.1632s0015.1.p0.0(Capsella grandiflora)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM50422752
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.10.0sequence-specific DNA binding transcription factors