PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID 472358
Common NameARALYDRAFT_472358
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family Trihelix
Protein Properties Length: 441aa    MW: 50615 Da    PI: 6.3632
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
472358genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix85.75.7e-27115209185
  trihelix   1 rWtkqevlaLiearremeerlrrgk.........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdql 85 
               +Wt+++v++Li+a+++++++++ ++         +kk++W++vsk+m+erg+++sp+qC++k+++lnkrykk++++ +++ +++++++ +++d++
    472358 115 KWTDKMVKLLITAVSYIGDDSSMDSgsrrkfavlQKKGKWKSVSKVMAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGtSCQVVENPALLDSI 209
               7**************9888888665556778888**********************************************669999998888765 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.1E-24113236No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 441 aa     Download sequence    Send to blast
MDGNFPQGGV VRGGAASYGG FDLQGSMRVQ HQDSMNQQHR HNPNSRPLHE GLPFTMVTGQ  60
TCDHHQNMSM SEQQKPEREK NPVSDDDEPS FTEEGGDGHN EANRSTKGSP WQRVKWTDKM  120
VKLLITAVSY IGDDSSMDSG SRRKFAVLQK KGKWKSVSKV MAERGYHVSP QQCEDKFNDL  180
NKRYKKLNDM LGRGTSCQVV ENPALLDSIG YLNDKEKDDV RKIMSSKHLF YEEMCSYHNG  240
NRLHLPHDLA LQRSLQLALR NRDDHDNGDS RKHQMEDLDD EDHDGDGDEH DEYEEQHYSY  300
GECRGNHYGG GGGPLKKIRQ SHSHEDADHP SHVNSLECNK VSLPQMPFSQ ADVNQGGAES  360
ARAASVQKQW IESRTLQLEE QKLQIQVELL ELEKQRFRWQ RFSKKRDQEL ERMRMENERM  420
KLENDRMGLE LKQRELGVEL *
Cis-element ? help Back to Top
SourceLink
PlantRegMap472358
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY0458500.0AY045850.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBankAY0913780.0AY091378.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
GenBankF16F40.0AC036104.3 Sequence of BAC F16F4 from Arabidopsis thaliana chromosome 1, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020869362.10.0uncharacterized protein LOC9329218 isoform X1
RefseqXP_020869363.10.0uncharacterized protein LOC9329218 isoform X2
TrEMBLD7KJZ20.0D7KJZ2_ARALL; Uncharacterized protein
STRINGfgenesh2_kg.1__2312__AT1G21200.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM50422752
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.10.0sequence-specific DNA binding transcription factors