PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Tp4g20010
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassicaceae incertae sedis; Schrenkiella
Family GRAS
Protein Properties Length: 727aa    MW: 81831.7 Da    PI: 7.0089
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Tp4g20010genomethellungiellaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS3733.9e-1143537221374
       GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaNq 100
                l++lL++cA+av+++d ++a +lL++++++++p gd +qRla++f+++L+arla+++s++yk + +++ s    + +l+a++lf  ++P+ k+s++++N+
  Tp4g20010 353 LRSLLIHCAQAVAADDSRCAGQLLKQIRQHSTPFGDGNQRLAHCFANGLEARLAGTGSQIYKGIVSKPRS---AAAVLKAHQLFLACCPFRKLSYFITNK 449
                689***************************************************************9999...999************************ PP

       GRAS 101 aIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrvkp 198
                +I + v ++ rvH+iDf+i +G+QWp+L++ ++    g+p++RiTg++ p++g   ++++eetg+rLa +A+++gvpfe+++ +ak++++++le+L++++
  Tp4g20010 450 TIRDLVGKSPRVHVIDFGILYGFQWPTLIHRFSM--YGSPKVRITGIEFPQPGfrPAQRVEETGQRLAAYAKHFGVPFEYKA-IAKKWDAIQLEDLDIDR 546
                *********************************9..8***************9*****************************.7**************** PP

       GRAS 199 gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvv 298
                +E ++Vn+ ++ ++l+desv++es+rd+vL+l+ ++sP+++v+   +  +n++ F++rf eal ++s++fd+l++ +pre+ er+++E e++gre+ nv+
  Tp4g20010 547 DEITVVNCLYRAENLHDESVKVESCRDAVLSLIGKISPDLFVFGTVNGAYNAPFFVTRFREALFHFSSIFDMLDTVVPREDGERMFLEMEVFGREALNVI 646
                **************************************************************************************************** PP

       GRAS 299 acegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                aceg er+er et+++W+ r +++G+ +vp++ +++k+    +++++ + + +++++ +l++gWk+r+++++S+W+
  Tp4g20010 647 ACEGWERVERPETYKQWHVRAMRSGLVQVPFDPSIMKTSLDKVHTFYHKDFVIDQDNRWLLQGWKGRTVMALSVWK 722
                ***********************************************888*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098561.297327702IPR005202Transcription factor GRAS
PfamPF035141.4E-111353722IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 727 aa     Download sequence    Send to blast
MITDPGLNGL SGAISRTRLP DQSSSLLPNH SFTPVTVLDG FNYNHLSDNR NTVVTSAPDN  60
SAFTRGEEEE DPADEFDFSD AVLGYISQML MDEDMEDKAC MLQESLDLEA AERSLYEAIG  120
KKYPPSPEPN LAFAERNSES LDHVVPRNYT GDRIGSSGFI ARNGGIKPMS GFTLDFRNPQ  180
SRSPVLSVPQ SSYSSSNDLI TTIHGDGFEE SSNALYHPDA NRESQWLFRR GLEEASRFLP  240
EKNELILNFR EENSSNKGRK KPYRDEIVVE EERCSKLPAV FPETISRSDV VDKILVHVPG  300
GEGMKEFDAL RDVLKKGVEK KKTLVPQEGK RRARGKGRGR GGGKNVKKEV VDLRSLLIHC  360
AQAVAADDSR CAGQLLKQIR QHSTPFGDGN QRLAHCFANG LEARLAGTGS QIYKGIVSKP  420
RSAAAVLKAH QLFLACCPFR KLSYFITNKT IRDLVGKSPR VHVIDFGILY GFQWPTLIHR  480
FSMYGSPKVR ITGIEFPQPG FRPAQRVEET GQRLAAYAKH FGVPFEYKAI AKKWDAIQLE  540
DLDIDRDEIT VVNCLYRAEN LHDESVKVES CRDAVLSLIG KISPDLFVFG TVNGAYNAPF  600
FVTRFREALF HFSSIFDMLD TVVPREDGER MFLEMEVFGR EALNVIACEG WERVERPETY  660
KQWHVRAMRS GLVQVPFDPS IMKTSLDKVH TFYHKDFVID QDNRWLLQGW KGRTVMALSV  720
WKPDSKA
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A1e-4235972525382Protein SCARECROW
5b3h_A1e-4235972524381Protein SCARECROW
5b3h_D1e-4235972524381Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapTp4g20010
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0046840.0AC004684.3 Arabidopsis thaliana chromosome 2 clone F13M22 map ve018, complete sequence.
GenBankCP0026850.0CP002685.1 Arabidopsis thaliana chromosome 2, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006410955.10.0scarecrow-like protein 9
SwissprotO809330.0SCL9_ARATH; Scarecrow-like protein 9
TrEMBLV4MIK60.0V4MIK6_EUTSA; Uncharacterized protein
STRINGXP_006410955.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM35827189
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G37650.10.0GRAS family protein