PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Araha.30465s0001.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family GRAS
Protein Properties Length: 342aa    MW: 38315.7 Da    PI: 6.7928
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Araha.30465s0001.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS318.31.6e-97133338373
                  GRAS  38 mqRlaayfteALaarlarsvselykalppsetsekn..sseelaalklfsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQ 124
                           m+Rlaa+ft++L++   r  +++   l+p++ +++   ++++++a++l++++sP+++f++lta qaIleav+ e+r+Hi+D+di++G+Q
  Araha.30465s0001.1.p   1 MERLAAHFTNGLSKLFER--DNV---LRPQQHRDDVydQADVISAFELLQNMSPYVNFGYLTATQAILEAVKYERRIHIVDYDITEGVQ 84 
                           89***************9..444...4555555443559************************************************** PP

                  GRAS 125 WpaLlqaLasRpegpp..slRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlq 209
                           W++L+qaL s++ gp   +lRiT+++++++g  s  +++etg+rL+ fAe++g pf+++     +++++++++L++ +gEa+++n++l+
  Araha.30465s0001.1.p  85 WASLMQALVSKNTGPSaqHLRITALSRATNGkkSIAAVQETGRRLTAFAESIGQPFSYHH-CKLDMNAFSTSSLKLVRGEAVVINCMLH 172
                           ************665446*********999988999***********************9.788************************* PP

                  GRAS 210 lhrlldesvsleserdevLklvkslsPkvvvvveqeadh.nsesFlerflealeyysalfdsleaklpreseerikvErellgreivnv 297
                           l r+ +++ ++     ++L+  k+l+Pk+v++v++e+ + +++ Fl rf++ l+ +sa+fdslea+l+ ++ +r +vEr+++g+++ n+
  Araha.30465s0001.1.p 173 LPRFSNQTPNSVI---SFLSEAKTLNPKLVTLVHEEVGLmGNQGFLYRFMDLLHQFSAIFDSLEAGLSIANPARGFVERVFIGPWVINW 258
                           *****99888888...9*********************99************************************************* PP

                  GRAS 298 vacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee.sgslvlgWkdrpLvsvSaW 373
                           ++  +a+  ++ e++++W ++le+ GFkp+++s  + +qaklll+ ++ dgy vee  ++ lvlgWk+r+Lvs+S+W
  Araha.30465s0001.1.p 259 LTRITAD-DAEVESFASWPQWLETNGFKPMEVSFANRCQAKLLLSLFN-DGYIVEELgQNGLVLGWKSRRLVSASFW 333
                           ******9.999*************************************.******98789999************** PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF035145.4E-951333IPR005202Transcription factor GRAS
PROSITE profilePS5098545.6041314IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 342 aa     Download sequence    Send to blast
MERLAAHFTN GLSKLFERDN VLRPQQHRDD VYDQADVISA FELLQNMSPY VNFGYLTATQ  60
AILEAVKYER RIHIVDYDIT EGVQWASLMQ ALVSKNTGPS AQHLRITALS RATNGKKSIA  120
AVQETGRRLT AFAESIGQPF SYHHCKLDMN AFSTSSLKLV RGEAVVINCM LHLPRFSNQT  180
PNSVISFLSE AKTLNPKLVT LVHEEVGLMG NQGFLYRFMD LLHQFSAIFD SLEAGLSIAN  240
PARGFVERVF IGPWVINWLT RITADDAEVE SFASWPQWLE TNGFKPMEVS FANRCQAKLL  300
LSLFNDGYIV EELGQNGLVL GWKSRRLVSA SFWASCESSN L*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3h_A6e-432433387377Protein SCARECROW
5b3h_D6e-432433387377Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapAraha.30465s0001.1.p
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAL0802520.0AL080252.2 Arabidopsis thaliana DNA chromosome 4, BAC clone T12G13, partial sequence (ESSA project).
GenBankAL1615100.0AL161510.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 22.
GenBankCP0026870.0CP002687.1 Arabidopsis thaliana chromosome 4 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002872385.20.0scarecrow-like protein 26
SwissprotQ9SUF50.0SCL26_ARATH; Scarecrow-like protein 26
TrEMBLD7M8D20.0D7M8D2_ARALL; Scarecrow transcription factor family protein
STRINGscaffold_603155.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM99142736
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G08250.10.0GRAS family protein