PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen03g022900.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family RAV
Protein Properties Length: 1835aa    MW: 208001 Da    PI: 7.0162
Description RAV family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Sopen03g022900.1  genome  spenn   View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    AP2     28.8   3.1e-09  46     94   1          55
               AP2  1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55
                      s++kGV    ++g W A+I+       + +r++lg+f t+ +Aa a++ a+ +l g
  Sopen03g022900.1 46 SKFKGVV-GQNNGHWGAQIYA------NhQRIWLGTFKTETDAAMAYDSAAIRLLG 94
                      799***6.6679******999......44**********99************987 PP

2    B3      89.7   2.2e-28  170    253  1          85
                       EEEE-..-HHHHTT-EE--HHH.HTT..---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE- CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh..ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld 85 
                       f+k+ltpsdv+k++rlv+pkk+a+++  +    ++ +++++d+s r W+++++y+k+s+++v+tkGW++Fvk++gL+++D++vF l 
  Sopen03g022900.1 170 FQKELTPSDVGKLNRLVIPKKYATKYfpQ---IQDEEMIFYDTSRRLWKFRYCYWKSSQSFVFTKGWNKFVKDKGLRAKDTIVFNLC 253
                       99************************633...35668***********************************************975 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
CDD              cd00018            1.27E-17  46     105  No hit       No description
SuperFamily      SSF54171           4.18E-12  46     97   IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  3.0E-14   47     96   IPR001471    AP2/ERF domain
SMART            SM00380            9.2E-15   47     109  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            15.633    47     103  IPR001471    AP2/ERF domain
Gene3D           G3DSA:2.40.330.10  2.2E-31   167    252  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          8.89E-27  168    255  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.21E-27  169    252  No hit       No description
PROSITE profile  PS50863            14.395    170    274  IPR003340    B3 DNA binding domain
SMART            SM01019            5.4E-22   170    274  IPR003340    B3 DNA binding domain
Pfam             PF02362            1.5E-25   170    253  IPR003340    B3 DNA binding domain
Pfam             PF13589            3.3E-11   523    645  No hit       No description
SuperFamily      SSF55874           7.15E-6   525    645  IPR003594    Histidine kinase-like ATPase, C-terminal domain
Gene3D           G3DSA:3.30.565.10  3.3E-6    526    704  IPR003594    Histidine kinase-like ATPase, C-terminal domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0000724  Biological Process  double-strand break repair via homologous recombination
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1835 aa
MEDETNLISS TTNVTVGDCD SSGSNHLLPK KRNINAREGK GCISGSKFKG VVGQNNGHWG  60
AQIYANHQRI WLGTFKTETD AAMAYDSAAI RLLGPDHSHR NLSWTNSTIQ EPNFQTQFST  120
EDILRMIKEG SYTSRFDEYL KDKFEDHFQS LKNLQKVNEG NAEFSYKQLF QKELTPSDVG  180
KLNRLVIPKK YATKYFPQIQ DEEMIFYDTS RRLWKFRYCY WKSSQSFVFT KGWNKFVKDK  240
GLRAKDTIVF NLCEFKNGTK ENCNAFVIDV VKSIDGNLAL NHHEQEETID DHEVGTQEFT  300
KAQPFDDDLV PVWLFGKQIV LCSSFSILPK TQPLSLFSLL LLCEFSGIYF LQIAICRKLQ  360
RKIKMSSKRH CEDFSTETPR KKPSRVLRIQ VDSDEEVGNN EGKVFYFRVL LPNGITLELQ  420
VPGPPSEMPV EDFVILVRRE YQNIGRRTDS PKPRRQINWT SKDLHFVDAF DNRITKTMDF  480
RKFKSNKSHM LRLCDGSVEA DKYENMWDLT PDTDLLKELP EEYTFETALA DLIDNSLQAV  540
WSKSTDQRRL ISLELTKSRI TIFDTGLGMD GSAENSIVKW GKMGASIHRS ARDRGIGGKP  600
PYLTPYFGMF GYGGPIASMH LGRRASVSSK TKECKKVYVL HLERDSLLRC SSSQQTWRTD  660
GNVRDPLEDE LRHSVDGSFT KVEIFYPKMR SESVQELQYK LKDIYFPYIQ CDEVSKTGKT  720
VMPIEFQVNG TNLAEIEGGE VATTNLLSCN GPEFVMQLSF QVKDSSGLKV GSGTKSSFEA  780
HARLRCVYFP VAQGKESIEV ILEKLEADGY GITENFETFS HVSVRRLGRL LPDARWSWLP  840
FMEPKLRKSD RAEVLKRCCF RVKCFIETDA GFNPTPSKTD LAHHHPFTIA LRNFGNKPSK  900
KENDVLIEIA KDGKKLSLLQ LEKLYQEWLF QMHDRYDEEI DCGEDQPTFV VVGPLHKKEL  960
GVSADVMRIH KAFQRKGITW KAGQKVKILK GAYRGFHKNN IFATLEFIIL EGWQGDSGGE  1020
ARIICRPLNV PAESGCRLTF DKGCACFEIR DSKSLPISVI DTGKCLSVDK TEWENQILKH  1080
QEKTTPSSID ILDAEQCLEL EIEGALPQDV DAGHEPPEEI TAVVRPVSFT SATASKNLDQ  1140
KYIMKENFVM TLEIKFKADE NEKEQHIYSG QLNPSSLKGF HGLYMFPLKK KLPNLFQTAG  1200
IYLFRFSLIE SCTISVKEVR VKALSEPASW ELVSDGKSTH SVRVGSCFPE VFSVACRDRF  1260
CNRIPFKSQT EIEMKLSSGG RAISSECSYD QYITHDSYTM KFKNVTIESS ELDMIRPSYN  1320
ATLHINSREN PFVVAIPCAV IPGPLQRILL RPVDFGKKLV PGMVLKELAL ETFDKYGNHM  1380
RKDEHIKLTL EGLHLLDKGN SFYKVDDHGC VNLSGTLKVT AGYGKLVSLS VLSGDEVVFK  1440
KEFQTDRRSL RVASKVPKVC AAGSHLEDVV FEVVNSAGEV DEDIDSEIED GHSHTLQIRQ  1500
DSLREEDNVR YSFHRGRCIV RSIPLPNNEG LFFFVASHSR FHELQTSIEV HVEKAVIQPR  1560
SPKKEILLLE ESNGKGPETV CHDSYDGRIM IFNDSCASMV LEDRQQKLGD DICRYGLCIR  1620
QCDANVESLS IKQSNIELEM SNLGAYIGLD SFHDLYYDKD VIMEKIEGKA DSAAAVIHKL  1680
LRSPKPEQLY LKYAHDILGV VALLGEVRTH KLSSMLSTYL GEDQMLAIVC KSRAAARALE  1740
NYQMDGNVNC ASALDILAAK LGISIKGRYL VICLEDIRPY KQGVSSDPQR ELAIPQPTLS  1800
NRETXXXXXX XXXXCEETQG SILYKSTPVA IYPSK
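
The Start and End columns of the signature-domain table can be checked against the sequence listing above. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (which agrees with the alignment rows in this record). The helper names `parse_numbered_sequence` and `domain_slice` are hypothetical and are not part of PlantTFDB or any library; only the first five sequence lines (residues 1-300) are included to keep the example short.

```python
def parse_numbered_sequence(block: str) -> str:
    """Join 10-residue groups into one string, dropping trailing position counters."""
    residues = []
    for line in block.strip().splitlines():
        # Keep alphabetic tokens (residue groups); skip numeric counters such as "60".
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First five lines of the protein sequence from this record (residues 1-300).
excerpt = """
MEDETNLISS TTNVTVGDCD SSGSNHLLPK KRNINAREGK GCISGSKFKG VVGQNNGHWG  60
AQIYANHQRI WLGTFKTETD AAMAYDSAAI RLLGPDHSHR NLSWTNSTIQ EPNFQTQFST  120
EDILRMIKEG SYTSRFDEYL KDKFEDHFQS LKNLQKVNEG NAEFSYKQLF QKELTPSDVG  180
KLNRLVIPKK YATKYFPQIQ DEEMIFYDTS RRLWKFRYCY WKSSQSFVFT KGWNKFVKDK  240
GLRAKDTIVF NLCEFKNGTK ENCNAFVIDV VKSIDGNLAL NHHEQEETID DHEVGTQEFT  300
"""

seq = parse_numbered_sequence(excerpt)
ap2 = domain_slice(seq, 46, 94)    # AP2 domain, Start 46, End 94
b3 = domain_slice(seq, 170, 253)   # B3 domain, Start 170, End 253

print(len(seq))
print(ap2)
print(b3)
```

Removing the gap characters from the query rows of the AP2 and B3 alignments and upper-casing them should give the same two strings, which is a quick way to confirm that the table coordinates and the sequence listing agree.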
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1wid_A  3e-25   169          251        13         101      DNA-binding protein RAV1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975442  0.0      HG975442.1 Solanum pennellii chromosome ch03, complete genome.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_025885705.1  0.0      structural maintenance of chromosomes flexible hinge domain-containing protein GMI1 isoform X1
Refseq  XP_025885713.1  0.0      structural maintenance of chromosomes flexible hinge domain-containing protein GMI1 isoform X9
TrEMBL  A0A3Q7FMZ2      0.0      A0A3Q7FMZ2_SOLLC; Uncharacterized protein
STRING  XP_009607726.1  0.0      (Nicotiana tomentosiformis)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G51120.1  4e-80    RAV family protein