PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Araip.3K50N
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Dalbergieae; Arachis
Family Trihelix
Protein Properties Length: 480aa    MW: 55491.8 Da    PI: 7.5893
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Araip.3K50NgenomeNCGR_PGCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix93.22.7e-2945129187
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW+++e++aLi++r+em+ ++r+ + k+plWe+vs+k+ e g+ers+k+Ckek+en+ k+++++keg++++  ++ +t+++fdqlea
  Araip.3K50N  45 RWPREETMALIKIRSEMDGAFRDISPKAPLWEQVSRKLGELGYERSAKKCKEKFENIYKYHRRTKEGRSGK--RNGKTYRFFDQLEA 129
                  8********************************************************************96..55567*******85 PP

2trihelix1002e-31319405187
     trihelix   1 rWtkqevlaLiearremeerlrrgk.lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW+k+ev aLi++r++++e+l++++  k+plWeevs +m+  g+ rs+k+Ckekwen+nk++k++ke +k++ +e+s+tcpy+++lea
  Araip.3K50N 319 RWPKDEVEALIRLRTQVDEQLQQQQgNKGPLWEEVSTAMKGLGYDRSAKRCKEKWENINKYFKRMKEKNKRK-PEDSKTCPYYHHLEA 405
                  8*********************8543899*****************************************96.9***********985 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.53838102IPR017877Myb-like domain
SMARTSM007170.001942104IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.600.00144103IPR009057Homeodomain-like
PfamPF138377.1E-1944130No hitNo description
CDDcd122036.43E-2344109No hitNo description
SMARTSM007177.2E-4316379IPR001005SANT/Myb domain
CDDcd122036.47E-21318384No hitNo description
Gene3DG3DSA:1.10.10.602.2E-4318376IPR009057Homeodomain-like
PfamPF138375.8E-22318406No hitNo description
PROSITE profilePS500907.468318377IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 480 aa     Download sequence    Send to blast
MENSTLPENS IENRKVAVAA AEGDSVSSDG LKGEDGERNS SGANRWPREE TMALIKIRSE  60
MDGAFRDISP KAPLWEQVSR KLGELGYERS AKKCKEKFEN IYKYHRRTKE GRSGKRNGKT  120
YRFFDQLEAL DPHPNNNAVI QDAVPCSVRF PVTAMEHSSS ATSSYSSGGG EDEGEGRRRK  180
KKRRLRVFFE GLMREVLEKQ ESLQKKFMEV LDKCDQDRMA REQAWKTEEL ARIKKERELL  240
AQERSIAAAK DEAVMSFIRK FAENSNNNGA LQFPADNNNH LQEQEKEKEK EKEKEKEKEK  300
DEVGNGINVG NFVHMSSSRW PKDEVEALIR LRTQVDEQLQ QQQGNKGPLW EEVSTAMKGL  360
GYDRSAKRCK EKWENINKYF KRMKEKNKRK PEDSKTCPYY HHLEAIYSKK PKNNNKLDDN  420
DNDKKNELKP EELLMHIMNG QEERQQDPDQ DQSSSEDADR DNHNGYQMLD NSPSSIPIMS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1175184GRRRKKKRRL
2177182RRKKKR
3177183RRKKKRR
4177184RRKKKRRL
5179183KKKRR
6179184KKKRRL
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapAraip.3K50N
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016180760.10.0trihelix transcription factor GT-2
SwissprotQ391173e-64TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A444X5970.0A0A444X597_ARAHY; Uncharacterized protein
STRINGGLYMA09G19750.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF37334181
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.25e-62Trihelix family protein