PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Itr_sc001764.1_g00003.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Convolvulaceae; Ipomoeeae; Ipomoea
Family MYB_related
Protein Properties Length: 1593 aa    MW: 178153 Da    pI: 8.8074
Description MYB_related family protein
Gene Model
Gene Model ID            Type    Source  Coding Sequence
Itr_sc001764.1_g00003.1  genome  Kazusa  View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  28.3   4.1e-09  158    198  2          42
                              SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHH CS
          Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcks 42 
                              ++W++eE+  ++ + ++lG+g+W+ I++ + ++Rt+ q+ s
  Itr_sc001764.1_g00003.1 158 LAWSEEEHRVFLVGLEKLGKGDWRGISKQFVTTRTPAQLAS 198
                              68***********************************9976 PP
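
The alignment above can be summarised numerically. The following is a minimal sketch, assuming the usual HMMER convention for the middle line (a letter marks a residue identical to the HMM consensus, "+" marks a positively scoring substitution, a space marks anything else). The function name `midline_counts` is hypothetical and used only for illustration.

```python
# Three lines copied from the alignment above (leading labels and
# coordinates removed). Assumption: HMMER-style middle line.
CONSENSUS = "grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcks"
MIDLINE = "++W++eE+  ++ + ++lG+g+W+ I++ + ++Rt+ q+ s"
QUERY = "LAWSEEEHRVFLVGLEKLGKGDWRGISKQFVTTRTPAQLAS"

# All three rows must describe the same alignment columns.
assert len(CONSENSUS) == len(MIDLINE) == len(QUERY)


def midline_counts(midline: str) -> dict:
    """Count identical, positive and remaining columns in a middle line."""
    counts = {"identical": 0, "positive": 0, "other": 0}
    for ch in midline:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["positive"] += 1
        else:
            counts["other"] += 1
    return counts


counts = midline_counts(MIDLINE)
# Fraction of aligned columns where the query matches the consensus.
identity = counts["identical"] / len(QUERY)
print(counts, f"identity = {identity:.2f}")
```

The counts describe only this 41-column domain hit (residues 158 to 198), not the full 1593 aa protein.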

Protein Features
3D Structure
Database         Entry ID             E-value   Start  End   InterPro ID  Description
PROSITE profile  PS50158              9.191     3      18    IPR001878    Zinc finger, CCHC-type
SMART            SM00343              0.067     4      20    IPR001878    Zinc finger, CCHC-type
SMART            SM00717              9.8E-5    156    206   IPR001005    SANT/Myb domain
SuperFamily      SSF46689             7.26E-8   158    204   IPR009057    Homeodomain-like
CDD              cd00167              8.00E-7   159    206   No hit       No description
Gene3D           G3DSA:1.10.10.60     2.9E-8    159    199   IPR009057    Homeodomain-like
Pfam             PF00249              1.3E-6    159    198   IPR001005    SANT/Myb domain
PROSITE profile  PS50090              6.47      160    204   IPR017877    Myb-like domain
Pfam             PF14244              1.9E-16   280    326   IPR029472    Gag-polypeptide of LTR copia-type
Pfam             PF14223              2.6E-8    336    483   No hit       No description
SMART            SM00343              2.2       540    556   IPR001878    Zinc finger, CCHC-type
Pfam             PF13976              2.2E-7    764    836   IPR025724    GAG-pre-integrase domain
PROSITE profile  PS50994              17.587    838    1010  IPR001584    Integrase, catalytic core
Gene3D           G3DSA:3.30.420.10    2.2E-26   846    1002  IPR012337    Ribonuclease H-like domain
SuperFamily      SSF53098             3.49E-38  847    1004  IPR012337    Ribonuclease H-like domain
Pfam             PF00665              1.1E-19   852    960   IPR001584    Integrase, catalytic core
Pfam             PF07727              5.1E-81   1131   1345  IPR013103    Reverse transcriptase, RNA-dependent DNA polymerase
SuperFamily      SSF56672             3.56E-31  1134   1325  No hit       No description
SuperFamily      SSF56672             3.56E-31  1355   1536  No hit       No description
CDD              cd09272              3.07E-71  1430   1569  No hit       No description
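
The CCHC-type zinc finger hits in the table above (residues 3 to 18, 4 to 20 and 540 to 556) can be cross-checked against the sequence with a simple pattern search. This is a rough sketch only: it assumes the classical zinc-knuckle spacing C-X2-C-X4-H-X4-C, which is far cruder than the PROSITE and SMART profile models that produced the hits. The two fragments are copied from the protein sequence of this entry (residues 1 to 20 and 531 to 560), and the names `CCHC` and `find_cchc` are hypothetical.

```python
import re

# Assumption: classical zinc-knuckle spacing C-X2-C-X4-H-X4-C.
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")

# Sequence fragments keyed by the 1-based position of their first residue.
FRAGMENTS = {
    1: "MVRKCSNCGRAGHNSRTCNN",
    531: "TNTGGRNVPKCTYCGMLGHTIEKCFKKHGY",
}


def find_cchc(fragments: dict) -> list:
    """Return (start, end, motif) tuples in 1-based inclusive coordinates."""
    hits = []
    for offset, frag in sorted(fragments.items()):
        for m in CCHC.finditer(frag):
            start = offset + m.start()    # 0-based index -> 1-based position
            end = offset + m.end() - 1    # m.end() is exclusive
            hits.append((start, end, m.group()))
    return hits


for start, end, motif in find_cchc(FRAGMENTS):
    print(start, end, motif)
```

Both motif matches fall inside the reported feature intervals, which is consistent with the table but does not by itself confirm a functional zinc finger.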
Gene Ontology
GO Term     GO Category         GO Description
GO:0015074  Biological Process  DNA integration
GO:0003677  Molecular Function  DNA binding
GO:0008270  Molecular Function  zinc ion binding
Sequence
Protein Sequence    Length: 1593 aa
MVRKCSNCGR AGHNSRTCNN SCINHHHESK SSTSCSHTIS IKLFGVKLGI SQSSSSPTNI  60
LSSSSSPLTP AASLAAIRKS FSFECLSSPT ITNRGGTAHS GYLSDGLTTR NHHNKKGWTE  120
VVKGQVQHHV TKELLDKAEA VKGQVQHHVA RGLLGEGLAW SEEEHRVFLV GLEKLGKGDW  180
RGISKQFVTT RTPAQLASCY SYFTNHNELN ISFLSKALSA LLASYFSYFI MVSERYRSYA  240
AAVMMEEDGH GIEAENGGTS SNQTATNQHN DQFDDPFFVH ITENPNVSLV SPLLSELNYS  300
SWSRSMKIAL EVKNKFGFVN GSIPSPEEND PKFASWRRCN RIVSSWILRS VNPSIADGVM  360
YFDTASEIWS ALHKRYSQSD PHRISELQNE IYRNVQGNLT VNEYFTKSNA MWQQLNVLRP  420
LLLCECTPRC TCTLLMRMQK EREEDHIIRF LEGLNEEYET VKSGVLVMDP IPDMERVLNM  480
TLKLERKIKG SINQRSNEFT QANAIQNFPN QSTEEQSVVA VSALNNRKKF TNTGGRNVPK  540
CTYCGMLGHT IEKCFKKHGY PPGWVAGYKS KNKQFQDAPQ TSSASVNQVG DIGLSSDQFQ  600
RLVSILQNQN KGSQATSNAV MTVANSGIKS DFKDVEGNQN EGNLIPNLHI NAVLNLNTTW  660
ILDSGATDHI TCSLGYFETY HKIQGISVKL PNEEAVNVNC VGQIRLNENI VLENVLYIPS  720
FTFNIVSVSK LTKQTGCKLV LEADSCNIQG PLGKVDGFAK ERNGLYLISQ PPVVKMKIQS  780
VNKSANIQCN SLIAELWHNR LGHYPVNKIT SLNGIKSDFC YKQSAFVCDA CHLAKHKRSA  840
FPVSISRAEN CFDLIHMDVW GPFAVASLKG EHYFLTIVDD CSRFTWLHLM KTKSEVKGIF  900
QNFYNYVHTQ FTAKIKVVRT DNGSEFLMNS FFNEKGIVHQ KSCVYTPQQN GVAERKHQHV  960
LNVARALRFQ SGLPIKFWGH CVLHASYIIN RLPSDVNGGH APFELLTGKA VDYDQFKSFG  1020
CLCYGATVAQ GRNKFQPRAL RCVFLGFPAN VKGYILYDIT NSTVFVSRDV KFQEQIYPFK  1080
GQGPGVTAEF NKDERVIPSL PLVPTSVETD SFLEPVSRSG SEILAEEPVS RSGSVERYKA  1140
RLVAKGYTQQ LGVDYIETFS PVARMTTIKT FLAVAVAKGW DIQQLDINNA FLHGDLEEEV  1200
YMVLPPGFKS DRPNQVCKLL RSLYGLKQAS RQWNAKLTKF LQNNGFQQST ADPSLFTKTS  1260
QHSFMALLVY VDDILVAGTD NTQITKLKQL LDTSFRIKDL GKLNYFLGIE ASRNSSGLNL  1320
CQKKYTLEIL QENGFLDAKP AKTPCVPGQR LTHTDGMLLD RPDTFRRLVG KLMYLTNTRP  1380
DICYAVQQLS QFVDKPRDTH LVAAHRVLRY LKGAPGKGLF YDSKSQIKLQ GFSDSDWATC  1440
AETRKSITGY CVYLGESLIS WKTKKQATVS RSSSEAEYRA LASTVCEIQW LLYLLADLKA  1500
DSSIPIPLFC DNNSAVAIGE NYVFHERTKH IEIDCHVVRQ KVHEGVIKLL SIPSHKQIAD  1560
GFTKALTTPL FDTFHSKLGL QDLHAPAYGG MMK
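
The sequence block above is printed in groups of 10 residues with a running position counter at the end of each line, while domain coordinates in this entry are 1-based and inclusive. A minimal sketch of how such a block might be flattened and sliced is shown below. It uses only the first four lines of the block to stay short, and the helper names `flatten` and `region` are hypothetical.

```python
import re

# First four lines of the sequence block above, copied verbatim
# (residues 1 to 240), including the trailing position counters.
BLOCK = """\
MVRKCSNCGR AGHNSRTCNN SCINHHHESK SSTSCSHTIS IKLFGVKLGI SQSSSSPTNI  60
LSSSSSPLTP AASLAAIRKS FSFECLSSPT ITNRGGTAHS GYLSDGLTTR NHHNKKGWTE  120
VVKGQVQHHV TKELLDKAEA VKGQVQHHVA RGLLGEGLAW SEEEHRVFLV GLEKLGKGDW  180
RGISKQFVTT RTPAQLASCY SYFTNHNELN ISFLSKALSA LLASYFSYFI MVSERYRSYA  240
"""


def flatten(block: str) -> str:
    """Drop spaces, newlines and position counters; keep residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = flatten(BLOCK)
# Coordinates of the Myb_DNA-binding signature domain in this entry.
myb = region(seq, 158, 198)
print(len(seq), myb)
```

The extracted region should match the query row of the signature domain alignment, which gives a quick check that the coordinates were interpreted as 1-based and inclusive.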
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_019166977.1  0.0      PREDICTED: uncharacterized protein LOC109162749
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA239
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G56840.1  3e-20    MYB_related family protein